My name is Zhiyu Chen (陈知雨). Now I am working as a senior applied scientist at Amazon in Seattle, US. My first name “Zhiyu” comes from an ancient Chinese poem (好雨知时节), which means good rain knows the best time to fall. I received my PhD from Lehigh University at 2022 with Prof. Brian D. Davison. My research interests include data mining, machine learning, natural language processing and information retrieval.
Hi, I'm |

About Me
Education

Ph.D in Computer Science
Lehigh University, Bethlehem, PA, USA
2015 - 2022 | GPA: 3.97/4.0

Exchange Program
Ecole Supérieure d'Ingénieurs Léonard de Vinci, Paris, France
2014 - 2015 | GPA: 15.75/20

B.E. in Computer Science
Nanjing University of Aeronautics and Astronautics, Nanjing, China
2011 - 2015 | GPA: 4.2/5.0 (ranking: 1/94)
Past Experience

Applied Scientist Intern
Amazon, Seattle, WA, US (remote)
05/2021 - 09/2021
- Proposed a reinforcement learning method for conversational question answering, accepted by EMNLP Industry Track 2022.
- Mentors/Managers: Jie Zhao, Anjie Fang, Besnik Fetahu, Oleg Rokhlenko, Shervin Malmasi

Applied Science Intern
Zhuiyi Technology, Shenzhen, Guangdong, China
05/2019 - 09/2019
- Led a team solving SuperGLUE benchmark tasks, achieving a 2nd-place ranking in our submission, covered in China Daily (中国日报) and Sina Tech (新浪科技).
- Manager: Yinan Xu

Applied Science Intern
Bloomberg, Princeton, NJ, USA
06/2016 - 08/2016
- Analysis of Bloomberg Data Licence usage patterns.
- Manager: Michael Liebman
Publications
- Efficient Low-Rank Index Routing for High-Dimensional Approximate Nearest Neighbor Search. Information Processing & Management, 2026
- Dependency Relationships-Enhanced Attentive Group Recommendation in HINs. World Wide Web Journal, 2025
- LLM-based Dialogue Labeling for Multiturn Adaptive RAG. EMNLP 2025 (Industry)
- REIC: RAG-Enhanced Intent Classification at Scale. EMNLP 2025 (Industry)
- Wizard of Shopping: Target-Oriented E-commerce Dialogue Generation with Decision Tree Branching. ACL 2025
- Approximate Vector Set Search: A Bio-Inspired Approach for High-Dimensional Spaces. ICDE 2025
- Joinable Search Over Multi-Source Spatial Datasets: Overlap, Coverage, and Efficiency. ICDE 2025
- Identifying High Consideration E-Commerce Search Queries. EMNLP 2024 (Industry)
- Unbiased Learning-to-Rank Needs Unconfounded Propensity Estimation. SIGIR 2024
- Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data. LREC-COLING 2024
- Training-free Optimization of Generative Recommender Systems using Large Language Model Optimizers. ACL 2024
- InstructPTS: Instruction-Tuning LLMs for Product Title Summarization. EMNLP 2023 (Industry)
- Multi-Coner V2: A Large Multilingual Dataset for Fine-grained and Noisy Named Entity Recognition. EMNLP 2023 (Findings)
- Generate-then-Retrieve: Intent-Aware FAQ Retrieval in Product Search. ACL 2023
- Answering Unanswered Questions through Semantic Reformulations in Spoken QA. ACL 2023
- Model-based Unbiased Learning to Rank. WSDM 2023
- Reinforced Question Rewriting for Conversational Question Answering. EMNLP 2022
- StruBERT: Structure-aware BERT for Table Search and Matching. WWW 2022
- MGNETS: Multi-Graph Neural Networks for Table Search. CIKM 2021
- Neural Ranking Models for Document Retrieval. Information Retrieval Journal 2021
- WTR: A Test Collection for Web Table Retrieval. SIGIR 2021
- A Hybrid Deep Model for Learning to Rank Data Tables. IEEE BigData 2020
- Relational Graph Embeddings for Table Retrieval. BigGraphs 2020 (at IEEE BigData)
- Towards Knowledge Acquisition of Metadata on AI Progress. ISWC 2020 (Demo)
- Table Search Using a Deep Contextualized Language Model. SIGIR 2020
- Pretrained Generalized Autoregressive Model with Adaptive Probabilistic Label Clusters for Extreme Multi-label Text Classification. ICML 2020
- Leveraging Schema Labels to Enhance Dataset Search. ECIR 2020
- Generating Schema Labels through Dataset Content Analysis. WWW 2018 (Profiles & Data:Search Workshop) - Best Paper Award
Services
Area Chair
- ACL ARR 2023, 2025
Program Committee/Reviewer
- WSDM 2022-2026
- NeurIPS 2023-2025
- SIGIR 2022-2025
- CIKM 2020, 2025
- KDD 2022-2025
- ICLR 2024-2025
- TheWebConf 2022-2025
- COLING 2022, 2025
- NAACL 2025
- SemEval 2024-2025
- ECML PKDD 2023-2024
- EMNLP 2023-2024
- IJCAI 2023
- ACL 2023
- TKDE 2021
Honors and Awards
- 2021, ACM SIGIR Student Travel Grant
- 2021, Rossin Professional Development Program Award at Lehigh University
- 2020, ISWC Student Grant
- 2020, ACM SIGIR Student Travel Grant
- 2018, Best Paper Award of International Workshop on Profiles&Data:Search’18
- 2018, ACM SIGIR Student Travel Grant
- 2015, RCEAS Dean’s Fellowship
- 2014, Chinese Government Scholarship from China Scholarship Council
- 2013, First Prize, National Mathematical Contest in Modeling (Jiangsu Division)
- 2012, Honorable Mention for Social Practice
- 2012, Second Prize, 11-th Higher Mathematics Competition of Jiangsu Province
- 2011-2014, Student Scholarship
- 2011-2014, Merit Student
Hobbies
Coffee ☕
I make coffee every day.
Close-up Magic 🃏
My favorite two magicians: Richard Turner and Lennart Green.
Photography 📷
Capturing moments.