My name is Zhiyu Chen (陈知雨). Now I am working as an applied scientist at Amazon in Seattle, US. My first name “Zhiyu” comes from an ancient Chinese poem (好雨知时节), which means good rain knows the best time to fall. I received my PhD from Lehigh University at 2022 with Prof. Brian D. Davison. My research interests include data mining, machine learning, natural language processing and information retrieval.
Hi, I'm |

About Me
Education
Ph.D in Computer Science
Lehigh University, Bethlehem, PA, USA
2015 - 2022 | GPA: 3.97/4.0
Exchange Program
Ecole Supérieure d’Ingénieurs Léonard de Vinci, Paris, France
2014 - 2015 | GPA: 15.75/20
B.E. in Computer Science
Nanjing University of Aeronautics and Astronautics, Nanjing, China
2011 - 2015 | GPA: 4.2/5.0 (ranking: 1/94)
Past Experience
Applied Scientist Intern
Amazon, Seattle, WA, US (remote)
05/2021 - 09/2021
- Proposed a reinforcement learning method for conversational question answering, accepted by EMNLP Industry Track 2022.
- Mentors/Managers: Jie Zhao, Anjie Fang, Besnik Fetahu, Oleg Rokhlenko, Shervin Malmasi
Applied Science Intern
Zhuiyi Technology, Shenzhen, Guangdong, China
05/2019 - 09/2019
- Led a team solving SuperGLUE benchmark tasks, achieving a 2nd-place ranking in our submission, covered in China Daily (中国日报).
- Manager: Yinan Xu
Applied Science Intern
Bloomberg, Princeton, NJ, USA
06/2016 - 08/2016
- Analysis of Bloomberg Data Licence usage patterns.
- Manager: Michael Liebman
Publications
- Wizard of Shopping: Target-Oriented E-commerce Dialogue Generation with Decision Tree Branching. ACL 2025
- Approximate Vector Set Search: A Bio-Inspired Approach for High-Dimensional Spaces. ICDE 2025
- Joinable Search Over Multi-Source Spatial Datasets: Overlap, Coverage, and Efficiency. ICDE 2025
- Identifying High Consideration E-Commerce Search Queries. EMNLP 2024 (Industry)
- Unbiased Learning-to-Rank Needs Unconfounded Propensity Estimation. SIGIR 2024
- Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data. LREC-COLING 2024
- Training-free Optimization of Generative Recommender Systems using Large Language Model Optimizers. ACL 2024
- InstructPTS: Instruction-Tuning LLMs for Product Title Summarization. EMNLP 2023 (Industry)
- Multi-Coner V2: A Large Multilingual Dataset for Fine-grained and Noisy Named Entity Recognition. EMNLP 2023 (Findings)
- Generate-then-Retrieve: Intent-Aware FAQ Retrieval in Product Search. ACL 2023
- Answering Unanswered Questions through Semantic Reformulations in Spoken QA. ACL 2023
- Model-based Unbiased Learning to Rank. WSDM 2023
- Reinforced Question Rewriting for Conversational Question Answering. EMNLP 2022
- StruBERT: Structure-aware BERT for Table Search and Matching. WWW 2022
- MGNETS: Multi-Graph Neural Networks for Table Search. CIKM 2021
- Neural Ranking Models for Document Retrieval. Information Retrieval Journal 2021
- WTR: A Test Collection for Web Table Retrieval. SIGIR 2021
- A Hybrid Deep Model for Learning to Rank Data Tables. IEEE BigData 2020
- Relational Graph Embeddings for Table Retrieval. BigGraphs 2020 (at IEEE BigData)
- Towards Knowledge Acquisition of Metadata on AI Progress. ISWC 2020 (Demo)
- Table Search Using a Deep Contextualized Language Model. SIGIR 2020
- Pretrained Generalized Autoregressive Model with Adaptive Probabilistic Label Clusters for Extreme Multi-label Text Classification. ICML 2020
- Leveraging Schema Labels to Enhance Dataset Search. ECIR 2020
- Generating Schema Labels through Dataset Content Analysis. WWW 2018 (Profiles & Data:Search Workshop) - Best Paper Award
Services
Area Chair
- ACL ARR 2023, 2025
Program Committee/Reviewer
- WSDM 2022-2026
- NeurIPS 2023-2025
- SIGIR 2022-2025
- CIKM 2020, 2025
- KDD 2022-2025
- ICLR 2024-2025
- TheWebConf 2022-2025
- COLING 2022, 2025
- NAACL 2025
- SemEval 2024-2025
- ECML PKDD 2023-2024
- EMNLP 2023-2024
- IJCAI 2023
- ACL 2023
- TKDE 2021
Honors and Awards
- 2021, ACM SIGIR Student Travel Grant
- 2021, Rossin Professional Development Program Award at Lehigh University
- 2020, ISWC Student Grant
- 2020, ACM SIGIR Student Travel Grant
- 2018, Best Paper Award of International Workshop on Profiles&Data:Search’18
- 2018, ACM SIGIR Student Travel Grant
- 2015, RCEAS Dean’s Fellowship
- 2014, Chinese Government Scholarship from China Scholarship Council
- 2013, First Prize, National Mathematical Contest in Modeling (Jiangsu Division)
- 2012, Honorable Mention for Social Practice
- 2012, Second Prize, 11-th Higher Mathematics Competition of Jiangsu Province
- 2011-2014, Student Scholarship
- 2011-2014, Merit Student
Hobbies
Coffee ☕
I make coffee every day.
Close-up Magic 🃏
My favorite two magicians: Richard Turner and Lennart Green.
Photography 📷
Capturing moments.