2019年-2023年,蒙特利尔大学,博士
2016年-2019年,中国人民大学信息学院,硕士
2012年-2016年,中国人民大学信息学院,学士
2023年至今,中国人民大学高瓴人工智能学院,博士后,合作导师:窦志成教授
大语言模型、信息检索、对话系统
[1] Yutao Zhu, Huaying Yuan, Shuting Wang, Jiongnan Liu, Wenhan Liu, Chenlong Deng, Haonan Chen, Zheng Liu, Zhicheng Dou, and Ji-Rong Wen. Large Language Models for Information Retrieval: A Survey. ACM Transactions on Information Systems (TOIS), 2025. [PDF] [GitHub]
[2] Yutao Zhu, Zhaoheng Huang, Zhicheng Dou, and Ji-Rong Wen. One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2025. [PDF] [GitHub]
[3] Xiaoxi Li, Jiajie Jin, Guanting Dong, Hongjin Qian, Yongkang Wu, Ji-Rong Wen, Yutao Zhu (通讯作者), and Zhicheng Dou. WebThinker: Empowering Large Reasoning Models with Deep Research Capability. Annual Conference on Neural Information Processing Systems (NeurIPS), 2025. [PDF] [GitHub]
[4] Jiongnan Liu, Yutao Zhu (通讯作者), Shuting Wang, Xiaochi Wei, Erxue Min, Yu Lu, Shuaiqiang Wang, Dawei Yin, and Zhicheng Dou. LLMs + Persona-Plug = Personalized LLMs. Annual Meeting of the Association for Computational Linguistics (ACL), 2025. [PDF]
[5] Jiajie Jin, Xiaoxi Li, Guanting Dong, Yuyao Zhang, Yutao Zhu (通讯作者), Yongkang Wu, Zhonghua Li, Ye Qi, and Zhicheng Dou. Hierarchical Document Refinement for Long-context Retrieval-augmented Generation. Annual Meeting of the Association for Computational Linguistics (ACL), 2025. [PDF]
[6] Yuying Shang, Xinyi Zeng, Yutao Zhu (共同第一作者), Xiao Yang, Zhengwei Fang, Jingyuan Zhang, Jiawei Chen, Zinan Liu, and Yu Tian. From Pixels to Tokens: Revisiting Object Hallucinations in Large Vision-Language Models. ACM International Conference on Multimedia (MM), 2025. [PDF]
[7] Yutao Zhu, Jiajie Jin, Hongjin Qian, Zheng Liu, Zhicheng Dou, and Ji-Rong Wen. Single LLM, Multiple Roles: A Unified Retrieval-Augmented Generation Framework Using Role-Specific Token Optimization. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025. [PDF] [GitHub]
[8] Zhaoheng Huang, Yutao Zhu (通讯作者), Ji-Rong Wen, and Zhicheng Dou. Enhancing LLM Text Detection with Retrieved Contexts and Logits Distribution Consistency. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025. [PDF]
[9] Jiajie Jin, Yutao Zhu (通讯作者), Xinyu Yang, Chenghao Zhang, and Zhicheng Dou. FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research. Companion Proceedings of the ACM on Web Conference (WWW Resource), 2025. [PDF] [GitHub]
[10] Zhaoheng Huang, Yutao Zhu (通讯作者), Ji-Rong Wen, and Zhicheng Dou. UFO: a Unified and Flexible Framework for Evaluating Factuality of Large Language Models. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI Demo), 2026. [PDF]
[11] Yutao Zhu, Peitian Zhang, Chenghao Zhang, Yifei Chen, Binyu Xie, Zheng Liu, Ji-Rong Wen, and Zhicheng Dou. INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning. Annual Meeting of the Association for Computational Linguistics (ACL), 2024. [PDF] [GitHub]
[12] Zhaoheng Huang, Yutao Zhu (通讯作者), Zhicheng Dou, and Ji-Rong Wen. CAGS: Context-Aware Document Ranking with Contrastive Graph Sampling. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2024. [PDF]
[13] Yutao Zhu, Ruihua Song, Jian-Yun Nie, Pan Du, Zhicheng Dou, and Jin Zhou. Leveraging Narrative to Generate Movie Script. ACM Transactions on Information Systems (TOIS), 2022. [PDF] [GitHub]
[14] Yutao Zhu, Jian-Yun Nie, Yixuan Su, Haonan Chen, Xinyu Zhang, and Zhicheng Dou. From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking. ACM International Conference on Information and Knowledge Management (CIKM), 2022. [PDF] [GitHub]
[15] Yutao Zhu, Kun Zhou, Jian-Yun Nie, Shengchao Liu, and Zhicheng Dou. Neural Sentence Ordering Based on Constraint Graphs. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2021. [PDF] [GitHub]
[16] Yutao Zhu, Jian-Yun Nie, Zhicheng Dou, Zhengyi Ma, Xinyu Zhang, Pan Du, Xiaochen Zuo, and Hao Jiang. Contrastive Learning of User Behavior Sequence for Context-Aware Document Ranking. ACM International Conference on Information and Knowledge Management (CIKM), 2021. [PDF] [GitHub]
[17] Yutao Zhu, Jian-Yun Nie, Kun Zhou, Pan Du, and Zhicheng Dou. Content Selection Network for Document-grounded Retrieval-based Chatbots. European Conference on Information Retrieval (ECIR), 2021. [PDF] [GitHub]
[18] Yutao Zhu, Jian-Yun Nie, Kun Zhou, Pan Du, Hao Jiang, and Zhicheng Dou. Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals. ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR Short), 2021. [PDF] [GitHub]
[19] Yutao Zhu, Ruihua Song, Zhicheng Dou, Jian-Yun Nie, and Jin Zhou. ScriptWriter: Narrative-Guided Script Generation. Annual Meeting of the Association for Computational Linguistics (ACL), 2020. [PDF] [GitHub]
[20] Yutao Zhu, Zhicheng Dou, Jian-Yun Nie, and Ji-Rong Wen. ReBoost: A Retrieval-Boosted Sequence-to-Sequence Model for Neural Response Generation. Information Retrieval Journal, 2019. [PDF] [GitHub]
2019 谷歌卓越博士生奖学金
2016 北京市优秀毕业生
中国中文信息学会信息检索专业委员会委员
中国中文信息学会青年工作委员会委员
程序委员会成员:ACL、NeurIPS、ICML、ICLR、SIGIR、WWW、SIGKDD、AAAI、EMNLP、CIKM、WSDM
期刊审稿人:PNAS、TOIS、TKDD、计算机学报、KAIS、TALLIP、Computing Surveys
邮箱:ytz@ruc.edu.cn
个人网页:https://daod.github.io/