EN

人才培养导师组

窦志成 教授(长聘副教授)
中国人民大学高瓴人工智能学院副院长、教授,北京智源人工智能研究院“智能信息检索与挖掘”方向项目经理,基于大数据文科综合训练国家级虚拟仿真实验教学中心执行主任。2008 至 2014 年在微软亚洲研究院工作,2014 年开始在中国人民大学任教。主要研究方向为智能信息检索、自然语言处理、大数据分析。已在国际知名学术会议和期刊上(如 SIGIR、WWW、CIKM、WSDM、ACL、EMNLP、TKDE 等)发表论文 50余篇,获 SIGIR 2013 最佳论文提名奖,AIRS 2012 最佳论文奖。曾担任信息检索领域顶级会议 SIGIR 的程序委员会主席(2019 短文),亚洲信息检索学术会议 AIRS 大会主席(2016)、程序委员会主席(2017)和执委会主席(2018),全国信息检索学术会议CCIR 程序委员会主席(2020)等。任多个国际学术会议和期刊的程序委员会委员和审稿人,任中国计算机学会大数据专家委员会副秘书长、中文信息学会信息检索专委会执行委员。除学术研究外,窦志成教授还乐于将研究想法实现成可运行的系统,亲自动手开发了包括时事探针(http://playbigdata.ruc.edu.cn/)在内的多个系统,拥有多项发明专利。

个人主页: http://playbigdata.ruc.edu.cn/dou/

详细资料

教育经历

1999-2008 南开大学 本科-硕士-博士

工作经历

2008年-2014年 微软研究院 研究员

2014年至今 中国人民大学

研究方向

信息检索,自然语言处理,数据挖掘,大数据,信息抽取,机器学习

讲授课程

数据结构

大数据分析导论

网络群体与市场

计算机科学研究方法概论

互联网文本分析

程序设计实践

智能信息检索

对学生的培养要求

具有丰富的学生培养和指导经验,在微软亚洲研究院工作6年多的时间内,先后指导20多个实习生。

报名要求:

读研究生的目的:想要在硕士生或博士生阶段培养自己的项目开发或者科研能力,为将来的工作或进一步深造打好基础,而不是仅仅为了拿到研究生学历或硕士博士学位;

态度:踏实、勤奋、做事有责任心,能够认真对待老师分配给的项目或者研究课题;

基础:具有一定的编程开发动手能力,具有一定的自我学习能力,能够将研究想法编程实现;

对学生的培养:

能力培养:本着对学生负责的态度,同时培养学生的系统开发(编程、系统设计、项目管理)和科学研究能力(论文阅读、工作调研、问题分析、方法设计、实验分析、论文写作等),为结合学生的特长和职业规划,为不同学生制定不同的能力培养计划;

素质培养:培养学生做事的态度,锻炼语言沟通能力,增强团队合作意识;

欢迎各位有意向攻读硕士或博士学位的同学报考!

科研项目

基于深度学习的个性化搜索技术研究,自然科学基金面上项目

基于法律法规的司法解释文件核查关键技术研究,国家重点研发计划课题

信息检索中搜索结果个性化和多样化融合技术研究,自然科学基金青年项目

交互式文本数据多维分析方法研究,北京理工大学

基于大规模裁判文书和法律论坛数据的中国法制现状研究,中国法学会

基于UCL的中文信息处理应用理论体系,中国电子技术标准化研究院

互联网演艺设备大数据采集、抽取和检索技术研究,文化部科技文化提升项目

农业互联网大数据采集、分析与展示合作项目,北京金禾天成科技有限公司

科研成果

Xiaojie Wang, Zhicheng Dou, Tetsuya Sakai, and Ji-Rong Wen. Search Result Diversity Evaluation based on Intent Hierarchies, IEEE Trans. Knowl. Data Eng., 30(1), 156-169, 2018 CCF A corresponding author

Zhengbao Jiang, Zhicheng Dou, Ji-Rong Wen, Wayne Xin Zhao, Jian-Yun Nie, Ming Yue. Supervised Search Result Diversification via Subtopic Attention (Accepted) CCF A corresponding author Online

Zhengbao Jiang, Ji-Rong Wen, Zhicheng Dou, Wayne Xin Zhao, Jian-Yun Nie, Ming Yue. Learning to Diversify Search Results via Subtopic Attention. To appear in SIGIR 2017 CCF A corresponding author

Zhengbao Jiang, Zhicheng Dou, Ji-Rong Wen: Generating Query Facets Using Knowledge Bases. IEEE Trans. Knowl. Data Eng. 29(2): 315-329 (2017) CCF A corresponding author

窦志成,江政宝, 李谨秀,张宜春,文继荣. 基于词项图分析的查询分面挖掘方法[J]. 计算机学报, 2017, 40(3):556-569. first author

胡莎,窦志成,文继荣.论子话题粒度对搜索结果多样化算法的影响[J].中文信息学院, 2014, 31(4): 165-173.

Won-Kyung Sung, Hanmin Jung, Shuo Xu, Krisana Chinnasarn, Kazutoshi Sumiya, Jeonghoon Lee, Zhicheng Dou, Grace Hui Yang, Young-Guk Ha, Seungbock Lee: Information Retrieval Technology - 13th Asia Information Retrieval Societies Conference, AIRS 2017, Jeju Island, South Korea, November 22-24, 2017, Proceedings. Lecture Notes in Computer Science 10648, Springer 2017, ISBN 978-3-319-70144-8

Xiaojie Wang, Zhicheng Dou, Tetsuya Sakai, and Ji-Rong Wen. Evaluating Search Result Diversity using Intent Hierarchies. In Proceedings of SIGIR, 2016. CCF A corresponding author

Zhicheng Dou, Zhengbao Jiang, Sha Hu, Ji-Rong Wen, Ruihua Song: Automatically Mining Facets for Queries from Their Search Results. IEEE Trans. Knowl. Data Eng. (TKDE) 28(2):385-397 (2016) CCF A first author

Sha Hu, Ji-Rong Wen, Zhicheng Dou, Shuo Shang. Following the dynamic block on the Web. World Wide Web 19(6): 1077-1101 (2016)

Takehiro Yamamoto, Yiqun Liu, Min Zhang, Zhicheng Dou, Ke Zhou, Ilya Markov, Makoto P. Kato, Hiroaki Ohshima, Sumio Fujita. Overview of the NTCIR-12 IMine-2 Task. NTCIR 2016

Ming Yue, Zhicheng Dou, Sha Hu, Jinxiu Li, Xiao-Jie Wang, Ji-Rong Wen. RUCIR at NTCIR-12 IMINE-2 Task. NTCIR 2016

Shaoping Ma, Ji-Rong Wen, Yiqun Liu, Zhicheng Dou, Min Zhang, Yi Chang, Xin Zhao. Information Retrieval Technology - 12th Asia Information Retrieval Societies Conference, AIRS 2016, Beijing, China, November 30 - December 2, 2016, Proceedings. Lecture Notes in Computer Science 9994, Springer 2016, ISBN 978-3-319-48050-3

Zhongqi Lu, Zhicheng Dou, Xing Xie, Jianxun Lian, Qiang Yang. Content-based Collaborative Filtering for News Topic Recommendation. In Proceedings of Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2015), Austin Texas, USA, Jan 25-29, 2015. CCF A corresponding author

Sha Hu, Zhicheng Dou, Xiaojie Wang, Tetsuya Sakai, and Ji-Rong Wen. 2015. Search Result Diversification Based on Hierarchical Intents. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (CIKM \'15). ACM, New York, NY, USA, 63-72. DOI="http://dx.doi.org/10.1145/2806416.2806455" CCF B corresponding author

Sha Hu, Zhicheng Dou, Xiao-Jie Wang, Ji-Rong Wen: Search Result Diversification Based on Query Facets. J. Comput. Sci. Technol. (JCST) 30(4):888-901 (2015)

窦志成. 文本大数据分析技术的机遇与挑战[J]. 金融电子化, 2015(11):59-61.

窦志成, 文继荣. 大数据时代的互联网分析引擎[J]. 大数据, 2015(3).36-47 (2015-09-20)

Yiqun Liu, Ruihua Song, Min Zhang, Zhicheng Dou, Takehiro Yamamoto, Makoto Kato, Hiroaki Ohshima, Ke Zhou. Overview of the NTCIR-11 IMine Task. Proceedings of the 11th NTCIR conference.

Fei Chen, Yiqun Liu, Zhicheng Dou, Keyang Xu, Yujie Cao, Min Zhang, and Shaoping Ma, Revisiting the Evaluation of Diversified Search Evaluation Metrics with User Preferences. Proceedings of the 10th Asia Information Retrieval Society Conference (AIRS 2014)

Jingfei Li, Dawei Song, Peng Zhang, Ji-Rong Wen, and Zhicheng Dou, Personalizing Web Search Results Based on Subspace Projection, Proceedings of the 10th Asia Information Retrieval Society Conference (AIRS 2014)

Shu Tang, Zhicheng Dou, Xing Xie, and Jun He, Detecting and Monitoring Dynamic Content Blocks of a Web Page by Merging its Historical Versions, in SIGIR 2014 Workshop on Temporal, Social and Spatially-aware Information Access (TAIA2014), 2014

2013

Xiao Ding, Zhicheng Dou, Bing Qin, Ting Liu, and Ji-Rong Wen, Improving Web Search Ranking by Incorporating Structured Annotation of Queries, in Proceedings of EMNLP 2013, pages 468-478, October 2013 CCF B corresponding author

Kosetsu Tsukuda, Tetsuya Sakai, Zhicheng Dou, and Katsumi Tanaka, Estimating Intent Types for Search Result Diversification, in Information Retrieval Technology, pages 25-37, Springer Berlin Heidelberg, 2013

Ke Zhou, Tetsuya Sakai, Mounia Lalmas, Zhicheng Dou, and Joemon M. Jose, Evaluating Heterogeneous Information Access, in ACM SIGIR 2013 Workshop on Modeling User Behavior for Information Access Evaluation,

Qinglei Wang, Yanan Qian, Ruihua Song, Zhicheng Dou, Fan Zhang, Tetsuya Sakai, and Qinghua Zheng, Mining Subtopics from Text Fragments for a Web Query, in Information Retrieval 16(4) pages 484-503, 2013

Tetsuya Sakai and Zhicheng Dou, Summaries, Ranked Retrieval and Sessions: A Unified Framework for Information Access Evaluation, in Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2013), pages 473-482, ACM, 2013 (The Best Paper Runner-Up Award) CCF A

Tetsuya Sakai, Zhicheng Dou, and Carles Clarke, The Impact of Intent Selection on Diversified Search Evaluation, in Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2013), pages 921-924, ACM, 2013 CCF A

Tetsuya Sakai, Zhicheng Dou, Takehiro Yamamoto, Yiqun Liu, Min Zhang, Makoto Kato, Ruihua Song, and Mayu Iwata, Summary of the NTCIR-10 INTENT-2 Task: Subtopic Mining and Search Result Diversification, in Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2013), pages 761 - 764, ACM, 2013 CCF A

Tetsuya Sakai, Zhicheng Dou, Takehiro Yamamoto, Yiqun Liu, Min Zhang, and Ruihua Song, Overview of the NTCIR-10 INTENT-2 Task, in Proceedings of the 10th NTCIR Conference, pages 94-123, June 18-21, 2013

Kosetsu Tsukuda, Zhicheng Dou, and Tetsuya Sakai, Microsoft Research Asia at the NTCIR-10 Intent Task, in Proceedings of the 10th NTCIR Conference, June 2013

Kazuya Narita, Tetsuya Sakai, Zhicheng Dou, and Young-In Song, MSRA at NTCIR-10 1CLICK-2, in Proceedings of the 10th NTCIR Conference, 2013

Tetsuya Sakai, Zhicheng Dou, Ruihua song, and Noriko Kando, The Reusability of a Diversified Search Test Collection, in Information Retrieval Technology (AIRS 2012), pages 26-38, Springer Berlin Heidelberg, 20 December 2012 (The Best Paper Award)

2011

Zhicheng Dou, Sha Hu, Kun Chen, Ruihua Song, and Ji-Rong Wen, Multi-dimensional Search Result Diversification, in Proceedings of the fourth ACM international conference on Web search and data mining (WSDM 2011), pages 475-484, ACM, February 2011 CCF B first author

Zhicheng Dou, Finding Dimensions for Queries, in Proceedings of the 20th ACM international conference on Information and knowledge management (CIKM 2011), pages 1311-1320, ACM, 2011 CCF B first author

Jialong Han, Qinglei Wang, Naoki Orii, Zhicheng Dou, Tetsuya Sakai, and Ruihua Song, Microsoft Research Asia at the NTCIR-9 Intent Task, in Proceedings of the 10th NTCIR Conference (NTCIR-9), National Institute of Informatics, 2011

Tetsuya Sakai, Nick Craswell, Ruihua Song, Stephen Robertson, Zhicheng Dou, and Chin-Yew Lin, Simple Evaluation Metrics for Diversified Search Results, in Proceedings of the Third International Workshop on Evaluating Information Access (EVIA), Volumn 26, pages 27, National Institute of Informatics, June 2010

Ruihua Song, Zhicheng Dou, Hsiao-Wuen Hon, and Yong Yu, Learning Query Ambiguity Models by Using Search Logs, Journal of Computer Science and Technology, 25(4), pages 782-738, Springer, July 2010

Zhicheng Dou, Kun Chen, Ruihua Song, Yunxiao Ma, Shuming Shi, and Ji-Rong Wen, Microsoft Research Asia at the Web Track of TREC 2009, in Proceedings of TREC 2009, November 2009

Ji-Rong Wen, Zhicheng Dou, and Ruihua Song, Personalized Web Search, in Encyclopedia of Database Systems, pages 2099-2103, Springer-Verlag, New York, USA, September 2009

Zhicheng Dou, Ruihua Song, Jian-Yun Nie, and Ji-Rong Wen, Using Anchor Texts with Their Hyperlink Structure for Web Search, in Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval(SIGIR 2009), pages 227-234, ACM, July 2009 CCF A first author

Zhicheng Dou, Ruihua Song, Ji-Rong Wen, and Xiaojie Yuan, Evaluating the Effectiveness of Personalized Web Search, in IEEE Transactions on Knowledge and Data Engineering (TKDE), 21(8), pages 1178-1190, IEEE computer Society Digital Library, Aug., 2009 CCF A first author

Zhicheng Dou, Ruihua Song, Xiaojie Yuan, and Ji-Rong Wen, Are click-through data adequate for learning web search rankings?, in Proceeding of the 17th ACM conference on Information and knowledge management (CIKM 2008), pages 73-82, ACM, New York, NY, USA, 2008 CCF B first author

Zhicheng Dou, Xiaojie Yuan, and Songbai He, Analysis of Query Repetition in a Large-scale Chinese Search Log (大规模中文搜索日志中查询重复性分析), in Computer Engineering (In Chinese), Volumn 21, 2008

Xiaojie Yuan, Zhicheng Dou, Lu Zhang, and Fang Liu, Automatic User Goals Identification Based on Anchor Text and Click-through Data, in Wuhan University Journal of Natural Sciences (WISA2008), 13(4), pages 495-500, 2008

Xiaojie Yuan, Zhicheng Dou, Fang Liu, and Lu Zhang, Personalized Web Search Based on Dynamic User Profile (一种基于动态用户模型的个性化Web搜索算法), in NDBC 2008: Proceedings of the 25th National Database Conference (In Chinese), 2008

窦志成, 袁晓洁, and 何松柏, 大规模中文搜索日志中查询重复性分析, 计算机工程, 34(21), pages 40-44, 2008 (in Chinese)

Lu ZHANG, Xiao-jie YUAN, Fang LIU, and Zhicheng Dou, Research on Distributed Index Mechanism for Large Dataset, Microelectronics & Computer, Volume 10, Pages 037, 2008

Zhicheng Dou, Ruihua Song, and Ji-Rong Wen, A large-scale evaluation and analysis of personalized search strategies, in Proceedings of the 16th international conference on World Wide Web (WWW2007), pages 581-590, ACM Press, New York, NY, USA, 2007

社会兼职

SIGIR 2018 论文PC Chair,

CCIR 2018 青年学者论坛主席,

亚洲信息检索学术会议指导委员会(AIRS SC)主席,

SIGIR 2018 Senior PC,

ICTIR 2018赞助和工业论坛主席,

Information Retrieval编委。

任多个学术会议和期刊的程序委员会委员和审稿人(CIKM, WWW, TKDE,WSDM,KDD,AAAI,IEEE BigData, AIRS, 计算机学报,JMLC,CCL, NLPCC,JASIST,SIGIR Demo,深圳大学学报,中文信息学报等)。


检测到您当前使用浏览器版本过于老旧,会导致无法正常浏览网站;请您使用电脑里的其他浏览器如:360、QQ、搜狗浏览器的速模式浏览,或者使用谷歌、火狐等浏览器。

下载Firefox