讲座主题:Context Autoencoder for Scalable Self-Supervised Representation Pretrainin
讲座时间:2022年3月18日(周五)15:40-17:00
腾讯会议:610-614-378
邀请人:胡迪 中国人民大学高瓴人工智能学院准聘助理教授
讲座摘要:Self-supervised representation pretraining aims to learn an encoder from unlabeled images, such that the encoded representations take on semantics and benefit downstream tasks. In this talk, I present a novel masked image modeling approach, context autoencoder (CAE), for scalable self-supervised representation training. The core ideas include that predictions are made in the latent representation space from visible patches to masked patches and that the encoder is only for representation learning and representation learning is only by the encoder. I also discuss why masked image modeling potentially outperforms contrastive pretraining (e.g., SimCLR, MoCo) and why contrastive learning performs on par with supervised pretraining on ImageNet. In addition, I show that linear probing and the extended version, attentive probing, are more suitable than fine-tuning on ImageNet for pretraining evaluation.
主讲专家:王井东 百度 AI 计算机视觉首席科学家
Jingdong Wang is a Chief Scientist for computer vision with the Artificial Intelligence Group at Baidu. His team is focusing on conducting product-driven and cutting-edge computer vision/deep learning/AI research and developing practical computer vision applications. Before joining Baidu, he was a Senior Principal Researcher at Microsoft Research Asia. His areas of interest are computer vision, deep learning, and multimedia search. His representative works include deep high-resolution network (HRNet), discriminative regional feature integration (DRFI) for supervised saliency detection, neighborhood graph search (NGS, SPTAG) for large scale similarity search. He has been serving/served as an Associate Editor of IEEE TPAMI, IJCV, IEEE TMM, and IEEE TCSVT, and an area chair of leading conferences in vision, multimedia, and AI, such as CVPR, ICCV, ECCV, ACM MM, IJCAI, and AAAI. He was elected as an ACM Distinguished Member, a Fellow of IAPR, and a Fellow of IEEE, for his contributions to visual content understanding and retrieval.
项目背景:“高屋建瓴AI公开课”项目由中国人民大学高瓴人工智能学院发起,旨在扩大人工智能学科影响力、提升学科发展水准。公开课项目命名为“高屋建瓴”,寓意在高瓴人工智能学院的平台上,汇聚高端人才,发出人工智能研究方向高瞻远瞩的声音。
检测到您当前使用浏览器版本过于老旧,会导致无法正常浏览网站;请您使用电脑里的其他浏览器如:360、QQ、搜狗浏览器的速模式浏览,或者使用谷歌、火狐等浏览器。
下载Firefox