Researcher at Mohamed bin Zayed University of Artificial Intelligence
Co-Founder & CTO at Spatialtemporal AI
I am currently a Researcher at the Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), working with Prof. Xiaodan Liang, Prof. Ivan Laptev, and Prof. Ian Reid. My research focuses on intelligent robots, embodied AI, multimodal large models, and spatial intelligence. My goal is to train multimodal embodied large models and to leverage limited data to enhance robotic perception, understanding, action, and decision-making. I proposed the general manipulation model A0 and the navigation model NaVid. I have published over 50 papers in top journals and conferences in these areas, including 26 as first or corresponding author in IEEE TPAMI/TIP/TNNLS/TIM/TMM/TCSVT/TGRS and RSS/ICRA/ICCV/AAAI/MICCAI/ICME, of which 15 are in CAS Zone 1, 6 in CCF-A, 8 in Tsinghua-A, and 3 are ESI Highly Cited Papers. I have delivered multiple Oral presentations at NeurIPS, AAAI, ICRA, and other conferences, and my work has received over 1700 citations on Google Scholar. I am a member of IEEE, the China Society of Image and Graphics, the China Graphics Society, and the Chinese Society for Stereology. I serve as a reviewer for leading journals and conferences including IEEE TPAMI, IEEE TIP, IEEE TNNLS, IEEE TMM, IEEE TCSVT, IEEE TII, Neural Networks, CVPR, NeurIPS, AAAI, and MICCAI. I am also a co-organizer of the CVPR 2025 Embodied AI International Challenge: Social Mobile Manipulation.
Previously, I was an Assistant Professor at the Institute of Automation, Chinese Academy of Sciences (CASIA). I received my Ph.D. in Artificial Intelligence in 2024 from the National Laboratory of Multimodal Artificial Intelligence Systems (CASIA-MAIS), Institute of Automation, Chinese Academy of Sciences. During my Ph.D. studies, I received the CAS President’s Award, the National Scholarship, the Outstanding Graduate of Beijing Award, the Excellent Graduate of CAS Award, and two Best Paper Nominations at flagship IEEE conferences. In 2019, I obtained dual bachelor's degrees in Mathematics and Computer Science from Huazhong University of Science and Technology (HUST), where I won the top prize in the National Mathematical Modeling Competition. Additionally, I conducted research under the supervision of Prof. He Wang at BAAI / Galbot.
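I am currently Co-Founder and CTO of Spatialtemporal AI (无界智慧) and a Researcher at the Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), working with Prof. Xiaodan Liang, Prof. Ivan Laptev, and Prof. Ian Reid. My research focuses on intelligent robots, embodied AI, multimodal large models, and spatial intelligence. The goal of my research is to train multimodal embodied large models and to leverage limited data to improve robots' perception, understanding, action, and decision-making. I proposed the manipulation large model A0 and the navigation large model NaVid. I have published over 70 papers in journals and conferences in related fields, including more than 30 as first or corresponding author in top international journals and conferences such as IEEE TPAMI/TIP/TNNLS/TIM/TMM/TCSVT/TGRS and RSS/ICRA/ICCV/AAAI/MICCAI/ICME (CAS Zone 1: 16, CCF-A: 7, Tsinghua-A: 9, ESI Highly Cited Papers: 3). I have presented multiple Oral papers at NeurIPS, AAAI, ICRA, and other conferences, and my work has received over 1700 citations on Google Scholar. I am a member of IEEE, the China Society of Image and Graphics, the China Graphics Society, and the Chinese Society for Stereology, and I serve as a reviewer for international journals and conferences including IEEE TPAMI, IEEE TIP, IEEE TNNLS, IEEE TMM, IEEE TCSVT, IEEE TII, Neural Networks, CVPR, NeurIPS, AAAI, and MICCAI. I co-organized the CVPR 2025 Embodied AI International Challenge: Social Mobile Manipulation.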
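We are actively recruiting engineers and interns in Shenzhen. If you are interested or have a collaboration opportunity, please feel free to contact me!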
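Before that, I was an Assistant Professor at the Institute of Automation, Chinese Academy of Sciences. I received my Ph.D. in Artificial Intelligence in 2024 from the National Laboratory of Multimodal Artificial Intelligence Systems (CASIA-MAIS), Institute of Automation, Chinese Academy of Sciences. During my studies, I received the CAS President's Award, the National Scholarship, the Outstanding Graduate of Beijing Award, the Excellent Graduate of CAS Award, and two Best Paper Nominations at flagship IEEE conferences. In 2019, I obtained dual bachelor's degrees in Mathematics and Computer Science from Huazhong University of Science and Technology (HUST), and I won the top prize in the National Mathematical Modeling Competition. In addition, under the guidance of Prof. He Wang at BAAI / Galbot, I co-led the development and deployment of NaVid, the world's first video-based embodied navigation large model.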