Rongtao Xu
Rongtao Xu
许镕涛

阿联酋人工智能大学博士后研究员

南方科技大学访问学者

I am currently a Postdoctoral Researcher at the Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), working with Prof. Xiaodan Liang, Prof. IvanLaptev, and Prof. IanReid. My research focuses on Intelligent Robot, Embodied AI, Multimodal Large Model, and Spatial Intelligence. The goal of my research is to train multimodal embodied large models and leverage limited data to enhance robotic perception, understanding, action, and decision-making. I proposed the general manipulation model A0 and the navigation model NaVid. I have published over 50 papers in related top journals and conferences, including 26 as the first or corresponding author in IEEE TPAMI/TIP/TNNLS/TIM/TMM/TCSVT/TGRS, RSS/ICRA/ICCV/AAAI/MICCAI/ICME, among which 15 are in CAS Zone 1, 6 in CCF-A, 8 in Tsinghua-A, and 3 are ESI Highly Cited Papers. I have delivered multiple Oral presentations at NeurIPS, AAAI, ICRA, etc., and my work has received over 800 citations on Google Scholar. I am a member of IEEE, the China Society of Image and Graphics, the China Graphics Society, and the Chinese Society for Stereology. I serve as a reviewer for leading journals and conferences including IEEE TPAMI, IEEE TIP, IEEE TNNLS, IEEE TMM, IEEE TCSVT, IEEE TII, Neural Networks, CVPR, NeurIPS, AAAI, and MICCAI. Co-organizer of the CVPR 2025 Embodied AI International Challenge: Social Mobile Manipulation.

We are actively looking for engineers and interns in Shenzhen. Feel free to contact me if you are interested or if there are potential collaboration opportunities.

more

Previously, I was an Assistant Professor at the Institute of Automation, Chinese Academy of Sciences (CASIA). I received my Ph.D. in Artificial Intelligence in 2024 from the National Laboratory of Multimodal Artificial Intelligence Systems (CASIA-MAIS), Institute of Automation, Chinese Academy of Sciences. During my Ph.D. studies, I was awarded the CAS President’s Award, the National Scholarship, the Outstanding Graduate of Beijing Award, the Excellent Graduate of CAS Award, and received two Best Paper Nominations at flagship IEEE conferences. In 2019, I obtained dual bachelor's degrees in Mathematics and Computer Science from Huazhong University of Science and Technology (HUST), where I won the top prize in the National Mathematical Modeling Competition. Additionally, I conducted research under the supervision of Prof. Xiang Bai at Huazhong University of Science and Technology, and Prof. He Wang at the Beijing Academy of Artificial Intelligence / Galbot.

我现在是阿联酋人工智能大学(MBZUAI)博士后研究员,与梁小丹教授、 IvanLaptev教授和IanReid教授一起工作。 我的研究方向是智能机器人、具身智能、多模态大模型和空间智能。研究目标在于训练多模态具身大模型和利用有限的数据,以提升机器人在感知、理解、行动和决策等方面的能力。 提出操纵大模型A0,导航大模型NaVid。在相关领域学术期刊和会议上共发表论文50余篇,其中以第一作者或通讯作者在 IEEE TPAMI/TIP/TNNLS/TIM/TMM/TCSVT/TGRS, RSS/ICRA/ICCV/AAAI/MICCAI/ICME 等国际顶级期刊和会议上发表论文26篇(中科院一区: 15, CCF‑A: 6, 清华‑A:8,ESI高被引论文: 3)。曾在NeurIPS、AAAI、ICRA等会议上发表多篇Oral论文,谷歌学术引用800余次。 任IEEEmember,中国图像图形学会会员,中国图学学会会员,中国体视学学会会员,担任IEEETPAMI,IEEETIP,IEEETNNLS,IEEETMM,IEEETCSVT,IEEETII,NeuralNetworks, CVPR, NeurIPS, AAAI, MICCAI 等 国际期刊和会议的审稿人。共同组织CVPR2025具身智能国际挑战赛:Social Mobile Manipulation。

我们正在深圳积极招募工程师和实习生。如果您感兴趣或有合作机会,请随时联系我!

展开更多

在此之前,我是中国科学院自动化研究所的助理研究员。我2024年在中国科学院自动化研究所多模态人工智能系统全国重点实验室(CASIA‑MAIS)获得了人工智能博士学位, 在学期间曾获得中国科学院院长奖、国家奖学金、北京市优秀毕业生、中国科学院优秀毕业生和两次IEEE旗舰会议最佳论文提名奖。 我2019年在华中科技大学(HUST)获得了数学与计算机双学士学位,曾获全国数学建模竞赛最高奖。 此外,我曾在华中科技大学(白翔教授)和北京智源人工智能研究院/Galbot (王鹤教授)指导下开展科研工作。

A0 is the first object-centric affordance-aware hierarchical model for general robotic manipulation, decomposing manipulation tasks into spatial reasoning and action execution.
A0 是首个以目标物体为中心的具备可供性感知的通用机器人操纵分层模型,将操纵任务分解为空间推理与动作执行。
The world's first generalized embodied navigation large model: NaVid.
世界首个通用具身导航大模型:NaVid。
The robot dog developed in collaboration with Sun Yat-sen University demonstrated real-world navigation, human-robot interaction, and instruction understanding capabilities, and was interviewed and reported by Dragon TV.
与中山大学合作的机器狗在真实环境中展示了导航、人机交互与指令理解能力,被东方卫视采访报道。
The CA-Nav agent follows long-horizon instructions in complex continuous environments by dynamically adapting to spatial constraints without any expert demonstrations.
CA-Nav 智能体无需专家示范,便可在复杂连续环境中动态适应空间约束,完成长程指令的执行。
A0 is the first object-centric affordance-aware hierarchical model for general robotic manipulation, decomposing manipulation tasks into spatial reasoning and action execution.
A0 是首个以目标物体为中心的具备可供性感知的通用机器人操纵分层模型,将操纵任务分解为空间推理与动作执行。
The world's first generalized embodied navigation large model: NaVid.
世界首个通用具身导航大模型:NaVid。
The robot dog developed in collaboration with Sun Yat-sen University demonstrated real-world navigation, human-robot interaction, and instruction understanding capabilities, and was interviewed and reported by Dragon TV.
与中山大学合作的机器狗在真实环境中展示了导航、人机交互与指令理解能力,被东方卫视采访报道。
The CA-Nav agent follows long-horizon instructions in complex continuous environments by dynamically adapting to spatial constraints without any expert demonstrations.
CA-Nav 智能体无需专家示范,便可在复杂连续环境中动态适应空间约束,完成长程指令的执行。
NEWS
新闻
SELECTED PUBLICATIONS
精选论文

Rongtao Xu*, Jian Zhang*, Minghao Guo*, Youpeng Wen*, Haoting Yuyang, Min Lin, Jianzheng Huang, Zhe Li, Kaidong Zhang, Liqiong Wang, Yuxuan Kuang, Meng Cao, Feng Zheng, Xiaodan Liang

arXiv 2025

Kehan Chen*, Dong An*, Yan Huang, Rongtao Xu, Yifei Su, Yonggen Ling, Ian Reid, Liang Wang

arXiv 2024

Jiazhao Zhang*, Kunyu Wang*, Rongtao Xu*, Gengze Zhou, Yicong Hong, Xiaomeng Fang, Qi Wu, Zhizheng Zhang, He Wang

RSS 2024

Pengzhen Ren*, Min Li*, Zhen Luo*, Xinshuai Song*, Ziwei Chen*, Weijia Liufu*, Yixuan Yang*, Hao Zheng*, Rongtao Xu, Zitong Huang, Tongsheng Ding, Luyang Xie, Kaidong Zhang, Changfei Fu, Yang Liu, Liang Lin, Feng Zheng, Xiaodan Liang

arXiv 2024

Zixuan Gong*, Guangyin Bao*, Qi Zhang†, Zhongwei Wan, Duoqian Miao†, Shoujin Wang, Lei Zhu, Changwei Wang, Rongtao Xu, Liang Hu, Ke Liu, Yu Zhang

NeurIPS 2024 (Oral)

Rongtao Xu, Changwei Wang, Duzhen Zhang, Man Zhang, Shibiao Xu*, Weiliang Meng*, Xiaopeng Zhang

ICRA 2024 (Oral)

Wenhao Xu*, Rongtao Xu*, Changwei Wang, Shibiao Xu, Li Guo, Man Zhang, Xiaopeng Zhang

AAAI 2024

Rongtao Xu*, Changwei Wang*, Jiguang Zhang, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

IEEE TIP, 2023 (ESI Highly Cited Paper)

Changwei Wang*, Rongtao Xu*, Ke Lu, Shibiao Xu, Weiliang Meng, Yuyang Zhang, Bin Fan, Xiaopeng Zhang

IEEE TPAMI, 2023

Rongtao Xu*, Changwei Wang*, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

IEEE TMM, 2024 (ESI Highly Cited Paper)

Rongtao Xu*, Changwei Wang*, Shibiao Xu, Weiliang Meng, Yuyang Zhang, Bin Fan, Xiaopeng Zhang

IEEE TCSVT, 2024 (ESI Highly Cited Paper)

Rongtao Xu*, Changwei Wang*, Jiaxi Sun, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

AAAI 2023 (Oral)

Changwei Wang*, Rongtao Xu*, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

ICCV 2023

Changwei Wang*,Rongtao Xu*, Shibiao Xu, Weiliang Meng, Jun Xiao, Xiaopeng Zhang

IEEE TNNLS, 2023 (extended version of SoftGAN)

Changwei Wang*, Rongtao Xu*, Shibiao Xu, Weiliang Meng, Jun Xiao, Qimin Peng, Xiaopeng Zhang

IEEE ICME, 2022, Best Paper Nomination

Rongtao Xu*, Jian Zhang*, Minghao Guo*, Youpeng Wen*, Haoting Yuyang, Min Lin, Jianzheng Huang, Zhe Li, Kaidong Zhang, Liqiong Wang, Yuxuan Kuang, Meng Cao, Feng Zheng, Xiaodan Liang

arXiv 2025

Kehan Chen*, Dong An*, Yan Huang, Rongtao Xu, Yifei Su, Yonggen Ling, Ian Reid, Liang Wang

arXiv 2024

Jiazhao Zhang*, Kunyu Wang*, Rongtao Xu*, Gengze Zhou, Yicong Hong, Xiaomeng Fang, Qi Wu, Zhizheng Zhang, He Wang

arXiv 2024

Pengzhen Ren*, Min Li*, Zhen Luo*, Xinshuai Song*, Ziwei Chen*, Weijia Liufu*, Yixuan Yang*, Hao Zheng*, Rongtao Xu, Zitong Huang, Tongsheng Ding, Luyang Xie, Kaidong Zhang, Changfei Fu, Yang Liu, Liang Lin, Feng Zheng, Xiaodan Liang

arXiv 2024

Zixuan Gong*, Guangyin Bao*, Qi Zhang†, Zhongwei Wan, Duoqian Miao†, Shoujin Wang, Lei Zhu, Changwei Wang, Rongtao Xu, Liang Hu, Ke Liu, Yu Zhang

NeurIPS 2024 (口头报告)

Rongtao Xu, Changwei Wang, Duzhen Zhang, Man Zhang, Shibiao Xu*, Weiliang Meng*, Xiaopeng Zhang

ICRA 2024 (口头报告)

Wenhao Xu*, Rongtao Xu*, Changwei Wang, Shibiao Xu, Li Guo, Man Zhang, Xiaopeng Zhang

AAAI 2024

Rongtao Xu*, Changwei Wang*, Jiguang Zhang, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

IEEE TIP, 2023 (ESI 高被引论文)

Changwei Wang*, Rongtao Xu*, Ke Lu, Shibiao Xu, Weiliang Meng, Yuyang Zhang, Bin Fan, Xiaopeng Zhang

IEEE TPAMI, 2023

Rongtao Xu*, Changwei Wang*, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

IEEE TMM, 2024 (ESI 高被引论文)

Rongtao Xu*, Changwei Wang*, Shibiao Xu, Weiliang Meng, Yuyang Zhang, Bin Fan, Xiaopeng Zhang

IEEE TCSVT, 2024 (ESI 高被引论文)

Rongtao Xu*, Changwei Wang*, Jiaxi Sun, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

AAAI 2023 (口头报告)

Changwei Wang*, Rongtao Xu*, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

ICCV 2023

Changwei Wang*, Rongtao Xu*, Shibiao Xu, Weiliang Meng, Jun Xiao, Xiaopeng Zhang

IEEE TNNLS, 2023(SoftGAN 的扩展版本)

Changwei Wang*, Rongtao Xu*, Shibiao Xu, Weiliang Meng, Jun Xiao, Qimin Peng, Xiaopeng Zhang

IEEE ICME, 2022, 最佳论文提名

Achievements
成就
  • 2024 Excellent Prize of the President Scholarship, Chinese Academy of Sciences
  • 2024 Beijing Outstanding Graduates
  • 2023 CAS Outstanding Graduates, Chinese Academy of Sciences
  • 2023 National Scholarship for PhD Students, Institute of Automation, CAS
  • 2022 Best Paper Nomination Award, IEEE ICME 2022
  • 2021 Best Student Paper Nomination Award, IEEE ISBI 2021
  • 2024 中国科学院院长优秀奖
  • 2024 北京市优秀毕业生
  • 2023 中国科学院优秀毕业生
  • 2023 博士研究生国家奖学金(中国科学院自动化研究所)
  • 2022 IEEE ICME 最佳论文提名奖
  • 2021 IEEE ISBI 最佳学生论文提名奖
Education
教育背景
Institute of Automation, CASIA
Ph.D. in Pattern Recognition and Intelligent System     •Sept 2019 – Jun 2024
Published 20+ papers as the first author or joint first author, including 15 CAS Zone 1/CCF-A, 3 ESI Highly Cited Papers, 1 Best Paper Nomination
Huazhong University of Science and Technology
B.Sc. in Mathematics, Minor in Computer Science     •Jun 2015 – Sept 2019
CUMCM “Higher Education Cup” Winner
Guiyang No.1 High School
High School     • Sept 2012 – Jun 2015
Science Experimental Class
中国科学院自动化研究所
博士,模式识别与智能系统     • 2019年9月 – 2024年6月
以一作或共一发表论文20余篇,包括15篇中科院一区/CCF-A,3篇ESI高被引论文,1项最佳论文提名
华中科技大学
数学学士,辅修计算机科学     • 2015年6月 – 2019年9月
全国大学生数学建模竞赛“高教社杯”一等奖
贵阳一中
高中     • 2012年9月 – 2015年6月
理科实验班
PROFESSIONAL SERVICE
学术服务
  • Reviewer: IEEE TPAMI, TIP, TNNLS, TMM, TCSVT, TII, CVPR, ICCV, NeurIPS, AAAI, ICRA, IROS, MICCAI
  • Member: IEEE, China Society of Image and Graphics (CSIG), Chinese Society for Stereology (CSS), China Graphics Society (CGS)
  • Organizer: CVPR 2025 Embodied AI Workshop Social Mobile Manipulation
  • 审稿人:IEEE TPAMI、TIP、TNNLS、TMM、TCSVT、TII,CVPR、ICCV、NeurIPS、AAAI、MICCAI、ICRA
  • 成员:IEEE 电气与电子工程师协会、中国图象图形学学会(CSIG)、中国体视学学会(CSS)、中国图学学会(CGS)
  • 组织者:CVPR 2025 Embodied AI Workshop Social Mobile Manipulation