Rongtao Xu

I am a researcher at MBZUAI (ranked top 10 in AI) and the part-time CTO of Spatialtemporal AI, and I have served as technical lead for several unicorns and startups (Rongtao-Xu.github.io). Previously, I was an Assistant Professor at the Institute of Automation, Chinese Academy of Sciences (CASIA), and a Visiting Scholar at Southern University of Science and Technology (SUSTech). At Galbot, I co-led the development of NaVid, the world’s first navigation foundation model. At the Momenta–CAS joint laboratory, I led research on autonomous driving perception algorithms. At Spatialtemporal AI, as Co-founder and CTO, I led the development of the manipulation foundation models A1 and A0. I also co-organized the CVPR 2026 Embodied Intelligence Challenge (ManipArena) in collaboration with x2robot.

I received my Ph.D. from the State Key Laboratory of Multimodal Artificial Intelligence (formerly the National Lab of Pattern Recognition) at the Institute of Automation, Chinese Academy of Sciences. During my Ph.D., I was awarded the CAS President’s Award (top 0.7%), the National Scholarship, Beijing Outstanding Graduate, and CAS Outstanding Graduate, and I received two Best Paper Award nominations (each top 0.3%) at IEEE flagship conferences. In 2019, I obtained dual Bachelor’s degrees in Mathematics and Computer Science from Huazhong University of Science and Technology (HUST), where I won the university’s first-ever top prize in the National Undergraduate Mathematical Contest in Modeling (1st out of 36,375 teams).

My research focuses on embodied intelligence and robot foundation models. I have published over 80 papers in top-tier conferences and journals (including RSS, IJCAI, IROS, CVPR, ICCV, ECCV, NeurIPS, ICLR, AAAI, EMNLP, MICCAI, TPAMI, TIP, TNNLS, TII, TIM, TMM, TCSVT, ISPRS), nearly 40 of them as first or corresponding author. These include 3 ESI Highly Cited Papers, 1 IEEE Transactions cover paper, and 8 oral presentations. My publications have received over 2,000 citations on Google Scholar.

I hold more than 10 invention patents, and my research has been deployed in real-world systems such as the YOLO series, as well as in products from Spatialtemporal AI, Galbot, Yijiahe, Huawei, and Momenta. Students and interns I have mentored have gone on to join top industry talent programs such as Huawei “Genius Youth,” Ant Star, and Xiaomi Star, or have been admitted to leading universities worldwide, including CMU, Stanford, UCSD, Cambridge, HKU, Tsinghua, and Peking University.


I am a member of IEEE, the China Society of Image and Graphics, the China Society for Graphics, and the Chinese Society for Stereology. I serve as a reviewer for leading international journals and conferences, including IEEE TPAMI, TIP, TNNLS, TMM, TCSVT, TII, CVPR, NeurIPS, AAAI, and MICCAI. I also co-organize the Embodied Intelligence International Challenge at CVPR 2025 and CVPR 2026.


A0 is the first object-centric, affordance-aware hierarchical model for general robotic manipulation; it decomposes manipulation tasks into spatial reasoning and action execution.
NaVid: the world's first general-purpose foundation model for embodied navigation.
The robot dog developed in collaboration with Sun Yat-sen University demonstrated real-world navigation, human-robot interaction, and instruction understanding, and was featured in a Dragon TV report.
The CA-Nav agent follows long-horizon instructions in complex continuous environments by dynamically adapting to spatial constraints, without any expert demonstrations.
NEWS
  • [2025] Two papers (A0 and RoBridge) accepted by ICCV 2025
  • [2025] One paper (3D-MoRe) accepted as an oral presentation at IROS 2025; one paper (CA-Nav) accepted by IEEE TPAMI
  • [2025] Co-organizer of the CVPR 2025 Embodied AI Workshop: Social Mobile Manipulation Challenge
  • [2025] One paper (PhyBlock) accepted by NeurIPS 2025; one paper accepted as an oral presentation at ICRA 2025; three papers accepted by AAAI 2025
  • [2025] Four papers accepted by CVPR, IEEE TIP, IEEE TCSVT, and IEEE TIM, respectively; two papers accepted by IEEE TMM; two papers accepted by IEEE TCSVT
  • [2024] Awarded the CAS President’s Award, Outstanding Graduate of Beijing, and Excellent Graduate of CAS
  • [2024] One paper (NaVid) was accepted by RSS 2024
  • [2024] The quadruped robot project in collaboration with Sun Yat-sen University was featured by Dragon TV and reposted by People’s Daily
  • [2024] One paper (NeuroClips) was accepted as oral presentation at NeurIPS 2024
  • [2024] Three papers were accepted by ICRA 2024, AAAI 2024, and IEEE TIP; two additional papers were accepted by IEEE JBHI
  • [2023] Received the National Scholarship for Doctoral Students from the Institute of Automation, Chinese Academy of Sciences
  • [2023] One paper (RSSFormer) was accepted by IEEE TIP and selected as an ESI Highly Cited Paper
  • [2023] One paper (WaveCAM) was accepted by IEEE TMM, selected as an ESI Highly Cited Paper, and included in the 2024 CCF Paper Digest
  • [2023] One paper (DomainFeat) was accepted by IEEE TCSVT, selected as an ESI Highly Cited Paper, and featured as a cover article
  • [2023] One paper (TSCD) was accepted as oral presentation at AAAI 2023; One paper was accepted at ICCV 2023; One paper was accepted at IEEE TPAMI
  • [2022] One paper (SoftGAN) was accepted by ICME 2022 and nominated for Best Paper Award; four other papers were accepted by AAAI 2022, TMM, MICCAI 2022, and ICASSP 2022
  • [2021] Two papers were accepted by ISBI 2021, including one Best Student Paper Nomination; another paper was accepted by MICCAI 2021
  • [2017] Received the “Higher Education Press Cup” top award in the National Undergraduate Mathematical Modeling Competition
SELECTED PUBLICATIONS

Weiye Zhu, Zekai Zhang, Xiangchen Wang, Hewei Pan, Teng Wang, Tiantian Geng, Rongtao Xu, Feng Zheng

arXiv

Rongtao Xu*, Jian Zhang*, Minghao Guo*, Youpeng Wen*, Haoting Yuyang, Min Lin, Jianzheng Huang, Zhe Li, Kaidong Zhang, Liqiong Wang, Yuxuan Kuang, Meng Cao, Feng Zheng, Xiaodan Liang

ICCV 2025

Kaidong Zhang*, Rongtao Xu, Pengzhen Ren, Junfan Lin, Hefeng Wu, Liang Lin, Xiaodan Liang

ICCV 2025

Rongtao Xu*, Han Gao*, Mingming Yu, Dong An, Shunpeng Chen, Changwei Wang, Li Guo, Xiaodan Liang, Shibiao Xu

IROS 2025 (Oral)

Bingqian Lin*, Yunshuang Nie*, Khun Loun Zai, Ziming Wei, Mingfei Han, Rongtao Xu, Minzhe Niu, Jianhua Han, Liang Lin, Cewu Lu, Xiaodan Liang

arXiv 2025

Liang Ma*, Jiajun Wen*, Min Lin*, Rongtao Xu*, Xiwen Liang*, Bingqian Lin, Jun Ma, Yongxin Wang, Ziming Wei, Haokun Lin, Mingfei Han, Meng Cao, Bokui Chen, Ivan Laptev, Xiaodan Liang

NeurIPS 2025

Kehan Chen*, Dong An*, Yan Huang, Rongtao Xu, Yifei Su, Yonggen Ling, Ian Reid, Liang Wang

TPAMI 2025

Jiazhao Zhang*, Kunyu Wang*, Rongtao Xu*, Gengze Zhou, Yicong Hong, Xiaomeng Fang, Qi Wu, Zhizheng Zhang, He Wang

RSS 2024

Pengzhen Ren*, Min Li*, Zhen Luo*, Xinshuai Song*, Ziwei Chen*, Weijia Liufu*, Yixuan Yang*, Hao Zheng*, Rongtao Xu, Zitong Huang, Tongsheng Ding, Luyang Xie, Kaidong Zhang, Changfei Fu, Yang Liu, Liang Lin, Feng Zheng, Xiaodan Liang

arXiv 2024

Zixuan Gong*, Guangyin Bao*, Qi Zhang†, Zhongwei Wan, Duoqian Miao†, Shoujin Wang, Lei Zhu, Changwei Wang, Rongtao Xu, Liang Hu, Ke Liu, Yu Zhang

NeurIPS 2024 (Oral)

Rongtao Xu, Changwei Wang, Duzhen Zhang, Man Zhang, Shibiao Xu*, Weiliang Meng*, Xiaopeng Zhang

ICRA 2024 (Oral)

Wenhao Xu*, Rongtao Xu*, Changwei Wang, Shibiao Xu, Li Guo, Man Zhang, Xiaopeng Zhang

AAAI 2024

Rongtao Xu*, Changwei Wang*, Jiguang Zhang, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

IEEE TIP, 2023 (ESI Highly Cited Paper)

Changwei Wang*, Rongtao Xu*, Ke Lu, Shibiao Xu, Weiliang Meng, Yuyang Zhang, Bin Fan, Xiaopeng Zhang

IEEE TPAMI, 2023

Rongtao Xu*, Changwei Wang*, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

IEEE TMM, 2024 (ESI Highly Cited Paper)

Rongtao Xu*, Changwei Wang*, Shibiao Xu, Weiliang Meng, Yuyang Zhang, Bin Fan, Xiaopeng Zhang

IEEE TCSVT, 2024 (ESI Highly Cited Paper)

Rongtao Xu*, Changwei Wang*, Jiaxi Sun, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

AAAI 2023 (Oral)

Changwei Wang*, Rongtao Xu*, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

ICCV 2023

Changwei Wang*, Rongtao Xu*, Shibiao Xu, Weiliang Meng, Jun Xiao, Xiaopeng Zhang

IEEE TNNLS, 2023 (extended version of SoftGAN)

Changwei Wang*, Rongtao Xu*, Shibiao Xu, Weiliang Meng, Jun Xiao, Qimin Peng, Xiaopeng Zhang

IEEE ICME, 2022, Best Paper Nomination

Achievements
  • 2024 Excellent Prize of the President Scholarship, Chinese Academy of Sciences
  • 2024 Beijing Outstanding Graduates
  • 2023 CAS Outstanding Graduates, Chinese Academy of Sciences
  • 2023 National Scholarship for PhD Students, Institute of Automation, CAS
  • 2022 Best Paper Nomination Award, IEEE ICME 2022
  • 2021 Best Student Paper Nomination Award, IEEE ISBI 2021
Education
Institute of Automation, Chinese Academy of Sciences (CASIA)
Ph.D. in Pattern Recognition and Intelligent Systems     • Sept 2019 – Jun 2024
Published 20+ papers as first author or joint first author, including 15 in CAS Zone 1 journals or CCF-A venues, 3 ESI Highly Cited Papers, and 1 Best Paper Nomination
Huazhong University of Science and Technology
B.Sc. in Mathematics, Minor in Computer Science     • Jun 2015 – Sept 2019
Winner of the CUMCM “Higher Education Press Cup”
Guiyang No.1 High School
High School     • Sept 2012 – Jun 2015
Science Experimental Class
PROFESSIONAL SERVICE
  • Reviewer: IEEE TPAMI, TIP, TNNLS, TMM, TCSVT, TII, CVPR, ICCV, NeurIPS, AAAI, ICRA, IROS, MICCAI
  • Member: IEEE, China Society of Image and Graphics (CSIG), Chinese Society for Stereology (CSS), China Graphics Society (CGS)
  • Organizer: CVPR 2025 Embodied AI Workshop: Social Mobile Manipulation Challenge