Rongtao Xu

Co-Founder & CTO at Spatialtemporal AI

Postdoctoral Researcher at Mohamed bin Zayed University of Artificial Intelligence

许镕涛

I am currently a Postdoctoral Researcher at the Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), working with Prof. Xiaodan Liang, Prof. Ivan Laptev, and Prof. Ian Reid. My research focuses on intelligent robots, embodied AI, multimodal large models, and spatial intelligence. The goal of my research is to train multimodal embodied large models and to leverage limited data to enhance robotic perception, understanding, action, and decision-making. I proposed the general manipulation model A0 and the navigation model NaVid. I have published over 50 papers in top journals and conferences in these areas, including 26 as first or corresponding author in IEEE TPAMI/TIP/TNNLS/TIM/TMM/TCSVT/TGRS and RSS/ICRA/ICCV/AAAI/MICCAI/ICME; of these, 15 are in CAS Zone 1, 6 in CCF-A, and 8 in Tsinghua-A, and 3 are ESI Highly Cited Papers. I have delivered multiple oral presentations at NeurIPS, AAAI, ICRA, and other venues, and my work has received over 800 citations on Google Scholar. I am a member of IEEE, the China Society of Image and Graphics, the China Graphics Society, and the Chinese Society for Stereology. I serve as a reviewer for leading journals and conferences, including IEEE TPAMI, IEEE TIP, IEEE TNNLS, IEEE TMM, IEEE TCSVT, IEEE TII, Neural Networks, CVPR, NeurIPS, AAAI, and MICCAI. I am also a co-organizer of the CVPR 2025 Embodied AI International Challenge: Social Mobile Manipulation.

We are actively looking for engineers and interns in Shenzhen. Feel free to contact me if you are interested or if there are potential collaboration opportunities.

Previously, I was an Assistant Professor at the Institute of Automation, Chinese Academy of Sciences (CASIA). I received my Ph.D. in Artificial Intelligence in 2024 from the National Laboratory of Multimodal Artificial Intelligence Systems (CASIA-MAIS), Institute of Automation, Chinese Academy of Sciences. During my Ph.D. studies, I was awarded the CAS President’s Award, the National Scholarship, the Outstanding Graduate of Beijing Award, the Excellent Graduate of CAS Award, and received two Best Paper Nominations at flagship IEEE conferences. In 2019, I obtained dual bachelor's degrees in Mathematics and Computer Science from Huazhong University of Science and Technology (HUST), where I won the top prize in the National Mathematical Modeling Competition. Additionally, I conducted research under the supervision of Prof. Xiang Bai at Huazhong University of Science and Technology, and Prof. He Wang at the Beijing Academy of Artificial Intelligence / Galbot.

A0 is the first object-centric, affordance-aware hierarchical model for general robotic manipulation; it decomposes manipulation tasks into spatial reasoning and action execution.
NaVid: the world's first generalized large model for embodied navigation.
The robot dog developed in collaboration with Sun Yat-sen University demonstrated real-world navigation, human-robot interaction, and instruction understanding, and was featured in a report by Dragon TV.
The CA-Nav agent follows long-horizon instructions in complex continuous environments by dynamically adapting to spatial constraints, without any expert demonstrations.
NEWS
  • [2025] Two papers accepted by ICCV 2025
  • [2025] One paper accepted as an oral presentation at IROS 2025
  • [2025] Co-organizer of the CVPR 2025 Embodied AI Workshop: Social Mobile Manipulation Challenge
  • [2025] One paper accepted as an oral presentation at ICRA 2025
  • [2025] Two papers accepted by AAAI 2025
  • [2025] Three papers accepted by IEEE TIP, IEEE TCSVT, and IEEE TIM respectively
  • [2024] Awarded the CAS President’s Award, Outstanding Graduate of Beijing, and Excellent Graduate of CAS
  • [2024] One paper (NaVid) was accepted by RSS 2024
  • [2024] The quadruped robot project in collaboration with Sun Yat-sen University was featured by Dragon TV and reposted by People’s Daily
  • [2024] One paper (NeuroClips) was accepted as an oral presentation at NeurIPS 2024
  • [2024] Three papers were accepted by ICRA 2024, AAAI 2024, and IEEE TIP; two additional papers were accepted by IEEE JBHI
  • [2023] Received the National Scholarship for Doctoral Students from the Institute of Automation, Chinese Academy of Sciences
  • [2023] One paper (RSSFormer) was accepted by IEEE TIP and selected as an ESI Highly Cited Paper
  • [2023] One paper (WaveCAM) was accepted by IEEE TMM, selected as an ESI Highly Cited Paper, and included in the 2024 CCF Paper Digest
  • [2023] One paper (DomainFeat) was accepted by IEEE TCSVT, selected as an ESI Highly Cited Paper, and featured as a cover article
  • [2023] One paper (SCD) was accepted as an oral presentation at AAAI 2023; one paper was accepted at ICCV 2023
  • [2022] One paper (SoftGAN) was accepted by ICME 2022 and nominated for the Best Paper Award; four other papers were accepted by AAAI 2022, TMM, MICCAI 2022, and ICASSP 2022
  • [2021] Two papers were accepted by ISBI, including one Best Student Paper Nomination; another paper was accepted by MICCAI 2021
  • [2017] Received the “Higher Education Press Cup” top award in the National Undergraduate Mathematical Modeling Competition
  • ...
SELECTED PUBLICATIONS

Rongtao Xu*, Jian Zhang*, Minghao Guo*, Youpeng Wen*, Haoting Yuyang, Min Lin, Jianzheng Huang, Zhe Li, Kaidong Zhang, Liqiong Wang, Yuxuan Kuang, Meng Cao, Feng Zheng, Xiaodan Liang

ICCV 2025

Kaidong Zhang*, Rongtao Xu, Pengzhen Ren, Junfan Lin, Hefeng Wu, Liang Lin, Xiaodan Liang

arXiv 2025

Rongtao Xu*, Han Gao*, Mingming Yu, Dong An, Shunpeng Chen, Changwei Wang, Li Guo, Xiaodan Liang, Shibiao Xu

IROS 2025 (Oral)

Bingqian Lin*, Yunshuang Nie*, Khun Loun Zai, Ziming Wei, Mingfei Han, Rongtao Xu, Minzhe Niu, Jianhua Han, Liang Lin, Cewu Lu, Xiaodan Liang

arXiv 2025

Liang Ma*, Jiajun Wen*, Min Lin*, Rongtao Xu*, Xiwen Liang*, Bingqian Lin, Jun Ma, Yongxin Wang, Ziming Wei, Haokun Lin, Mingfei Han, Meng Cao, Bokui Chen, Ivan Laptev, Xiaodan Liang

arXiv 2025

Kehan Chen*, Dong An*, Yan Huang, Rongtao Xu, Yifei Su, Yonggen Ling, Ian Reid, Liang Wang

arXiv 2024

Jiazhao Zhang*, Kunyu Wang*, Rongtao Xu*, Gengze Zhou, Yicong Hong, Xiaomeng Fang, Qi Wu, Zhizheng Zhang, He Wang

RSS 2024

Pengzhen Ren*, Min Li*, Zhen Luo*, Xinshuai Song*, Ziwei Chen*, Weijia Liufu*, Yixuan Yang*, Hao Zheng*, Rongtao Xu, Zitong Huang, Tongsheng Ding, Luyang Xie, Kaidong Zhang, Changfei Fu, Yang Liu, Liang Lin, Feng Zheng, Xiaodan Liang

arXiv 2024

Zixuan Gong*, Guangyin Bao*, Qi Zhang†, Zhongwei Wan, Duoqian Miao†, Shoujin Wang, Lei Zhu, Changwei Wang, Rongtao Xu, Liang Hu, Ke Liu, Yu Zhang

NeurIPS 2024 (Oral)

Rongtao Xu, Changwei Wang, Duzhen Zhang, Man Zhang, Shibiao Xu*, Weiliang Meng*, Xiaopeng Zhang

ICRA 2024 (Oral)

Wenhao Xu*, Rongtao Xu*, Changwei Wang, Shibiao Xu, Li Guo, Man Zhang, Xiaopeng Zhang

AAAI 2024

Rongtao Xu*, Changwei Wang*, Jiguang Zhang, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

IEEE TIP, 2023 (ESI Highly Cited Paper)

Changwei Wang*, Rongtao Xu*, Ke Lu, Shibiao Xu, Weiliang Meng, Yuyang Zhang, Bin Fan, Xiaopeng Zhang

IEEE TPAMI, 2023

Rongtao Xu*, Changwei Wang*, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

IEEE TMM, 2024 (ESI Highly Cited Paper)

Rongtao Xu*, Changwei Wang*, Shibiao Xu, Weiliang Meng, Yuyang Zhang, Bin Fan, Xiaopeng Zhang

IEEE TCSVT, 2024 (ESI Highly Cited Paper)

Rongtao Xu*, Changwei Wang*, Jiaxi Sun, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

AAAI 2023 (Oral)

Changwei Wang*, Rongtao Xu*, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

ICCV 2023

Changwei Wang*, Rongtao Xu*, Shibiao Xu, Weiliang Meng, Jun Xiao, Xiaopeng Zhang

IEEE TNNLS, 2023 (extended version of SoftGAN)

Changwei Wang*, Rongtao Xu*, Shibiao Xu, Weiliang Meng, Jun Xiao, Qimin Peng, Xiaopeng Zhang

IEEE ICME, 2022, Best Paper Nomination

Achievements
  • 2024 Excellent Prize of the President Scholarship, Chinese Academy of Sciences
  • 2024 Beijing Outstanding Graduates
  • 2023 CAS Outstanding Graduates, Chinese Academy of Sciences
  • 2023 National Scholarship for PhD Students, Institute of Automation, CAS
  • 2022 Best Paper Nomination Award, IEEE ICME 2022
  • 2021 Best Student Paper Nomination Award, IEEE ISBI 2021
Education
Institute of Automation, Chinese Academy of Sciences (CASIA)
Ph.D. in Pattern Recognition and Intelligent Systems • Sept 2019 – Jun 2024
Published 20+ papers as first author or joint first author, including 15 in CAS Zone 1/CCF-A venues, 3 ESI Highly Cited Papers, and 1 Best Paper Nomination
Huazhong University of Science and Technology
B.Sc. in Mathematics, Minor in Computer Science • Jun 2015 – Sept 2019
CUMCM “Higher Education Press Cup” Winner
Guiyang No.1 High School
High School • Sept 2012 – Jun 2015
Science Experimental Class
PROFESSIONAL SERVICE
  • Reviewer: IEEE TPAMI, TIP, TNNLS, TMM, TCSVT, TII, CVPR, ICCV, NeurIPS, AAAI, ICRA, IROS, MICCAI
  • Member: IEEE, China Society of Image and Graphics (CSIG), Chinese Society for Stereology (CSS), China Graphics Society (CGS)
  • Organizer: CVPR 2025 Embodied AI Workshop: Social Mobile Manipulation Challenge