Publications

2025

  1. Preprint
    A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities
    Liu, Yihao, Cao, Xu, Chen, Tingting, Jiang, Yankai, You, Junjie, Wu, Minghua, Wang, Xiaosong, Feng, Mengling, Jin, Yaochu, and Chen, Jintai
    arXiv preprint arXiv:2501.07468 2025

2024

  1. ML4H
    MpoxVLM: A Vision-Language Model for Diagnosing Skin Lesions from Mpox Virus Infection
    Cao, Xu, Ye, Wenqian, Moise, Kenny, and Coffee, Megan
    In ML4H 2024
  2. EMNLP Findings
    Learning Autonomous Driving Tasks via Human Feedbacks with Large Language Models
    Ma, Yunsheng, Cao, Xu, Ye, Wenqian, Cui, Can, Mei, Kai, and Wang, Ziran
    In EMNLP 2024
  3. Preprint
    Towards social AI: A survey on understanding social interactions
    Lee, Sangmin, Li, Minzhi, Lai, Bolin, Jia, Wenqi, Ryan, Fiona, Cao, Xu, Kara, Ozgur, Boote, Bikram, Shi, Weiyan, Yang, Diyi, and others,
    arXiv preprint arXiv:2409.15316 2024
  4. CVPR
    MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding
    Cao, Xu, Zhou, Tong, Ma, Yunsheng, Ye, Wenqian, Cui, Can, Tang, Kun, Cao, Zhipeng, Liang, Kaizhao, Wang, Ziran, Rehg, James M, and others,
    In CVPR 2024
  5. CVPR
    Lampilot: An open benchmark dataset for autonomous driving with language model programs
    Ma, Yunsheng, Cui, Can, Cao, Xu, Ye, Wenqian, Liu, Peiran, Lu, Juanwu, Abdelraouf, Amr, Gupta, Rohit, Han, Kyungtae, Bera, Aniket, and others,
    In CVPR 2024
  6. WACVW
    A survey on multimodal large language models for autonomous driving
    Cui, Can, Ma, Yunsheng, Cao, Xu, Ye, Wenqian, Zhou, Yang, Liang, Kaizhao, Chen, Jintai, Lu, Juanwu, Yang, Zichong, Liao, Kuei-Da, and others,
    In WACV Workshop 2024
  7. WACVW
    Drive as you speak: Enabling human-like interaction with large language models in autonomous vehicles
    Cui, Can, Ma, Yunsheng, Cao, Xu, Ye, Wenqian, and Wang, Ziran
    In WACV Workshop 2024
  8. WACV
    MACP: Efficient model adaptation for cooperative perception
    Ma, Yunsheng, Lu, Juanwu, Cui, Can, Zhao, Sicheng, Cao, Xu, Ye, Wenqian, and Wang, Ziran
    In WACV 2024

2023

  1. ICASSP
    Vitasd: Robust vision transformer baselines for autism spectrum disorder facial diagnosis
    Cao, Xu, Ye, Wenqian, Sizikova, Elena, Bai, Xue, Coffee, Megan, Zeng, Hongwu, and Cao, Jianguo
    In ICASSP 2023
  2. JCPP
    Commentary: Machine learning for autism spectrum disorder diagnosis–challenges and opportunities
    Cao, Xu, and Cao, Jianguo
    Journal of Child Psychology and Psychiatry 2023
  3. UAI
    Mitigating transformer overconfidence via Lipschitz regularization
    Ye, Wenqian, Ma, Yunsheng, Cao, Xu, and Tang, Kun
    In UAI 2023
  4. AAAI Oral in IAAI
    THMA: Tencent hd map ai system for creating hd map annotations
    Tang, Kun, Cao, Xu, Cao, Zhipeng, Zhou, Tong, Li, Erlong, Liu, Ao, Zou, Shengtao, Liu, Chang, Mei, Shuqi, Sizikova, Elena, and others,
    In AAAI 2023

2022

  1. IJCAI Oral
    Aggpose: Deep aggregation vision transformer for infant pose estimation
    Cao, Xu, Li, Xiaoye, Ma, Liya, Huang, Yi, Feng, Xuan, Chen, Zening, Zeng, Hongwu, and Cao, Jianguo
    In IJCAI 2022