Xuecheng Wu

01s | Multi-modal Learning
Xi'an City, China
Xi'an Jiaotong University

✨About Me

I'm currently a second-year master's student majoring in computer technology at the School of Computer Science and Technology, Xi'an Jiaotong University (XJTU), supervised by Prof. Heli Sun. My research interests lie in deep learning and multimedia computing, primarily focusing on large-scale video understanding, multi-modal large language models (MLLMs), and misinfo detection.

Prior to that, I received my B.E. degree from the School of Cyber Science and Engineering, Zhengzhou University, where I worked closely with Prof. Junxiao Xue (Zhejiang Lab) and Prof. Lei Shi.

Please feel free to contact me if you are interested in my work and want to explore potential collaborations 🙌.

  • Video Understanding: multi-modal self-supervised learning
  • Multi-modal Large Language Models (MLLMs): human-centric, audio-visual joint modeling
  • Misinfo Detection: deepfake and AIGC detection

👀News

  • [10/2024] Served as a reviewer for AISTATS'25.
  • [08/2024] Served as a reviewer for ICLR'25.
  • [07/2024] FineCLIPER and CREST are accepted to MM'24!
  • [05/2024] Served as a reviewer for NeurIPS'24.
  • [02/2024] Served as a reviewer for ECCV'24.
  • [01/2024] Served as a reviewer for MM'24.
  • [01/2024] UrbanCLIP is accepted to WWW'24!

Experience

Research Intern | UCF
Time: 7/2024 - Present. Advisor: Prof. Ser-Nam Lim

Research Intern | HKGAI
Time: 5/2024 - 8/2024. Mentor: Prof. Wenhan Luo

Research Intern | DianLab, NPU
Time: 6/2023 - 8/2024. Advisor: Prof. Dian Shao

Publications

Beyond Uncertainty: Evidential Deep Learning for Robust Video Temporal Grounding
Kaijing Ma*, Haojian Huang*, Jin Chen*, Haodong Chen, Pengliang Ji, Xianghao Zang, Han Fang, Chao Ban, Hao Sun, Mulin Chen, Xuelong Li
arXiv, 2024
[Arxiv] [Project] [Code]

GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting
Haodong Chen, Yongle Huang, Haojian Huang, Xiangsheng Ge, Dian Shao
arXiv, 2024
[Arxiv] [Project] [Code]

FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs
Haodong Chen, Haojian Huang, Junhao Dong, Mingzhe Zheng, Dian Shao
ACM International Conference on Multimedia (MM), 2024
[Paper] [Arxiv] [Project]

CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning
Haojian Huang, Xiaozhen Qiao, Zhuo Chen, Haodong Chen, Bingyu Li, Zhe Sun, Mulin Chen, Xuelong Li
ACM International Conference on Multimedia (MM), 2024
[Paper] [Arxiv] [Code]

UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web
Yibo Yan, Haomin Wen, Siru Zhong, Wei Chen, Haodong Chen, Qingsong Wen, Roger Zimmermann, Yuxuan Liang
ACM International World Wide Web Conference (WWW), 2024
[Paper] [Arxiv] [Video] [Code]
Oral Presentation

Awards & Honors

  • Outstanding University Student of NPU, 2024.
  • Innovation and Entrepreneurship Advanced Individual Honor, NPU, 2024.
  • School Scholarship, NPU, 2023&2024.
  • University Student Innovation Fund, Ministry of Education of P.R. China, 2023.
  • Academic Advancement Individual Honor, NPU, 2023.

Services

  • Conference Reviewer:
     International Conference on Learning Representations (ICLR)
     Neural Information Processing Systems (NeurIPS)
     European Conference on Computer Vision (ECCV)
     International Conference on Artificial Intelligence and Statistics (AISTATS)
     ACM International Conference on Multimedia (MM)