Xuecheng Wu's Homepage

✨About Me

I'm currently a final-year master's student majoring in computer technology at the School of Computer Science and Technology, Xi'an Jiaotong University (XJTU), supervised by Prof. Heli Sun. My research interests lie in deep learning and multimedia computing, primarily focusing on large-scale self-supervised video understanding, multi-modal large language models (MLLMs), and misinformation detection (Deepfake & AIGC).

Prior to that, I received my B.E. degree (with honors) from the School of Cyber Science and Engineering, Zhengzhou University (ZZU), where I worked closely with Prof. Junxiao Xue (PI, Zhejiang Lab) and Prof. Lei Shi (Vice Dean). As a student PI, I also led the eMotionAI Lab of the Zhengzhou University Students Innovative Entrepreneurial Base (North Campus) from 2021 to 2023.

My CSDN technology blog posts can be found HERE.

Please feel free to contact me if you are interested in my work and would like to explore potential collaborations 🙌.

Video Understanding: Multi-modal self-supervised learning
Multi-modal Large Language Models (MLLMs): Human-centric, Unified understanding and generation, and CoT reasoning
Misinfo Detection: Deepfake and AIGC detection

📢News

[09/2025] One paper is submitted to IEEE TGRS.

[09/2025] One paper is accepted by IEEE TCSS!

[08/2025] One paper is accepted by EMNLP'25 Main Conference!

[08/2025] Eight papers are submitted to AAAI'26.

[07/2025] One paper is accepted by MM'25 SVC Workshop!

[07/2025] One paper is submitted to IEEE TCSVT.

[07/2025] One paper is submitted to MM'25 SVC Workshop.

[07/2025] One paper is submitted to IEEE TCSVT.

[07/2025] Two papers are accepted by ACM MM'25!

[06/2025] One paper is accepted by IEEE SMC'25!

[06/2025] One paper is submitted to Big Data Mining and Analytics.

[06/2025] Served as a reviewer for EMNLP'25.

[06/2025] One paper is submitted to ACM MM'25 Grand Challenge.

[05/2025] One paper is submitted to Intelligent Computing.

[05/2025] One paper is submitted to EMNLP'25.

[05/2025] Four papers are submitted to NeurIPS'25.

[05/2025] Served as a reviewer for ACM MM'25.

[04/2025] One paper is accepted by Big Data Mining and Analytics!

[04/2025] Three papers are accepted by ACM ICMR'25!

[04/2025] Five papers are submitted to ACM MM'25.

[04/2025] One paper is accepted by CVPR'25 NTIRE Challenge!

[04/2025] Three papers are accepted by IJCNN'25!

[03/2025] One paper is submitted to IEEE SMC'25.

[03/2025] One paper is submitted to CVPR'25 NTIRE Challenge.

[03/2025] One paper is submitted to ICCV'25🎈.

[02/2025] Two papers are accepted by CVPR'25!

[02/2025] Three papers are submitted to ICMR'25.

[01/2025] One paper is submitted to IJCAI'25🎈.

[01/2025] Three papers are submitted to IJCNN'25.

[12/2024] One paper is submitted to ACL'25.

[12/2024] Three papers are submitted to ICME'25🎈.

[11/2024] Five papers are submitted to CVPR'25.

[11/2024] One paper is submitted to Big Data Mining and Analytics.

[11/2024] Served as a reviewer for CVPR'25.

[10/2024] Served as a reviewer for WWW'25.

[10/2024] One paper is submitted to WWW'25🎈.

[09/2024] One paper is submitted to ICASSP'25🎈.

[08/2024] One paper is submitted to AAAI'25🎈.

[08/2024] One paper is accepted by Big Data Mining and Analytics!

[07/2024] One paper is accepted by ACM MM'24!

[05/2024] Served as a reviewer for NeurIPS'24.

[03/2024] Served as a reviewer for ACM MM'24.

😘Experience

Research Intern | Data-Douyin, ByteDance
Period: 04/2025 - Present. Mentor: Dingkang Yang & Xiao Liang

Research Intern | Multi-modal Evaluation Group, Meituan-M17
Period: 01/2025 - 05/2025. Mentor: Jiaxing Liu & Xiaoyu Li

Member | Data Intelligence and Social Governance Lab, Xi'an Jiaotong University
Period: 09/2023 - Present. Supervisor: Prof. Heli Sun

Visiting Student | State Key Laboratory of Communication Content Cognition
Period: 10/2023 - 10/2024. Supervisor: Prof. Heli Sun

Student PI | eMotionAI Lab, Zhengzhou University
Period: 06/2021 - 06/2023. Advisor: Prof. Junxiao Xue

Research Assistant | Machine Vision Lab, Zhengzhou University
Period: 06/2021 - 09/2021. Supervisor: Prof. Jianhong Ma

Research Assistant | Computational Learning Lab, Zhengzhou University
Period: 09/2020 - 06/2023. Supervisor: Prof. Junxiao Xue

😍 Selected Works

AVF-MAE++: Scaling Affective Video Facial Masked Autoencoders via Efficient Audio-Visual Self-Supervised Learning
Xuecheng Wu, Heli Sun^†, Yifan Wang, Jiayu Nie, Jie Zhang, Yabing Wang, Junxiao Xue, Liang He
IEEE/CVF CVPR, 2025
[Paper] [Code] [Poster]

Towards Emotion Analysis in Short-form Videos: A Large-Scale Dataset and Baseline
Xuecheng Wu, Heli Sun^†, Junxiao Xue, Jiayu Nie, Xiangyan Kong, Ruofan Zhai, Liang He
ACM ICMR, 2025
[Paper] [Code]

JTD-UAV: MLLM-Enhanced Joint Tracking and Description Framework for Anti-UAV Systems
Yifan Wang*, Jian Zhao*, Zhaoxin Fan^†, Xin Zhang, Xuecheng Wu, Yudian Zhang, Lei Jin, Xinyue Li, Gang Wang^†, Mengxi Jia, Ping Hu, Zheng Zhu, Xuelong Li
IEEE/CVF CVPR, 2025
[Paper] [Poster]

ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations
Xuecheng Wu*, Jiaxing Liu*, Heli Sun^†, Danlei Huang, Xiaoyu Li^†, Yifan Wang, Chen Chen, Liya Ma, Xuezhi Cao, Junxiao Xue, Liang He
arXiv, 2025
[Paper] [Dataset]

LLaVA-World: Benchmarking and Enhancing Fine-Grained Open-World Knowledge Understanding for MLLMs
Yifan Wang*, Xuecheng Wu*, Yuhao Dong, Zuyan Liu, Jia Zhang, Qi Zhang, Winston Hu, Yongming Rao^† (*: Equal Contribution.)
Under Review, 2025

3A-YOLO: New Real-time Object Detectors with Triple Discriminative Awareness and Coordinated Representations
Xuecheng Wu*, Junxiao Xue*^†, Liangyu Fu, Jiayu Nie, Danlei Huang, Xinyi Yin
IEEE SMC, 2025
[Paper]

Magnifier: A Pluggable Framework for Enhanced High-Resolution Image Comprehension in Multi-modal Large Language Models
Yifan Wang, Yunfei Wu, Xin Li^†, Xuecheng Wu, Wentao Zhang, Haoyu Cao, Yinsong Liu, Deqiang Jiang, Xing Sun, Feiyue Huang^†
Under Review, 2025

TokenFocus-VQA: Enhancing Text-to-Image Evaluation with Position-Specific Probability Loss and Multi-Perspective Aggregations on LVLMs
Zijian Zhang, Xunhui Zheng, Xuecheng Wu, Chong Peng^†, Xuezhi Cao
IEEE/CVF CVPRW, 2025
[Paper]

HOLA: Enhancing Audio-visual Deepfake Detection via Hierarchical Contextual Aggregations and Efficient Pre-training
Xuecheng Wu, Heli Sun^†, Danlei Huang, Xinyi Yin, Yifan Wang, Hao Wang, Jia Zhang, Fei Wang, Peihao Guo, Suyu Xing, Junxiao Xue, Liang He
ACM MM, 2025

HKD4VLM: A Progressive Hybrid Knowledge Distillation Framework for Robust Multimodal Hallucination and Factuality Detection in VLMs
Zijian Zhang*, Xuecheng Wu*, Danlei Huang, Siyu Yan, Chong Peng^†, Xuezhi Cao (*: Equal Contribution.)
ACM MM, 2025

Building Robust Video-Level Deepfake Detection via Audio-Visual Local-Global Interactions
Yifan Wang*, Xuecheng Wu*, Jia Zhang, Mohan Jing, Keda Lu, Jun Yu^†, Wen Su, Fang Gao, Qingsong Liu, Jianqing Sun, Jiaen Liang (*: Equal Contribution and Random Order.)
ACM International Conference on Multimedia (MM), 2024
[Paper]

DDSE: A Decoupled Dual-Stream Enhanced Framework for Multimodal Sentiment Analysis with Text-Centric SSM
Shenjie Jiang, Zhuoyu Wang, Xuecheng Wu, Hongru Ji, Mingxin Li, Xianghua Li, Chao Gao
ACM MM, 2025

A Trustworthy Method for Multimodal Emotion Recognition
Junxiao Xue, Xiaozhen Liu^†, Jie Wang, Xuecheng Wu, Bin Wu
Big Data Mining and Analytics, 2025

LR-Doc: Benchmarking and Advancing Long Document Reasoning in MLLMs with Learned Priors
Yifan Wang*, Xuecheng Wu*, Danlei Huang, Zhaoxin Fan^†, Xinyi Yin, Tingqi Hu, Yang Xiao, Zhe Gao, Jun Xie, Xin Fu, Liang Xie^†
Under Review, 2025

MM-AntiUAV: A Comprehensive Benchmark for Multi-UAV Tracking and Intent Recognition
Yifan Wang*, Jian Zhao*, Xuecheng Wu, Xin Zhang, Danlei Huang, Zhaoxin Fan^†, Gang Wang^†, Lei Jin, Jianan Li, Xuelong Li
Under Review, 2025

DSACap: Enhancing Visual-Semantic Alignment with Diffusion-based Framework for Image Captioning
Liangyu Fu, Junbo Wang, Yuke Li, Qiangguo Jin, Hongsong Wang, Ya Jing, Linjiang Huang, Liang Yao, Jiangbin Zheng, Xuecheng Wu, Zhiyong Wang
ACM MM, 2025

Affective Video Content Analysis: Decade Review and New Perspectives
Junxiao Xue, Jie Wang^†, Xiaozhen Liu, Qian Zhang, Xuecheng Wu
Big Data Mining and Analytics, 2024
[Paper]

PTSR: A Unified Patch Tokenization, Selection and Representation Framework for Efficient Micro-expression Recognition
Liangyu Fu, Junbo Wang, Qiangguo Jin, Yining Zhu, Hongsong Wang, Yuke Li, Xuecheng Wu, Zhiyong Wang^†
ACM ICMR, 2025

TACR-YOLO: A Real-time Detection Framework for Abnormal Human Behaviors Enhanced with Coordinate and Task-Aware Representations
Xinyi Yin, Wenbo Yuan, Xuecheng Wu^†, Liangyu Fu, Danlei Huang
IJCNN, 2025

InfoSyncNet: Information Synchronization Temporal Convolutional Network for Visual Speech Recognition
Junxiao Xue, Xiaozhen Liu^†, Xuecheng Wu, Fei Yu, Jun Wang
IJCNN, 2025

FAMNet: Integrating 2D and 3D Features for Micro-expression Recognition via Multi-task Learning and Hierarchical Attention
Liangyu Fu, Xuecheng Wu^†, Danlei Huang, Xinyi Yin
IJCNN, 2025

EPIR: An Efficient Patch Tokenization, Integration and Representation Framework for Micro-expression Recognition
Liangyu Fu, Junbo Wang, Yuke Li, Yining Zhu, Hongsong Wang, Xuecheng Wu, Kun Hu
IEEE TCSVT'25, Under Review

A Method on Mask Wearing Detection of Natural Population Based on Improved YOLOv4
Junxiao Xue*, Xuecheng Wu*, Shihao Wang, Mengmeng Tian, Lei Shi^†
Journal of Zhengzhou University (Engineering Science), 2022
[Paper]

MirrorDiff: Learning Mirror Diffusion for Image Captioning via Regeneration
Junbo Wang, Liangyu Fu, Yining Zhu, Qiangguo Jin, Hongsong Wang, Yuke Li, Xuecheng Wu, Kun Hu^†
ACM ICMR, 2025

ICVNet: A Method on Cross-modal Fusion of Short Video Emotion Recognition
Junxiao Xue*, Xuecheng Wu*, Qian Zhang, Mengmeng Tian, Lanhang Zhai, Lei Shi^†
Chinese Journal of Ergonomics, 2022
[Paper]

🎉Awards

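2025 ACM Multimedia Conference, Deepfakes1M++ Global Challenge: Champion
2025 ACM Multimedia Conference, Responsible AI Global Challenge: Champion
2025 National College Student Software Innovation Contest: First Prize (Northwest Division) & National Third Prize
2025 CVPR'25 NTIRE Global Challenge on Text-to-Image Generation Model Quality Assessment: Runner-up
2024 19th "Challenge Cup" National College Student Extracurricular Academic Science and Technology Works Competition, "Open Bidding" (Jiebang Guashuai) Special Competition: National First Prize
2024 "China Cyber Valley · Huawei Cup" China Postgraduate Cybersecurity Innovation Competition: National Third Prize
2024 ACM Multimedia Conference, Deepfakes1M Global Challenge: Champion
2024 "Huawei Cup" 6th China Postgraduate Artificial Intelligence Innovation Competition: National Third Prize
2024 15th China College Students' Service Outsourcing Innovation and Entrepreneurship Competition: Provincial Second Prize & National Third Prize
2024 China Collegiate Computing Contest, Network Technology Challenge: Provincial Second Prize
2022 15th National College Student Information Security Contest, Works Competition: National Third Prize
2022 National College Student Internet of Things Design Competition: Provincial First Prize & National Second Prize
2022 China Collegiate Computing Contest, Network Technology Challenge: Provincial Second Prize & National Third Prize
2022 College Student Innovation and Entrepreneurship Training Program: Key Innovation Project of the Henan Provincial Department of Education and Zhengzhou University (Project Leader)
2022 China College Student Computer Design Competition: Provincial Third Prize
2021 National College Student Internet of Things Design Competition: Provincial First Prize & National Third Prize
2021 China Collegiate Computing Contest, Network Technology Challenge: Provincial Third Prize
2021 China College Student Computer Design Competition: Provincial Third Prize
2021 17th "Challenge Cup" College Student Extracurricular Academic Science and Technology Works Competition: Second Prize (Zhengzhou University level)
2020 National English Competition for College Students (NECCS): Provincial Excellence Award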

💖Honors

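Xi'an Jiaotong University Special-Grade Graduate Scholarship, 2023-2024 academic year
Xi'an Jiaotong University Outstanding Graduate Student, 2023-2024 academic year
Xi'an Jiaotong University-Inspur Group Outstanding Graduate Academic Scholarship, 2023-2024 academic year
China College Students' Self-Improvement Star, 2022-2023 (the only awardee from Zhengzhou University that year; ranked first in Henan Province) [Link]
Xi'an Jiaotong University First-Class Scholarship for New Graduate Students, 2023 cohort
Zhengzhou University Outstanding Undergraduate Graduation Thesis, Class of 2023 (Top 1.4%)
Zhengzhou University Outstanding Graduate, 2023
Top Ten Outstanding Graduates of the School of Cyber Science and Engineering, Zhengzhou University, 2023
Zhengzhou University First-Class Academic Scholarship, 2022-2023 academic year
Zhengzhou University Campus Star, 2022
Zhengzhou University "Mixue Bingcheng" Scholarship, 2022-2023 academic year
Zhengzhou University Merit Student, 2021-2022 academic year
Zhengzhou University Outstanding Student Leader, 2021-2022 academic year
Ministry of Education-Huawei "Intelligent Base" Scholarship, 2022
National Encouragement Scholarship, 2021-2022 academic year
Henan Province University Model Dormitory, 2021 (Top 0.2%, dormitory head)
National Encouragement Scholarship, 2020-2021 academic year
Zhengzhou University Merit Student, 2020-2021 academic year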

🌹Academic Services

Conference Reviewer for
1. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
2. IEEE/CVF International Conference on Computer Vision (ICCV)
3. The Annual Conference on Neural Information Processing Systems (NeurIPS)
4. ACM The Web Conference (WWW)
5. CAAI International Conference on Artificial Intelligence (CICAI)
6. ACM International Conference on Multimedia (MM)
7. IEEE International Conference on Multimedia & Expo (ICME)
8. IEEE BigData
9. International Joint Conference on Neural Networks (IJCNN)
10. IEEE International Conference on Advanced Visual and Signal-Based Systems (AVSS)
11. IEEE International Conference on Systems, Man, and Cybernetics (SMC)
12. Empirical Methods in Natural Language Processing (EMNLP)
13. The AAAI Conference on Artificial Intelligence (AAAI)
Journal Reviewer for
1. IEEE Transactions on Multimedia (TMM)
2. Knowledge-based Systems (KBS)
3. IEEE Transactions on Knowledge and Data Engineering (TKDE)
4. Intelligent Computing
5. ACM Transactions on Multimedia Computing Communications and Applications (TOMM)

🥳Kind Assistance

2024 Year Entrance

2025 Year Entrance

2026 Year Entrance