2025
  • Yujie Wang, Yunwei Zhao, Jing Yang, Han Han, Shiguang Shan, Jie Zhang. Evaluating Cognitive-Behavioral Fixation via Multimodal User Viewing Patterns on Social Media. The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP), Suzhou, China, Nov. 4-9, 2025. PDF
  • Jiadong Pan, Liang Li, Hongcheng Gao, Zhengjun Zha, Qingming Huang, Jiebo Luo. SafeCFG: Controlling Harmful Features with Dynamic Safe Guidance for Safe Generation. ACM International Conference on Multimedia (ACM MM), Dublin, Ireland, Oct. 27-31, 2025. PDF
  • Gaoxiang Cong, Liang Li, Jiadong Pan, Zhedong Zhang, Amin Beheshti, Anton van den Hengel, Yuankai Qi, Qingming Huang. FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing. ACM International Conference on Multimedia (ACM MM), Dublin, Ireland, Oct. 27-31, 2025. PDF
  • Mingquan Zhou, Chen He, Ruiping Wang, Xilin Chen. OV3D-CG: Open-vocabulary 3D Instance Segmentation with Contextual Guidance. IEEE/CVF International Conference on Computer Vision (ICCV), Honolulu, Hawaii, USA, Oct. 19-23, 2025. PDF
  • Yufei Cai, Hu Han, Yuxiang Wei, Shiguang Shan, Xilin Chen. EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models. IEEE/CVF International Conference on Computer Vision (ICCV), Honolulu, Hawaii, USA, Oct. 19-23, 2025. PDF
  • Xiaorong Qin, Xinhang Song, Sixian Zhang, Xinyao Yu, Xinmiao Zhang, Shuqiang Jiang. Learning on the Go: A Meta-learning Object Navigation Model. IEEE/CVF International Conference on Computer Vision (ICCV), Honolulu, Hawaii, USA, Oct. 19-23, 2025. PDF
  • Yuyi Liu, Xinhang Song, Tianliang Qi, Shuqiang Jiang. Trial-Oriented Visual Rearrangement. IEEE/CVF International Conference on Computer Vision (ICCV), Honolulu, Hawaii, USA, Oct. 19-23, 2025. PDF
  • Zhaoxin Yuan, Shuang Yang, Shiguang Shan, Xilin Chen. Not Only Vision: Evolve Visual Speech Recognition via Peripheral Information. IEEE/CVF International Conference on Computer Vision (ICCV), Honolulu, Hawaii, USA, Oct. 19-23, 2025. PDF
  • Feixiang Wang, Shuang Yang, Shiguang Shan, Xilin Chen. CogCM: Cognition-Inspired Contextual Modeling for Audio Visual Speech Enhancement. IEEE/CVF International Conference on Computer Vision (ICCV), Honolulu, Hawaii, USA, Oct. 19-23, 2025. PDF
  • Sixian Zhang, Xinyao Yu, Xinhang Song, Yiyao Wang, Shuqiang Jiang. Function-centric Bayesian Network for Zero-Shot Object Goal Navigation. IEEE/CVF International Conference on Computer Vision (ICCV), Honolulu, Hawaii, USA, Oct. 19-23, 2025. PDF
  • Zhuo Li*, Mingshuang Luo*, Ruibing Hou, Xin Zhao, Hao Liu, Hong Chang, Zimo Liu, Chen Li. Morph: A Motion-free Physics Optimization Framework for Human Motion Generation. IEEE/CVF International Conference on Computer Vision (ICCV), Honolulu, Hawaii, USA, Oct. 19-23, 2025. PDF
  • Jiahe Zhao, Ruibing Hou, Zejie Tian, Hong Chang, Shiguang Shan. HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding. IEEE/CVF International Conference on Computer Vision (ICCV), Honolulu, Hawaii, USA, Oct. 19-23, 2025. PDF
  • Mengdi Liu, Zhangyang Gao, Hong Chang, Ziqing Li, Shiguang Shan, Xilin Chen. G2PDiffusion: Cross-species Genotype-to-Phenotype Prediction via Evolutionary Diffusion. IEEE/CVF International Conference on Computer Vision (ICCV), Honolulu, Hawaii, USA, Oct. 19-23, 2025. PDF
  • Xinkuan Qiu, Meina Kan, Yongbin Zhou, Shiguang Shan. Benchmarking Multimodal Large Language Models Against Image Corruptions. IEEE/CVF International Conference on Computer Vision (ICCV), Honolulu, Hawaii, USA, Oct. 19-23, 2025. PDF
  • Zongyao Xue, Meina Kan, Shiguang Shan, Xilin Chen. Feature Decomposition-Recomposition in Large Vision-Language Model for Few-Shot Class-Incremental Learning. IEEE/CVF International Conference on Computer Vision (ICCV), Honolulu, Hawaii, USA, Oct. 19-23, 2025. PDF
  • Zhen Yang, Zhuo Tao, Qi Chen, Yuankai Qi, Anton van den Hengel, Qingming Huang, Liang Li. Separation of powers: On segregating knowledge from observation in LLM-enabled knowledge-based visual question answering. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, Jun. 11-15, 2025. PDF
  • Yue Wu, Zhaobo Qi, Junshu Sun, Yaowei Wang, Qingming Huang, Shuhui Wang. Video Language Model Pretraining with Spatio-temporal Masking. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, Jun. 11-15, 2025. PDF
  • Gaoxiang Cong, Jiadong Pan, Liang Li, Yuankai Qi, Yuxin Peng, Anton van den Hengel, Jian Yang, Qingming Huang. EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, Jun. 11-15, 2025. PDF
  • Zonghui Guo, Yingjie Liu, Jie Zhang, Haiyong Zheng, Shiguang Shan. Face Forgery Video Detection via Temporal Forgery Cue Unraveling. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, Jun. 11-15, 2025. PDF
  • Ziyi Bai, Hanxuan Li, Bin Fu, Chuyan Xiong, Ruiping Wang, Xilin Chen. R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, Jun. 11-15, 2025. PDF