Sen Nie, Jie Zhang, Jianxin Yan, Shiguang Shan, Xilin Chen. V-Attack: Targeting Disentangled Value Features for Controllable Adversarial Attacks on LVLMs. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Denver, Colorado, USA, Jun. 3-7, 2026. (Accepted)
PDF
Boyu Han, Qianqian Xu, Shilong Bao, Zhiyong Yang, Ruochen Cui, Xilin Zhao, Qingming Huang. Guiding Diffusion-based Reconstruction with Contrastive Signals for Balanced Visual Representation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Denver, Colorado, USA, Jun. 3-7, 2026. (Accepted)
PDF
Yiyao Wang, Sixian Zhang, Keming Zhang, Xinhang Song, Songjie Du, Shuqiang Jiang. TrajRAG: Retrieving Geometric-Semantic Experience for Zero-Shot Object Navigation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Denver, Colorado, USA, Jun. 3-7, 2026. (Accepted)
PDF
Sixian Zhang, Yiyao Wang, Xinhang Song, Keming Zhang, Zijian Xu, Shuqiang Jiang. Multi-Scale Gaussian-Language Map for Embodied Navigation and Reasoning. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Denver, Colorado, USA, Jun. 3-7, 2026. (Accepted)
PDF
Tiantian Dang, Chao Bi, Shufan Shen, Jinzhe Liu, Qingming Huang, Shuhui Wang. Locate-then-Sparsify: Attribution Guided Sparse Strategy for Visual Hallucination Mitigation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Denver, Colorado, USA, Jun. 3-7, 2026. (Accepted)
PDF
Xinkuan Qiu, Meina Kan, Zhenliang He, Yongbin Zhou, Shiguang Shan. Revisiting Visual Corruptions in LVLMs: A Shape–Texture Perspective on Model Failures. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Denver, Colorado, USA, Jun. 3-7, 2026. (Accepted)
PDF
Zuyan Zhao, Zhenliang He, Meina Kan, Shiguang Shan, Xilin Chen. UniPercept: A Unified Diffusion Model for Generalizable Visual Perception. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Denver, Colorado, USA, Jun. 3-7, 2026. (Accepted)
PDF
Keliang Li, Yansong Li, Hongze Shen, Mengdi Liu, Hong Chang, Shiguang Shan. LensWalk: Agentic Video Understanding by Planning How You See in Videos. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Denver, Colorado, USA, Jun. 3-7, 2026. (Accepted)
PDF
Jiahao Yang, Zihan Wang, Xiangyang Li, Xing Zhu, Yujun Shen, Yinghao Xu, Shuqiang Jiang. GA-VLN: Geometry-Aware BEV Representation for Efficient Vision-Language Navigation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Denver, Colorado, USA, Jun. 3-7, 2026. (Accepted)
PDF
Tianliang Qi, Xinhang Song, Yuyi Liu, Shuqiang Jiang. Rethinking Visual Rearrangement from A Diffusion Perspective. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Denver, Colorado, USA, Jun. 3-7, 2026. (Accepted)
PDF
Liang Xu, Shufan Shen, Qingming Huang, Yao Zhu, Xiangyang Ji, Shuhui Wang. Adaptive Nonlinear Compression for Large Foundation Models. International Conference on Learning Representations (ICLR), Rio de Janeiro, Brazil, Apr. 23-27, 2026. (Accepted)
PDF
Xiaolong Tang, Meina Kan, Shiguang Shan, Xilin Chen. Plan-R1: Safe and Feasible Trajectory Planning as Language Modeling. International Conference on Learning Representations (ICLR), Rio de Janeiro, Brazil, Apr. 23-27, 2026. (Accepted)
PDF
Jie Huang, Xuejing Liu, Sibo Song, Ruibing Hou, Hong Chang, Junyang Lin, Shuai Bai. Revisiting Multimodal Positional Encoding in Vision-Language Models. International Conference on Learning Representations (ICLR), Rio de Janeiro, Brazil, Apr. 23-27, 2026. (Accepted)
PDF
Keliang Li, Hongze Shen, Hao Shi, Ruibing Hou, Hong Chang, Jie Huang, Chenghao Jia, Wen Wang, Yiling Wu, Dongmei Jiang, Shiguang Shan, Xilin Chen. HumanPCR: Probing MLLM Capabilities in Diverse Human-Centric Scenes. International Conference on Learning Representations (ICLR), Rio de Janeiro, Brazil, Apr. 23-27, 2026. (Accepted)
PDF