2024
  • Yufei Cai, Yuxiang Wei, Zhilong Ji, Jinfeng Bai, Hu Han, Wangmeng Zuo. Decoupled Textual Embeddings for Customized Image Generation. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), pp. 909-917, Vancouver, Canada, Feb. 20–27, 2024. PDF
  • Churan Zhi, Junbao Zhuo, Shuhui Wang. Confusing Pair Correction Based on Category Prototype for Domain Adaptation under Noisy Environments. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), pp. 17060-17068, Vancouver, Canada, Feb. 20–27, 2024. PDF
  • Hanxuan Li, Bin Fu, Ruiping Wang, Xilin Chen. Point2Real: Bridging the Gap between Point Cloud and Realistic Image for Open-World 3D Recognition. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), pp. 3055-3063, Vancouver, Canada, Feb. 20-27, 2024. PDF
  • Yifan Yang, Yuecong Min, Xilin Chen. S2Net: Skeleton-aware SlowFast Network for Efficient Sign Language Recognition. Proceedings of the Asian Conference on Computer Vision (ACCV), pp. 355-372, Hanoi, Vietnam, Dec. 8-12, 2024. PDF
  • Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang. Context-aware Difference Distilling for Multi-change Captioning. Annual Meeting of the Association for Computational Linguistics (ACL), pp. 7941-7956, Bangkok, Thailand, Aug. 11–16, 2024. PDF
  • Yiming Cui, Liang Li, Jiehua Zhang, Chenggang Yan, Hongkui Wang, Shuai Wang, Jin Heng, Wu Li. Stochastic Context Consistency Reasoning for Domain Adaptive Object Detection. ACM Conference on Multimedia (ACM MM), pp. 1331-1340, Melbourne, Australia, Oct. 28-Nov 1, 2024. PDF
  • Henglei Lv, Jiayu Xiao, Liang Li. Pick-and-Draw: Training-free Semantic Guidance for Text-to-Image Personalization. ACM Conference on Multimedia (ACM MM), pp. 10535- 10543, Melbourne, Australia, Oct. 28-Nov 1, 2024. PDF
  • Zhedong Zhang, Liang Li, Gaoxiang Cong, Haibing Yin, Yuhan Gao, Chenggang Yan, Anton van den Hengel, Yuankai Qi. From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning. ACM Conference on Multimedia (ACM MM), pp. 7523-7532, Melbourne, Australia, Oct. 28-Nov 1, 2024. PDF
  • Yijia Wang, Qianqian Xu, Yangbangyan Jiang, Siran Dai, Qingming Huang. Regularized Contrastive Partial Multi-view Outlier Detection. ACM Conference on Multimedia (ACM MM), pp. 8711-8720, Melbourne, Australia, Oct. 28-Nov 1, 2024. PDF
  • Kenan Huang, Junbao Zhuo, Shuhui Wang, Chi Su, Qingming Huang, Huimin Ma. Unsupervised Image-to-Video Adaptation via Category-aware Flow Memory Bank and Realistic Video Generation. ACM Conference on Multimedia (ACM MM), pp. 8795-8804, Melbourne, Australia, Oct. 28-Nov 1, 2024. PDF
  • Yang Liu, Qianqian Xu, Peisong Wen, Siran Dai, Qingming Huang. Not All Pairs are Equal: Hierarchical Learning for Average-Precision-Oriented Video Retrieval. ACM Conference on Multimedia (ACM MM), pp. 3828-3837, Melbourne, Australia, Oct. 28-Nov 1, 2024. PDF
  • Junwei He, Qianqian Xu, Yangbangyan Jiang, Zitai Wang, Yuchen Sun, Qingming Huang. HGOE: Hybrid External and Internal Graph Outlier Exposure for Graph Out-of-Distribution Detection. ACM Conference on Multimedia (ACM MM), pp. 1544-1553, Melbourne, Australia, Oct. 28-Nov 1, 2024. PDF
  • Xuhan Zhu, Yifei Xing, Ruiping Wang, Yaowei Wang, Xiangyuan Lan. Calibration for Long-tailed Scene Graph Generation. ACM Conference on Multimedia (ACM MM), pp. 3037-3046, Melbourne, Australia, Oct. 28-Nov.1, 2024. PDF
  • Xuhan Zhu, Yifei Xing, Ruiping Wang, Yaowei Wang, Xiangyuan Lan. Hierarchical Prompt Learning for Scene Graph Generation. British Machine Vision Conference (BMVC), Glasgow, UK, Nov. 25-28, 2024. PDF
  • Bin Fu, Qiyang Wan, Jialin Li, Ruiping Wang, Xilin Chen. Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models. British Machine Vision Conference (BMVC), Glasgow, UK, Nov. 25-28, 2024. PDF
  • Chuyan Xiong, Chengyu Shen, Xiaoqi Li, Kaichen Zhou, Jiaming Liu, Ruiping Wang, Hao Dong. Autonomous Interactive Correction MLLM for Robust Robotic Manipulation. Annual Conference on Robot Learning (CoRL), Munich, Germany, Nov. 6-9, 2024. PDF
  • Zihan Wang, Xiangyang Li, Jiahao Yang, Yeqi Liu, Shuqiang Jiang. Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation. Conference on Robot Learning (CRL), Munich, Germany, Nov. 6-9, 2024. PDF
  • Zonghui Guo, Xinyu Han, Jie Zhang, Shiguang Shan, Haiyong Zheng. Video Harmonization with Triplet Spatio-Temporal Variation Patterns. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 19177-19186, Seattle WA, USA, Jun. 17-21, 2024. PDF
  • Sibo Wang, Jie Zhang, Zheng Yuan, Shiguang Shan. Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 24502-24511, Seattle WA, USA, Jun. 17-21, 2024. PDF
  • Xiaohan Wang, Yuehu Liu, Xinhang Song, Yuyi Liu, Sixian Zhang, Shuqiang Jiang. An Interactive Navigation Method with Effect-oriented Affordance. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 16446-16456, Seattle WA, USA, Jun. 17-21, 2024. PDF