Yufei Cai, Yuxiang Wei, Zhilong Ji, Jinfeng Bai, Hu Han, Wangmeng Zuo. Decoupled Textual Embeddings for Customized Image Generation. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), pp. 909-917, Vancouver, Canada, Feb. 20–27, 2024.
PDF
Churan Zhi, Junbao Zhuo, Shuhui Wang. Confusing Pair Correction Based on Category Prototype for Domain Adaptation under Noisy Environments. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), pp. 17060-17068, Vancouver, Canada, Feb. 20–27, 2024.
PDF
Hanxuan Li, Bin Fu, Ruiping Wang, Xilin Chen. Point2Real: Bridging the Gap between Point Cloud and Realistic Image for Open-World 3D Recognition. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), pp. 3055-3063, Vancouver, Canada, Feb. 20-27, 2024.
PDF
Yifan Yang, Yuecong Min, Xilin Chen. S2Net: Skeleton-aware SlowFast Network for Efficient Sign Language Recognition. Proceedings of the Asian Conference on Computer Vision (ACCV), pp. 355-372, Hanoi, Vietnam, Dec. 8-12, 2024.
PDF
Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang. Context-aware Difference Distilling for Multi-change Captioning. Annual Meeting of the Association for Computational Linguistics (ACL), pp. 7941-7956, Bangkok, Thailand, Aug. 11–16, 2024.
PDF
Yiming Cui, Liang Li, Jiehua Zhang, Chenggang Yan, Hongkui Wang, Shuai Wang, Jin Heng, Wu Li. Stochastic Context Consistency Reasoning for Domain Adaptive Object Detection. ACM Conference on Multimedia (ACM MM), pp. 1331-1340, Melbourne, Australia, Oct. 28-Nov 1, 2024.
PDF
Henglei Lv, Jiayu Xiao, Liang Li. Pick-and-Draw: Training-free Semantic Guidance for Text-to-Image Personalization. ACM Conference on Multimedia (ACM MM), pp. 10535- 10543, Melbourne, Australia, Oct. 28-Nov 1, 2024.
PDF
Zhedong Zhang, Liang Li, Gaoxiang Cong, Haibing Yin, Yuhan Gao, Chenggang Yan, Anton van den Hengel, Yuankai Qi. From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning. ACM Conference on Multimedia (ACM MM), pp. 7523-7532, Melbourne, Australia, Oct. 28-Nov 1, 2024.
PDF
Kenan Huang, Junbao Zhuo, Shuhui Wang, Chi Su, Qingming Huang, Huimin Ma. Unsupervised Image-to-Video Adaptation via Category-aware Flow Memory Bank and Realistic Video Generation. ACM Conference on Multimedia (ACM MM), pp. 8795-8804, Melbourne, Australia, Oct. 28-Nov 1, 2024.
PDF
Yang Liu, Qianqian Xu, Peisong Wen, Siran Dai, Qingming Huang. Not All Pairs are Equal: Hierarchical Learning for Average-Precision-Oriented Video Retrieval. ACM Conference on Multimedia (ACM MM), pp. 3828-3837, Melbourne, Australia, Oct. 28-Nov 1, 2024.
PDF
Junwei He, Qianqian Xu, Yangbangyan Jiang, Zitai Wang, Yuchen Sun, Qingming Huang. HGOE: Hybrid External and Internal Graph Outlier Exposure for Graph Out-of-Distribution Detection. ACM Conference on Multimedia (ACM MM), pp. 1544-1553, Melbourne, Australia, Oct. 28-Nov 1, 2024.
PDF
Xuhan Zhu, Yifei Xing, Ruiping Wang, Yaowei Wang, Xiangyuan Lan. Calibration for Long-tailed Scene Graph Generation. ACM Conference on Multimedia (ACM MM), pp. 3037-3046, Melbourne, Australia, Oct. 28-Nov.1, 2024.
PDF
Xuhan Zhu, Yifei Xing, Ruiping Wang, Yaowei Wang, Xiangyuan Lan. Hierarchical Prompt Learning for Scene Graph Generation. British Machine Vision Conference (BMVC), Glasgow, UK, Nov. 25-28, 2024.
PDF
Bin Fu, Qiyang Wan, Jialin Li, Ruiping Wang, Xilin Chen. Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models. British Machine Vision Conference (BMVC), Glasgow, UK, Nov. 25-28, 2024.
PDF
Zihan Wang, Xiangyang Li, Jiahao Yang, Yeqi Liu, Shuqiang Jiang. Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation. Conference on Robot Learning (CRL), Munich, Germany, Nov. 6-9, 2024.
PDF
Zonghui Guo, Xinyu Han, Jie Zhang, Shiguang Shan, Haiyong Zheng. Video Harmonization with Triplet Spatio-Temporal Variation Patterns. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 19177-19186, Seattle WA, USA, Jun. 17-21, 2024.
PDF
Sibo Wang, Jie Zhang, Zheng Yuan, Shiguang Shan. Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 24502-24511, Seattle WA, USA, Jun. 17-21, 2024.
PDF
Xiaohan Wang, Yuehu Liu, Xinhang Song, Yuyi Liu, Sixian Zhang, Shuqiang Jiang. An Interactive Navigation Method with Effect-oriented Affordance. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 16446-16456, Seattle WA, USA, Jun. 17-21, 2024.
PDF