2023
  • Peifeng Gao, Qianqian Xu, Peisong Wen, Huiyang Shao, Yuan He and Qingming Huang. Towards Decision-Friendly AUC: Learning Multi-Classifier with AUCµ. AAAI Conference on Artificial Intelligence, pp. 7633–7641, Washington, DC, USA, Feb. 7-14, 2023. PDF
  • Yiting Liu, Liang Li, Beichen Zhang, Shan Huang, Zheng-Jun Zha, Qingming Huang. MaTCR: Modality-Aligned Thought Chain Reasoning for Multimodal Task-Oriented Dialogue Generation. Proceedings of the 31st ACM International Conference on Multimedia, pp. 5776–5785, Ottawa, Canada, 2023. PDF
  • Qiankun Li, Xiaolong Huang, Zhifan Wan, Lanqing Hu, Shuzhe Wu, Jie Zhang, Shiguang Shan, Zengfu Wang. Data-Efficient Masked Video Modeling for Self-supervised Action Recognition. Proceedings of the 31st ACM International Conference on Multimedia, pp. 5776–5785, Ottawa, Canada, 2023. PDF
  • Jiong Yin, Liang Li, Jiehua Zhang, Chenggang Yan, Lei Zhang, Zunjie Zhu. Reducing Intrinsic and Extrinsic Data Biases for Moment Localization with Natural Language. Proceedings of the 31st ACM International Conference on Multimedia, pp. 4584–4594, Ottawa, Canada, 2023. PDF
  • Shuhan Kong, Liang Li, Beichen Zhang, Wenyu Wang, Bin. Jiang, Chenggang Yan, Changhao Xu. Dynamic Contrastive Learning with Pseudo-samples Intervention for Weakly Supervised Joint Video MR and HD. Proceedings of the 31st ACM International Conference on Multimedia, pp. 538–546, Ottawa, Canada, 2023. PDF
  • Wenxin Ni, Qianqian Xu, Yangbangyan Jiang, Zongsheng Cao, Xiaochun Cao and Qingming Huang. PSNEA: Pseudo-Siamese Network for Entity Alignment between Multi-modal Knowledge Graphs. Proceedings of the 31st ACM International Conference on Multimedia, pp. 3489–3497, Ottawa, Canada, 2023. PDF
  • Nan Kang, Hong Chang, Bingpeng Ma, Shutao Bai, Shiguang Shan, Xilin Chen. Predictive Consistency Learning for Long-Tailed Recognition. British Machine Vision Conference (BMVC), Aberdeen, UK, Nov. 20-24, 2023. PDF
  • Gaoxiang Cong, Liang Li, Yuankai Qi, Zheng-Jun Zha, Qi Wu, Wenyu Wang, Bin. Jiang, Ming-Hsuan Yang, Qingming Huang. Learning to Dub Movies via Hierarchical Prosody Models. IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14687-14697, Vancouver, Canada, Jun. 18-22, 2023. PDF
  • Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang. Self-Supervised Cross-View Representation Reconstruction for Change Captioning.IEEE/CVF International Conference on Computer Vision (ICCV), pp. 2805-2815, Paris, France, Oct. 4-6, 2023. PDF
  • Dongyang Liu, Meina Kan, Shiguang Shan, Xilin Chen. Function-Consistent Feature Distillation. International Conference on Learning Representations (ICLR), 2023. PDF
  • Zhongqi Wang, Jie Zhang, Zhilong Ji, Jinfeng Bai, Shiguang Shan. CCLAP: Controllable Chinese Landscape Painting Generation via Latent Diffusion Model. IEEE International Conference on Multimedia and Expo (ICME), Brisbane, Australia, Jul. 10 - 14, 2023. PDF
  • Peifeng Gao, Qianqian Xu, Peisong Wen, Zhiyong Yang, Huiyang Shao and Qingming Huang. Feature Directions Matters: Long-Tailed Learning via Rotated Balanced Representation. International Conference on Machine Learning, pp. 27542–27563, Honolulu, Hawaii USA, Jul. 23-29, 2023. PDF
  • Xiaohan Wang, Yuehu Liu, Xinhang Song, Beibei Wang, Shuqiang Jiang. CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes. Annual Conference on Neural Information Processing Systems (NeurIPS), New Orleans, LA, Dec. 10-16, 2023. PDF
  • Zitai Wang, Qianqian Xu, Zhiyong Yang, Yuan He, Xiaochun Cao and Qingming Huang. A Unified Generalization Analysis of Re-Weighting and Logit-Adjustment for Imbalanced Learning. Annual Conference on Neural Information Processing Systems (NeurIPS), New Orleans, LA, Dec. 10-16, 2023. PDF
  • Siran Dai, Qianqian Xu, Zhiyong Yang, Xiaochun Cao, Qingming Huang. DRAUC: An Instance-wise Distributionally Robust AUC Optimization Framework. Annual Conference on Neural Information Processing Systems (NeurIPS), New Orleans, LA, Dec. 10-16, 2023. PDF
  • Yan Zhu, Junbao Zhuo, Bin Ma, Jiajia Geng, Xiaoming Wei, Xiaolin Wei, Shuhui Wang. Orthogonal Temporal Interpolation for Zero-Shot Video Recognition. Proceedings of the 31st ACM International Conference on Multimedia, pp. 7491-7501, Ottawa, Canada, 2023. PDF
  • Yuchen Sun, Qianqian Xu, Zitai Wang and Qingming Huang. When Measures are Unreliable: Imperceptible Adversarial Perturbations toward Top-k Multi-Label Learning. Proceedings of the 31st ACM International Conference on Multimedia, pp. 1515–1526, Ottawa, Canada, 2023. PDF
  • Hao Wei, Shuhui Wang, Zhe Xue, Shengbo Chen, Qingming Huang. Conversational Composed Retrieval with Iterative Sequence Refinement. Proceedings of the 31st ACM International Conference on Multimedia, pp. 6390-6399, Ottawa, Canada, 2023. PDF
  • Junbao Zhuo, Xingyu Zhao, Shuhui Wang, Huimin Ma, Qingming Huang. Synthesizing Videos from Images for Image-to-Video Adaptation. Proceedings of the 31st ACM International Conference on Multimedia, pp. 8294-8303, Ottawa, Canada, 2023. PDF
  • Junbao Zhuo, Xingyu Zhao, Shuhao Cui, Qingming Huang, Shuhui Wang. Adaptive Feature Swapping for Unsupervised Domain Adaptation. Proceedings of the 31st ACM International Conference on Multimedia, pp. 7017-7028, Ottawa, Canada, 2023. PDF