2021
  • Yuanhang Zhang, Susan Liang, Shuang Yang, Xiao Liu, Zhongqin Wu and Shiguang Shan. ICTCAS-UCAS-TAL Submission to the AVA-ActiveSpeaker Task (The 1st Place). IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshop of the International Challenge on Activity Recognition (ActivityNet), 2021. PDF
  • Zheng Yuan, Jie Zhang, Yunpei Jia, Chuanqi Tan, Tao Xue and Shiguang Shan. Meta Gradient Adversarial Attack. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 7748-7757, Montreal, Canada, Oct.11-17, 2021. PDF
  • Sijin Wang, Ziwei Yao, Ruiping Wang, Zhongqin Wu and Xilin Chen. FAIEr: Fidelity and Adequacy Ensured Image Caption Evaluation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 14050–14059, Virtual Event, June 19-25, 2021. PDF
  • Yuecong Min, Aiming Hao, Xiujuan Chai and Xilin Chen. Visual Alignment Constraint for Continuous Sign Language Recognition. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 11542-11551, Montreal, Canada, Oct. 11-17, 2021. PDF
  • Difei Gao, Ruiping Wang, Ziyi Bai and Xilin Chen. Env-QA: A Video Question Answering Benchmark for Comprehensive Understanding of Dynamic Environments. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 1675-1685, Montreal, Canada, Oct. 11-17, 2021. PDF
  • Jiwei Xiao, Ruiping Wang and Xilin Chen. Holistic Pose Graph: Modeling Geometric Structure among Objects in a Scene using Graph Inference for 3D Object Prediction. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 12717–12726, Montreal, Canada, Oct. 11-17, 2021. PDF
  • Wenbin Wang, Ruiping Wang and Xilin Chen. Topic Scene Graph Generation by Attention Distillation from Caption. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 15900-15910, Montreal, Canada, Oct. 11-17, 2021. PDF
  • Aiming Hao, Yuecong Min and Xilin Chen. Self-Mutual Distillation Learning for Continuous Sign Language Recognition. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 11303-11312, Montreal, Canada, Oct. 11-17, 2021. PDF
  • Yunjia Sun, Jiabei Zeng, Shiguang Shan, Xilin Chen. Cross-Encoder for Unsupervised Gaze Representation Learning. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 3702-3711, Montreal, Canada, Oct. 11-17, 2021. PDF
  • Tianyu Zhang, Weiqing Min, Jiahao Yang, Tao Liu, Shuqiang Jiang and Yong Rui. What If We Could Not See? Counterfactual Analysis for Egocentric Action Anticipation. International Joint Conference on Artificial Intelligence (IJCAI), pp. 1316-1322, Virtual Event / Montreal Canada, Aug. 19-26, 2021. PDF
  • Sixian Zhang, Xinhang Song, Yubing Bai, Weijie Li, Yakui Chu and Shuqiang Jiang. Hierarchical Object-to-zone Graph for Object Navigation. IEEE/CVF International Conference on Computer Vision (ICCV), pages 15130–15140, Montreal, Canada, Oct. 11-17, 2021. PDF
  • Zhenliang He, Meina Kan and Shiguang Shan. EigenGAN: Layer-Wise Eigen-Learning for GAN. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 14408-14417, Montreal, Canada, Oct. 11-17, 2021. PDF
  • Yuanhang Zhang, Susan Liang, Shuang Yang, Xiao Liu, Zhongqin Wu, Shiguang Shan and Xilin Chen. UniCon: Unified Context Network for Robust Active Speaker Detection. ACM International Conference on Multimedia (ACM Multimedia), pp. 3964-3972, Chengdu, China, Oct. 20-24, 2021. PDF
  • Xu Yan, Zhengcong Fei, Zekang Li, Shuhui Wang, Qingming Huang and Qi Tian. Semi-Autoregressive Image Captioning. ACM International Conference on Multimedia (ACM Multimedia), pp. 2708-2716, Chengdu, China, Oct. 20-24, 2021. PDF
  • Weijie Li, Xinhang Song, Yubing Bai, Sixian Zhang and Shuqiang Jiang. ION: Instance-level Object Navigation. ACM International Conference on Multimedia (ACM Multimedia), pp. 4343-4352, Chengdu, China, Oct. 20-24, 2021. PDF
  • Jingru Gan, Jinchang Luo, Haiwei Wang, Shuhui Wang, Wei He and Qingming Huang. Multimodal Entity Linking: A New Dataset and A Baseline. ACM International Conference on Multimedia (ACM Multimedia), pp. 993-1001, Chengdu, China, Oct. 20-24, 2021. PDF
  • Qianxiu Hao, Qianqian Xu, Zhiyong Yang and Qingming Huang. Learning Unified Embeddings for Recommendation via Meta-path Semantics. ACM International Conference on Multimedia (ACM Multimedia), pp. 3909–3917, Chengdu, China, Oct. 20-24, 2021. PDF
  • Qianxiu Hao, Qianqian Xu, Zhiyong Yang and Qingming Huang. Pareto Optimality for Fairness-constrained Collaborative Filtering. ACM International Conference on Multimedia (ACM Multimedia), pp. 5619–5627, Chengdu, China, Oct. 20-24, 2021. PDF
  • Ruibing Hou, Hong Chang, Bingpeng Ma, Rui Huang and Shiguang Shan. BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2014–2023, Virtual Event, Jun. 19-25, 2021. PDF
  • Jiguo Li, Chuanmin Jia, Xinfeng Zhang, Siwei Ma and Wen Gao. Cross Modal Compression: Towards Human-comprehensible Semantic Compression. ACM International Conference on Multimedia (ACM Multimedia), pp. 4230-4238, Chengdu, China, Oct. 20-24, 2021. PDF