Yuanhang Zhang, Susan Liang, Shuang Yang, Xiao Liu, Zhongqin Wu and Shiguang Shan. ICTCAS-UCAS-TAL Submission to the AVA-ActiveSpeaker Task (The 1st Place). IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshop of the International Challenge on Activity Recognition (ActivityNet), 2021.
PDF
Zheng Yuan, Jie Zhang, Yunpei Jia, Chuanqi Tan, Tao Xue and Shiguang Shan. Meta Gradient Adversarial Attack. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 7748-7757, Montreal, Canada, Oct.11-17, 2021.
PDF
Sijin Wang, Ziwei Yao, Ruiping Wang, Zhongqin Wu and Xilin Chen. FAIEr: Fidelity and Adequacy Ensured Image Caption Evaluation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 14050–14059, Virtual Event, June 19-25, 2021.
PDF
Yuecong Min, Aiming Hao, Xiujuan Chai and Xilin Chen. Visual Alignment Constraint for Continuous Sign Language Recognition. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 11542-11551, Montreal, Canada, Oct. 11-17, 2021.
PDF
Difei Gao, Ruiping Wang, Ziyi Bai and Xilin Chen. Env-QA: A Video Question Answering Benchmark for Comprehensive Understanding of Dynamic Environments. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 1675-1685, Montreal, Canada, Oct. 11-17, 2021.
PDF
Jiwei Xiao, Ruiping Wang and Xilin Chen. Holistic Pose Graph: Modeling Geometric Structure among Objects in a Scene using Graph Inference for 3D Object Prediction. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 12717–12726, Montreal, Canada, Oct. 11-17, 2021.
PDF
Wenbin Wang, Ruiping Wang and Xilin Chen. Topic Scene Graph Generation by Attention Distillation from Caption. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 15900-15910, Montreal, Canada, Oct. 11-17, 2021.
PDF
Aiming Hao, Yuecong Min and Xilin Chen. Self-Mutual Distillation Learning for Continuous Sign Language Recognition. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 11303-11312, Montreal, Canada, Oct. 11-17, 2021.
PDF
Yunjia Sun, Jiabei Zeng, Shiguang Shan, Xilin Chen. Cross-Encoder for Unsupervised Gaze Representation Learning. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 3702-3711, Montreal, Canada, Oct. 11-17, 2021.
PDF
Tianyu Zhang, Weiqing Min, Jiahao Yang, Tao Liu, Shuqiang Jiang and Yong Rui. What If We Could Not See? Counterfactual Analysis for Egocentric Action Anticipation. International Joint Conference on Artificial Intelligence (IJCAI), pp. 1316-1322, Virtual Event / Montreal Canada, Aug. 19-26, 2021.
PDF
Sixian Zhang, Xinhang Song, Yubing Bai, Weijie Li, Yakui Chu and Shuqiang Jiang. Hierarchical Object-to-zone Graph for Object Navigation. IEEE/CVF International Conference on Computer Vision (ICCV), pages 15130–15140, Montreal, Canada, Oct. 11-17, 2021.
PDF
Zhenliang He, Meina Kan and Shiguang Shan. EigenGAN: Layer-Wise Eigen-Learning for GAN. IEEE/CVF International Conference on Computer Vision (ICCV), pp. 14408-14417, Montreal, Canada, Oct. 11-17, 2021.
PDF
Yuanhang Zhang, Susan Liang, Shuang Yang, Xiao Liu, Zhongqin Wu, Shiguang Shan and Xilin Chen. UniCon: Unified Context Network for Robust Active Speaker Detection. ACM International Conference on Multimedia (ACM Multimedia), pp. 3964-3972, Chengdu, China, Oct. 20-24, 2021.
PDF
Xu Yan, Zhengcong Fei, Zekang Li, Shuhui Wang, Qingming Huang and Qi Tian. Semi-Autoregressive Image Captioning. ACM International Conference on Multimedia (ACM Multimedia), pp. 2708-2716, Chengdu, China, Oct. 20-24, 2021.
PDF
Weijie Li, Xinhang Song, Yubing Bai, Sixian Zhang and Shuqiang Jiang. ION: Instance-level Object Navigation. ACM International Conference on Multimedia (ACM Multimedia), pp. 4343-4352, Chengdu, China, Oct. 20-24, 2021.
PDF
Jingru Gan, Jinchang Luo, Haiwei Wang, Shuhui Wang, Wei He and Qingming Huang. Multimodal Entity Linking: A New Dataset and A Baseline. ACM International Conference on Multimedia (ACM Multimedia), pp. 993-1001, Chengdu, China, Oct. 20-24, 2021.
PDF
Qianxiu Hao, Qianqian Xu, Zhiyong Yang and Qingming Huang. Learning Unified Embeddings for Recommendation via Meta-path Semantics. ACM International Conference on Multimedia (ACM Multimedia), pp. 3909–3917, Chengdu, China, Oct. 20-24, 2021.
PDF
Qianxiu Hao, Qianqian Xu, Zhiyong Yang and Qingming Huang. Pareto Optimality for Fairness-constrained Collaborative Filtering. ACM International Conference on Multimedia (ACM Multimedia), pp. 5619–5627, Chengdu, China, Oct. 20-24, 2021.
PDF
Ruibing Hou, Hong Chang, Bingpeng Ma, Rui Huang and Shiguang Shan. BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2014–2023, Virtual Event, Jun. 19-25, 2021.
PDF
Jiguo Li, Chuanmin Jia, Xinfeng Zhang, Siwei Ma and Wen Gao. Cross Modal Compression: Towards Human-comprehensible Semantic Compression. ACM International Conference on Multimedia (ACM Multimedia), pp. 4230-4238, Chengdu, China, Oct. 20-24, 2021.
PDF