您当前的位置:
视觉建模组
组 长: 常虹 研究员
Email: changhong [at] ict dot ac dot cn; hong.chang [at] vipl dot ict dot ac dot cn
课题组简介

    视觉建模组致力于研究新的机器学习方法与技术,以解决现实的计算机视觉问题,主要包括图像视频的表示、理解、预测和增强,以及面向下一代智能视频监控的目标(特别是行人)检测、跟踪、识别和检索。过去的几年中,视觉建模组在相关领域会议和期刊上发表多篇论文。最近,该组成员获得ECCV’2018行人检测竞赛冠军,以及ICME’18铂金最佳论文奖。

研究内容

机器学习的模型和算法,特别是

1) 度量学习
2) 小样本学习, 元学习
3) 无监督/半监督/弱监督学习
4) 深度学习


目标检测、跟踪与识别

1) 行人再识别
2) 目标(特别是行人)检测
3) 单目标/多目标跟踪


图像/视频表示学习

1) 属性学习
2) 视觉关系检测
3) 组合语义学习

图像/视频生成与增强

1) 图像生成
2) 视频预测
3) 图像/视频超分辨率



部分论文

刊物论文

  • Nan Kang, Hong Chang, Bingpeng Ma, Shiguang Shan. A Comprehensive Framework for Long-tailed Learning via Pretraining and Normalization. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), Vol. 35, No. 3, pp. 3437-3449, 2024.
  • Ruibing Hou, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen. Triplet adaptation framework for robust semi-supervised learning. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Vol. 46, No. 12, pp. 8056-8073, 2024.
  • Keliang Li, Hong Chang, Shiguang Shan, Xilin Chen. Enhancing robustnesses to class-conditional distribution shift in long-tailed recognition. Transactions on Machine Learning Research, 2024.
  • Ruibing Hou,Hong Chang,Bingpeng Ma,Shiguang Shan,Xilin Chen. Dual Compensation Residual Networks for Class Imbalanced Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Vol. 45, No. 10, pp. 11733 - 11752, 2023.
  • Ruibing Hou, Hong Chang, Bingpeng Ma, Rui Huang, Shiguang Shan. Temporal Multi-Scale Complementary Feature for Video Person Re-Identification. CHINESE JOURNAL OF COMPUTERS, Vol 46, No. 1, pp. 31-50, 2023.
  • Ruibing Hou, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen. Dual Compensation Residual Networks for Class Imbalanced Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 45, No. 10, pp. 11733-11752, 2023.
  • Fengling Mao, Bingpeng Ma, Hong Chang, Shiguang Shan and Xilin Chen. Learning Efficient Text to Image Synthesis via Interstage Cross-sample Similarity Distillation. SCIENCE CHINA Information Sciences, 64(2): 120102:1-120102:12, 2021.
  • Ruibing Hou, Bingpeng Ma, Hong Chang, Xinqian Gu, Shiguang Shan and Xilin Chen. IAUnet: Global Context-Aware Feature Learning for Person Re-Identification. IEEE Trans. on Neural Networks and Learning Systems (TNNLS), 32(10):4460-4474, 2021.
  • Xiangzhou Zhang, Bingpeng Ma, Hong Chang, Shiguang Shan and Xilin Chen. Location Sensitive Network for Human Instance Segmentation. IEEE Transactions on Image Processing (TIP), 30:7649-7662, 2021.
  • Kongming Liang, Hong Chang, Shiguang Shan, Xilin Chen, "Visual Concept Conjunction Learning with Recurrent Neural Networks," Neurocomputing, Vol. 395, PP. 229-236, June 2020.
  • Yucheng Chen, Bingpeng Ma and Hong Chang, "Part Alignment Network for Vehicle Re-identification," Neurocomputing, vol. 418, no. 12, pp. 114-125, December 2020.
  • Furong Xu, Bingpeng Ma, Hong Chang and Shiguang Shan. "Isosceles Constraints for Person Re-identification." IEEE Trans. on Image Processing, vol. 29, no. 11, pp. 8930-8943, November 2020.
  • Kongming Liang, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen, "Unifying Visual Attribute Learning with Object Recognition in a Multiplicative Framework," IEEE Transactions on Pattern Analysis and Machine Intelligence(TPAMI), 41(7):1747 - 1760, 2019.
  • Hao Liu, Bingpeng Ma, Lei Qin, Junbiao Pang, Chunjie Zhang, Qingming Huang, "Set-Label Modeling and Deep Metric Learning on Person Re-Identification," Neurocomputing, vol. 151, no. 3, pp. 1283-1292, 2015.
  • Bingpeng Ma, Yu Su, Frederic Jurie, "Covariance Descriptor Based on Bio-Inspired Features for Person Re-Identification and Face Verification," Image and Vision Computing, vol. 32, no. 6-7, pp. 379-390, 2014.
  • Zhen Cui, Hong Chang, Shiguang Shan, Bingpeng Ma, "Joint Sparse Representation for Video-Based face Recognition," Neurocomputing, 2014.
  • Bingpeng Ma, Xiujuan Chai, Tianjiang Wang, "A Novel Feature Descriptor Based on Biologically Inspired Feature for Head Pose Estimation," Neurocomputing, vol. 115, pp. 1-10, 2013.
  • Wei Zheng, Hong Chang, Luhong Liang, Haoyu Ren, Shiguang Shan, Xilin Chen, “Strip Features for Fast Object Detection,” IEEE Transactions on Systems, Man and Cybernetics, Part B: Cybernetics, vol.43, no.6, pp.1898-1912,Dec.2013.
  • Wei Zheng, Luhong Liang,Hong Chang, Cher-Keng Heng, Shiguang Shan, Xilin Chen, “Boosted Translation-Tolerable Classifiers for Fast Object Detection. Image and Vision Computing,” Image and Vision Computing, vol.30, no.8, pp.480-491,Aug.2012.
  • Deming Zhai, Hong Chang, Shiguang Shan, Xilin Chen, Wen Gao, "Multi-View Metric Learning with Global Consistency and Local Smoothness," ACM Transactions on Intelligent Systems and Technology, Volume 3 Issue 3, May 2012.

会议论文

  • Jinjing Hu, Wenrui Liu, Hong Chang, Bingpeng Ma,Shiguang Shan, Xilin Chen. An Information Theoretical View for Out-Of-Distribution Detection. European Conference on Computer Vision (ECCV), pp. 418-435, Milano, Italy, Sep 29-Oct 4, 2024.
  • Jinjing Hu, Wenrui Liu, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen. An Information Theoretical View for Out-Of-Distribution Detection. European Conference on Computer Vision (ECCV), pp. 418-435, Mico Milano, Italy, Sep 29-Oct 4, 2024.
  • Minyang Hu, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen. Scalable Modular Network: A Framework for Adaptive Learning via Agreement Routing. International Conference on Learning Representations (ICLR), Vienna, Austria, May 7-11, 2024.
  • Minyang Hu, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen. Scalable Modular Network: A Framework for Adaptive Learning via Agreement Routing. International Conference on Learning Representations (ICLR), Vienna, Austria, May 7-11, 2024.
  • Mingshuang Luo, Ruibing Hou, Zhuo Li, Hong Chang, Zimo Liu, Yaowei Wang, Shiguang Shan. M3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation. Annual Conference on Neural Information Processing Systems (NeurIPS), Vancouver, Canada, Dec. 10-15, 2024.
  • Jiachen Liang, Ruibing Hou, Minyang Hu, Hong Chang, Shiguang Shan, Xilin Chen. UMFC: Unsupervised Multi-Domain Feature Calibration for Vision-Language Models. Annual Conference on Neural Information Processing Systems (NeurIPS), Vancouver, Canada, Dec. 10-15, 2024.
  • Nan Kang, Hong Chang, Bingpeng Ma, Shutao Bai, Shiguang Shan, Xilin Chen. Predictive Consistency Learning for Long-Tailed Recognition. British Machine Vision Conference (BMVC), Aberdeen, UK, Nov. 20-24, 2023.
  • Wenrui Liu, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen. Diversity-Measurable Anomaly Detection. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 12147-12156, Vancouver, Canada, Jun. 20-22, 2023.
  • Fangyi Zhang, Bingpeng Ma, Hong Chang, Shiguang Shan and Xilin Chen, "Relation-aware Multiple Attention Siamese Networks for Robust Visual Tracking," British Machine Vision Conference (BMVC), Cardiff, UK, September, 9-12, 2019.
  • Ruibing Hou, Bingpeng Ma, Hong Chang, Xinqian Gu, Shiguang Shan, Xilin Chen, “Interaction-and-Aggregation Network for Person Re-identification,” IEEE Conference on Computer Vision and Pattern Recognition, pp.9317-9326, Long Beach, California, USA, June 16-20, 2019.
  • Ruibing Hou, Hong Chang, Bingpeng Ma and Xilin Chen, "Video Predictionwith Bidirectional Constraint Network," 14th IEEE InternationalConference on Automatic Face and Gesture Recognition (FG 2019), pp. 1-8,Lille, France, May 14-18, 2019.
  • Ruibing Hou, Bingpeng Ma, Hong Chang, Xinqian Gu, Shiguang Shan, Xilin Chen, “VRSTC: Occlusion-Free Video Person Re-Identification,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.7183-7192, Long Beach, California, USA, June 16-20, 2019.
  • Liqiang Bao, Bingpeng Ma, Hong Chang, Xilin Chen, "Masked Graph Attention Network for Person Re-identification," International Workshop conjunction with IEEE Computer Vision and Pattern Recognition(CVPR), Long Beach, California, USA, June 2019.
  • Liqiang Bao, Bingpeng Ma, Hong Chang, Xilin Chen, "Preserving Structural Relationships for Person Re-identification," International Workshop conjunction with IEEE International Conference on Multimedia and Expo(ICME), pp.120-125, July 2019.
  • Hongkai Zhang, Hong Chang, Bingpeng Ma, Shiguang Shan and Xilin Chen, "Cascade RetinaNet: Maintaining Consistency for Single-Stage Object Detection," British Machine Vision Conference (BMVC), Cardiff, UK, September, 9-12, 2019.
  • Xiaoyi Yin, Xinqian Gu, Hong Chang, Bingpeng Ma and Xilin Chen, "Attribute-aware Pedestrian Image Editing," International Conference on Image and Graphics (ICIG), August 2019.
  • Kongming Liang, Yuhong Guo, Hong Chang, Xilin Chen, "Visual Relationship Detection with Deep Structural Ranking," AAAI Conference on Artificial Intelligence(AAAI2018), New Orleans, Louisiana, USA, 2018.
  • Shen Li, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen, "Continuity-Discrimination Convolutional Neural Network for Visual Object Tracking," IEEE International Conference on Multimedia and Expo, San Diego, USA, July 23-27,2018.
  • Furong Xu, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen, "Style Transfer with Adversarial Learning for Cross-Dataset Person Re-identification," Asian Conference on Computer Vision 2018(ACCV2018), 2-6 Dec. 2018, Perth Western Australia.
  • Xiaqing Xu, Bingpeng Ma, Hong Chang, Xilin Chen, "Siamese Recurrent Architecture for Visual Tracking," IEEE International Conference on Image Processing(ICIP2017), 2017.