Visual Information Processing and Learning
Visual Information Processing and Learning


Research
Visual Modeling

Leader:Hong Chang (Associate Professor)

Email:changhong [at] ict.ac.cn

Visual Modeling (VISMOD) group focuses on research of innovative machine learning methodologies and technologies to solve real-world computer vision problems including image/video representation, understanding, prediction and enhancement, as well as object (especially human) detection, tracking, recognition and retrieval, oriented to next-generation intelligent video surveillance.

During the past several years, VISMOD group has published many papers in related conferences and journals. Recently, the group members have achieved the 1stplace in ECCV’2018 Pedestrian Detection Challenge, and the Best Paper Award in ICME’2018.
Research

The current main research topics include:

●  Models and algorithms in machine learning, especially

1)    Metric learning

2)    Few-shot learning

3)    Semi-supervised/Weakly supervised learning

4)    Deep learning

 

●  Image/video representation and understanding

1)    Attribute learning

2)    Visual relationship detection

3)    Combinatorial semantic learning

 

●  Image/video generation and enhancement

1)    Image generation 

2)    Video prediction

3)    Image/video super-resolution

 

●  Object detection, tracking and recognition

1)    Person re-identification Multiple

2)   Object (especially human) detection

3)    Single/Multiple object tracking   


Papers

Journal Papers

1.    Kongming Liang, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen, "Unifying Visual Attribute Learning with Object Recognition in a Multiplicative Framework," IEEE Transactions on Pattern Analysis and Machine Intelligence(TPAMI), 2019.(Accepted)2018 【pdf】

2.    Bingpeng Ma, Rui Huang, Lei Qin, "VOD: A Novel Image Representation for Head Yaw Estimation," Neurocomputing, vol. 148, pp. 455-466, 2015. 【pdf】

3.    Hao Liu, Bingpeng Ma, Lei Qin, Junbiao Pang, Chunjie Zhang, Qingming Huang, "Set-Label Modeling and Deep Metric Learning on Person Re-Identification," Neurocomputing, vol. 151, no. 3, pp. 1283-1292, 2015. 【pdf】

4.    Bingpeng Ma, Yu Su, Frederic Jurie, "Covariance Descriptor Based on Bio-Inspired Features for Person Re-Identification and Face Verification," Image and Vision Computing, vol. 32, no. 6-7, pp. 379-390, 2014. 【pdf】

5.    Zhen Cui, Hong Chang, Shiguang Shan, Bingpeng Ma, "Joint Sparse Representation for Video-Based face Recognition," Neurocomputing, 2014. 【pdf】

6.    Wei Zheng, Hong Chang, Luhong Liang, Haoyu Ren, Shiguang Shan, Xilin Chen, “Strip Features for Fast Object Detection,” IEEE Transactions on Systems, Man and Cybernetics, Part B: Cybernetics, vol.43, no.6, pp.1898-1912,Dec.2013. 【pdf】

7.    Bingpeng Ma, Xiujuan Chai, Tianjiang Wang, "A Novel Feature Descriptor Based on Biologically Inspired Feature for Head Pose Estimation," Neurocomputing, vol. 115, pp. 1-10, 2013. 【pdf】

8.    Deming Zhai, Hong Chang, Shiguang Shan, Xilin Chen, Wen Gao, "Multi-View Metric Learning with Global Consistency and Local Smoothness," ACM Transactions on Intelligent Systems and Technology, Volume 3 Issue 3, May 2012. 【pdf】

9.    Bo Li, Hong Chang, Shiguang Shan, Xilin Chen, “Low-Resolution Face recognition via Coupled Locality Preserving Mappings,” IEEE Signal Processing Letters, vol.16, no.11, pp.20-23,Nov.2009. 【pdf】

10.    Xiaopeng Hong, Hong Chang, Shiguang Shan, Bineng Zhong, Xilin Chen, Wen Gao, “Sigma Set Based Implicit Online Learning For Object Tracking,” IEEE Signal Processing Letters, vol.17, no.9, pp.807-810,Sept.2010.

11.    Bo Li, Hong Chang, Shiguang Shan, Xilin Chen, “Aligning Coupled Manifolds for Face Hallucination,” IEEE Signal Processing Letters, vol.17, no.1, pp. 957-960 ,Jan.2010. 【pdf】

Conference Papers

1.    Fangyi Zhang, Bingpeng Ma, Hong Chang, Shiguang Shan and Xilin Chen, "Relation-aware Multiple Attention Siamese Networks for Robust Visual Tracking," British Machine Vision Conference, 2019. (Accepted)

2.    Fengling Mao, Bingpeng Ma, Hong Chang, Shiguang Shan and Xilin Chen, "MS-GAN: Text to Image Synthesis with Attention-Modulated Generators and Similarity-aware Discriminators," British Machine Vision Conference, 2019. (Accepted)

3.    Hongkai Zhang, Hong Chang, Bingpeng Ma, Shiguang Shan and Xilin Chen, "Cascade RetinaNet: Maintaining Consistency for Single-Stage Object Detection," British Machine Vision Conference, 2019. (Accepted)

4.    Xinqian Gu, Bingpeng Ma, Hong Chang, Shiguang Shan and Xilin Chen, "Temporal Knowledge Propagation for Image-to-Video Person Re-identification," Proc. IEEE International Conference on Computer Vision, 2019. (Accepted)

5.    Xinqian Gu, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen, "Temporal Knowledge Propagation for Image-to-Video Person Re-identification," IEEE International Conference on Computer Vision (ICCV), 2019. (Accepted)

6.    Xiaoyi Yin, Xinqian Gu, Hong Chang, Bingpeng Ma and Xilin Chen, "Attribute-aware Pedestrian Image Editing," International Conference on Image and Graphics, 2019. (Accepted)

7.    Liqiang Bao, Bingpeng Ma, Hong Chang, Xilin Chen, "Preserving Structural Relationships for Person Re-identification," International Workshop conjunction with IEEE International Conference on Multimedia and Expo(ICME), pp.120-125, July 2019. 【pdf】

8.    Liqiang Bao, Bingpeng Ma, Hong Chang, Xilin Chen, "Masked Graph Attention Network for Person Re-identification," International Workshop conjunction with IEEE Computer Vision and Pattern Recognition(CVPR), Long Beach, California, USA, June 2019.

9.    Ruibing Hou, Bingpeng Ma, Hong Chang, Xinqian Gu, Shiguang Shan, Xilin Chen, “Interaction-and-Aggregation Network for Person Re-identification,” IEEE Conference on Computer Vision and Pattern Recognition, pp.9317-9326, Long Beach, California, USA, June 16-20, 2019. 【pdf】

10.    Ruibing Hou, Bingpeng Ma, Hong Chang, Xinqian Gu, Shiguang Shan, Xilin Chen, “VRSTC: Occlusion-Free Video Person Re-Identification,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.7183-7192, Long Beach, California, USA, June 16-20, 2019. 【pdf】

11.    Ruibing Hou, Hong Chang, Bingpeng Ma and Xilin Chen, "Video Prediction with Bidirectional Constraint Network," 14th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2019), pp. 1-8, Lille, France, May 14-18, 2019. 【pdf】

12.    Kongming Liang, Yuhong Guo, Hong Chang, Xilin Chen, "Visual Relationship Detection with Deep Structural Ranking," AAAI Conference on Artificial Intelligence(AAAI2018), New Orleans, Louisiana, USA, 2018. 【pdf】

13.    Furong Xu, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen, "Style Transfer with Adversarial Learning for Cross-Dataset Person Re-identification," Asian Conference on Computer Vision 2018(ACCV2018), 2-6 Dec. 2018, Perth Western Australia. 【pdf】

14.    Shen Li, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen, "Continuity-Discrimination Convolutional Neural Network for Visual Object Tracking," IEEE International Conference on Multimedia and Expo(ICME2018) 【pdf】

15.    Xiaqing Xu, Bingpeng Ma, Hong Chang, Xilin Chen, "Siamese Recurrent Architecture for Visual Tracking," IEEE International Conference on Image Processing(ICIP2017), 2017. 【pdf】

16.    Kongming Liang, Yuhong Guo, Hong Chang, Xilin Chen, "Incomplete Attribute Learning with Auxiliary Labels," International Joint Conference on Artificial Intelligence(IJCAI2017), 2017. 【pdf】

17.    Xuesong Deng, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen, "Deep Second-Order Siamese Network for Pedestrian Re-Identification," Asian Conference on Computer Vision(ACCV2016), pp. 321-337, 2016. 【pdf】

18.    Jianfeng He, Bingpeng Ma, Shuhui Wang, Yugui Liu, Qingming Huang, "Cross-Modal Retrieval by Real Label Partial Least Squares," ACM Multimedia(ACMMM2016), pp. 227-231, 2016. 【pdf】

19.    Liang Zhang, Bingpeng Ma, Guorong Li, Qingming Huang, Qi Tian, "PL-Ranking:A Novel Ranking Method for Cross-Modal Retrieval," ACM Multimedia(ACMMM2016), pp. 1355-1364, 2016. 【pdf】

20.    Mingquan Ye, Hong Chang, Xilin Chen, "Online Visual Tracking Via Coupled Object-Context Dictionary," British Machine Vision Conference(BMVC2015), Swansea, UK, 2015. 【pdf】


Visual Information Processing and Learning
  • Address :No.6 Kexueyuan South Road
  • Zhongguancun,Haidian District
  • Beijing,China
  • Postcode :100190
  • Tel : (8610)62600514
  • Email:yi.cheng@vipl.ict.ac.cn
  • Valse

  • Big Lecture of DL

Copyright @ Visual Information Processing and Learning 京ICP备05002829号 京公网安备1101080060