Visual Information Processing and Learning
Visual Information Processing and Learning


Research
Visual Scene Understanding

Leader:Ruiping Wang (Professor)

Email:ruiping.wang [at] vipl.ict.ac.cn

Our group focuses on comprehensive scene understanding to enable intelligent perception and understanding of natural visual environment in the open world. More specifically, we aim to propose a vision-based robot system that has the basic capability just like human visual processing system for real world visual scene understanding, mainly including perceptual tasks such as object detection, object recognition, semantic segmentation, scene classification, attribute learning, relationship extraction, and so on. To facilitate more advanced natural language based visual concept semantic description, the system can also incorporate language models and knowledge-based reasoning for cognitive tasks like image/video captioning (description) and visual question answering.
Research

Research topics of our group mainly cover three aspects: 1) Object recognition, e.g. zero-shot learning, incremental/life-long learning, image retrieval, image classification, etc. 2) Scene understanding, e.g. object detection/segmentation, scene classification, relationship detection, scene graph generation, etc., and 3) Language/knowledge-based cognition, e.g. image/video captioning (description), visual question answering, visual concept learning, knowledge graph, etc.


 

Papers

Journal Papers

1.    Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen, “What is Tabby? Interpretable Model Decisions by Learning Attribute-based Classification Criteria,” IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Nov. 2019. (Accepted)

2.    Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen, “Learning Multifunctional Binary Codes for Personalized Image Retrieval,” International Journal of Computer Vision, Feb. 2020. (Accepted)

3.    Difei Gao, Ruiping Wang, Shiguang Shan, and Xilin Chen, "Learning to Recognize Visual Concepts for Visual Question Answering with Structural Label Space," IEEE Journal of Selected Topics in Signal Processing, 2020.(Accepted)

4.    Shishi Qiao, Ruiping Wang, Shiguang Shan, Xilin Chen, “Deep Heterogeneous Hashing for Face Video Retrieval,” IEEE Transactions on Image Processing, vol. 29, no. 1, pp. 1299-1312, Dec. 2020. 【pdf】

5.    Haomiao Liu, Ruiping Wang, Shiguang Shan and Xilin Chen, “Deep Supervised Hashing for Fast Image Retrieval,” International Journal of Computer Vision, vol. 127, no. 9, pp. 1217–1234, Sep. 2019. 【pdf】

6.    Wen Wang, Ruiping Wang, Zhiwu Huang, Shiguang Shan, Xilin Chen, “Discriminant Analysis on Riemannian Manifold of Gaussian Distributions for Face Recognition with Image Sets,” IEEE Transactions on Image Processing (TIP), vol. 27, no. 1, pp. 151-163, Jan. 2018. 【pdf】

7.    Zhiwu Huang, Ruiping Wang, Xianqiu Li, Wenxian Liu, Shiguang Shan, Luc Van Gool, Xilin Chen, "Geometry-aware Similarity Learning on SPD Manifolds for Visual Recognition," IEEE Transactions on circuits and systems for video technology, 28(10), Page(s):2513 – 2523. 2018.10. 【pdf】

8.    Huajie Jiang, Ruiping Wang, Shiguang Shan, Yan Li, Haomiao Liu, Xilin Chen, “Attribute Annotation on Large Scale Image Database by Active Knowledge Transfer,” Image and Vision Computing, vol. 78, pp. 1-13, Oct. 2018. 【pdf】

Conference Papers

1.    Wenbin Wang, Ruiping Wang, Shiguang Shan, Xilin Chen, "Sketching Image Gist: Human-Mimetic Hierarchical Scene Graph Generation," European Conference on Computer Vision, Aug. 23-28, 2020. 【pdf】

2.    Difei Gao, Ke li, Ruiping Wang, Shiguang Shan, Xilin Chen, "Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text," IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2020), pp. 12746–12756, 2020. 【pdf】

3.    Ruikui Wang, Shishi Qiao, Ruiping Wang, Shiguang Shan, Xilin Chen, "Hybrid Video and Image Hashing for Robust Face Retrieval," IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020) , pp. 186-193, 2020. 【pdf】

4.    Ruikui Wang, Ruiping Wang, Shishi Qiao, Shiguang Shan, Xilin Chen, “Deep Position-Aware Hashing for Semantic Continuous Image Retrieval,” IEEE Winter Conference of Applications on Computer Vision (WACV 2020), pp. 2493–2502, Aspen, CO, Mar. 2-5, 2020. 【pdf】

5.    Sijin Wang, Ruiping Wang, Ziwei Yao, Shiguang Shan, Xilin Chen, “Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval,” IEEE Winter Conference of Applications on Computer Vision (WACV 2020), pp. 1508–1517, Aspen, CO, Mar. 2-5, 2020. 【pdf】

6.    Huajie Jiang, Ruiping Wang, Shiguang Shan, Xilin Chen, “Transferable Contrastive Network for Generalized Zero-Shot Learning,” 17th IEEE International Conference on Computer Vision (ICCV 2019), pp. 9764-9773, Seoul, Korea, Oct. 27-Nov. 2, 2019. 【pdf】

7.    Wenbin Wang, Ruiping Wang, Shiguang Shan, Xilin Chen, “Exploring Context and Visual Pattern of Relationship for Scene Graph Generation,” IEEE Conference on Computer Vision and Pattern Recognition(CVPR), pp.8180–8189, Long Beach, California, USA, June 16-20, 2019. 【pdf】

8.    Yirong Mao, Ruiping Wang, Shiguang Shan, Xilin Chen, "COSONet: Compact Second-Order Network for Video Face Recognition," Asian Conference on Computer Vision 2018(ACCV2018), 2-6 Dec. 2018, Perth Western Australia. 【pdf】

9.    Chen He, Ruiping Wang, Shiguang Shan, Xilin Chen, “Exemplar-Supported Generative Reproduction for Class Incremental Learning,”29th British Machine Vision Conference (BMVC2018), Newcastle upon Tyne, UK, Sep. 3-6, 2018. 【pdf】

10.    Yong Liu, Ruiping Wang, Shiguang Shan, Xilin Chen, “Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR2018), pp. 6985-6994, Salt Lake City, UT, June 18-22, 2018. 【pdf】

11.    Huajie Jiang, Ruiping Wang, Shiguang Shan, Xilin Chen, “Learning Class Prototypes via Structure Alignment for Zero-Shot Recognition,”15th European Conference on Computer Vision (ECCV2018), Munich, Germany, Sep. 8-14, 2018. 【pdf】

12.    Difei Gao, Ruiping Wang, Shiguang Shan, Xilin Chen, "Visual Textbook Network:Watch Carefully before Answering Visual Questions," British Machine Vision ConferenceConference(BMVC2017), 2017. 【pdf】

13.    Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen, "Learning Multifunctional Binary Codes for Both Category and Attribute Oriented Retrieval Tasks," IEEE Conference on Computer Vision and Pattern Recognition(CVPR2017), 2017. 【pdf】

14.    Huajie Jiang, Ruiping Wang, Shiguang Shan, Yi Yang, Xilin Chen, "Learning Discriminative Latent Attributes for Zero-Shot Classification," IEEE International Conference on Computer Vision(ICCV2017), Venice, Italy, 2017. 【pdf】

15.    Shishi Qiao, Ruiping Wang, Shiguang Shan, Xilin Chen, "Deep Video Code for Efficient Face Video Retrieval," Asian Conference on Computer Vision(ACCV2016), 2016. 【pdf】

16.    Yuheng Lu, Ruiping Wang, Shiguang Shan, Xilin Chen, "Multiple-shot Person Re-identification via Riemannian Discriminative Learning," Asian Conference on Computer Vision Workshops(ACCV2016), 2016. 【pdf】

17.    Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen, "Deep Supervised Hashing for Fast Image Retrieval," IEEE Conference on Computer Vision and Pattern Recognition(CVPR2016), 2016. 【pdf】

18.    Yan Li, Ruiping Wang, Haomiao Liu, Huajie Jiang, Shiguang Shan, Xilin Chen, "Two Birds,One Stone:Jointly Learning Binary Code for Large-Scale Face Image Retrieval and Attributes Prediction," IEEE International Conference on Computer Vision(ICCV2015), 2015. 【pdf】

19.    Mengyi Liu, Shaoxin Li, Shiguang Shan, Ruiping Wang, Xilin Chen, “Deeply Learning Deformable Facial Action Parts Model for Dynamic Expression Analysis,” in Proceedings of Asian Conference on Computer Vision(ACCV2014), Singapore, Nov.2014. 【pdf】

20.    Mengyi Liu, Ruiping Wang, Shaoxin Li, Shiguang Shan, Zhiwu Huang, Xilin Chen, “Combining Multiple Kernel Methods on Riemannian Manifold for Emotion Recognition in the Wild,” in Proceedings of ACM International Conference on Multimodal Interaction(ICMI2014), Istanbul, Turkey, Nov.2014. 【pdf】


Visual Information Processing and Learning
  • Address :No.6 Kexueyuan South Road
  • Zhongguancun,Haidian District
  • Beijing,China
  • Postcode :100190
  • Tel : (8610)62600514
  • Email:yi.cheng@vipl.ict.ac.cn
  • Valse

  • Big Lecture of DL

Copyright @ Visual Information Processing and Learning 京ICP备05002829号 京公网安备1101080060