中科院计算所视觉信息处理与学习组
中科院计算所视觉信息处理与学习组


您所在的位置 / 科学研究

科学研究
多媒体计算与多模态智能组

组  长:蒋树强 研究员

Email:sqjiang@ict.ac.cn

*  现有研究员1人、副研究员1人、博士后1人、博士和硕士研究生10余人。

*  曾经或正在承担国家自然科学基金优秀青年科学基金、国家自然科学基金重点项目、国家自然科学基金面上项目、国家863课题、北京市科技项目、企业合作项目等课题十余项。曾经或正在承担国家自然科学基金优秀青年科学基金、国家自然科学基金重点项目、国家自然科学基金面上项目、国家863课题、北京市基金重点项目、北京市科技项目、企业合作项目等课题二十余项。

*获奖情况:

在基于搜索的多物体识别技术获得ACM ICMR2013 Best Demo Award;
基于多传感器的视觉识别技术获得ImageClef Robot Vision竞赛2013年度的冠军,
在图像与语言的关联理解技术上的工作分别获ACM Multimedia 2016 Yahoo-Flickr Challenge on Caption Prediction竞赛的冠军。
数据库:

复杂场景下的实例级图像数据集,主页为:http://vipl.ict.ac.cn/isia/instre/;论文:Shuang Wang, Shuqiang Jiang, INSTRE: A New Benchmark for Instance-Level Object Retrieval and Recognition. ACM Transactions on Multimedia Computing, Communications, and Applications(TOMCAT) Vol.11(3), pp. 37:1-37:21, 2015
建立了基于多传感器的手持物体检测数据集,主页为:http://vipl.ict.ac.cn/isia/HOD/;论文:Xiong Lv, Shuqiang Jiang, Luis Herranz, Shuang Wang, RGB-D Hand-Held Object Recognition Based on Heterogeneous Feature Fusion. Journal of Computing Science and Technology, Vol.30(2), pp.340-352 ,2015
建立了基于地理信息的多模态食品图像数据集,主页为:http://vipl.ict.ac.cn/isia/datasets_dish/index.html;论文:Ruihan Xu, Luis Herranz, Shuqiang Jiang, Shuang Wang, Xinhang Song, Ramesh Jain, Geolocalized Modeling for Dish Recognition. IEEE Trans. Multimedia,  Vol.17(8), pp.1187-1199, 2015
研究内容

*图像/视频等多媒体信息的分析、理解与搜索技术;
*视觉、语言、知识库和各种上下文信息的多模态关联、融合与理解技术;
*多模态智能交互技术。


部分论文

刊物论文

1.    Liang Zhang, Bingpeng Ma, Guorong Li, Qingming Huang, Qi Tian, "Generalized Semi-Supervised and Structured Subspace Learning for Cross-Modal Retrieval," IEEE Transaction on Multimedia(TMM2017), vol. 20, no. 1, pp. 128-141, 2018. 【pdf】

2.    Chunjie Zhang, Guibo Zhu, Chao Liang, Yifan Zhang, Qingming Huang, Qi Tian, “Image Class Prediction by Joint Object, Context, and Background Modeling”, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 28, Issue 2, 2018, 428-438. 【pdf】

3.    Zhe Xue, Guorong Li*, Shuhui Wang, Weigang Zhang, Qingming Huang*, “Bilevel Multiview Latent Space Learning”, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 28, Issue 2, 2018, 327-341. 【pdf】

4.    Jun Huang, Guorong Li, Qingming Huang, Xindong Wu, “Joint Feature Selection and Classification for Multi-Label Learning”, IEEE Transactions on Cybernetics, Vol. 48, Issue 3, 2018, 876-889. 【pdf】

5.    Dawei Du, Longyin Wen, Honggang Qi, Qingming Huang, Qi Tian, and Siwei Lyu, “Iterative Graph Seeking for Object Tracking”, IEEE Transactions on Image Processing, Vol. 27, Issue 4, 2018, 1809-1821. 【pdf】

6.    Xinhang Song, Shuqiang Jiang, Luis Herranz, Chengpeng Chen, "Learning Effective RGB-D Representations for Scene Recognition," IEEE Transactions on Image Processing (TIP), doi:10.1109/TIP.2018.2872629. 【pdf】

7.    Yuankai Qi, Lei Qin, Jian Zhang, Shengping Zhang, Ming-Hsuan Yang, Qingming Huang, “Structure-aware Local Sparse Coding for Visual Tracking”, IEEE Transactions on Image Processing, Vol. 27, Issue 8, 2018, 3857-3869. 【pdf】

8.    Junbiao Pang, Jing Huang, Xue Yang, Zuyun Wang, Haitao Yu, Qingming Huang, Baocai Yin, “Discovering Fine-Grained Spatial Pattern From Taxi Trips: Where Point Process Meets Matrix Decomposition and Factorization”, IEEE Transactions on Intelligent Transportation Systems, Vol. 19, Issue 10, 2018, 3208 – 3219. 【pdf】

9.    Liang Zhang, Bingpeng Ma, Guorong Li, Qingming Huang, Qi Tian, “Generalized Semi-supervised and Structured Subspace Learning for Cross-modal Retrieval”, IEEE Transactions on Multimedia, Vol. 20, Issue 1, 2018, 128-141. 【pdf】

10.    Xiangyang Li, Shuqiang Jiang, "Bundled Object Context for Referring Expressions," IEEE Transactions on Multimedia (TMM), vol. 20, no. 10, pp. 2749-2760, 2018. 【pdf】

11.    Zhe Xue, Guorong Li, Qingming Huang, "Joint Multi-view Representation and Image Annotation via Optimal Predictive Subspace Learning," Information Science, Vol. 451-452, 2018, 180-194. 【pdf】

12.    Hongyang Yu, Lei Qin, Qingming Huang, Hongxun Yao, "Online Multiple Object Tracking via Exchanging Object Context," Neurocomputing. Vol. 292, 2018, 28-37. 【pdf】

13.    Jianfeng He, Bingpeng Ma, Shuhui Wang, Yugui Liu, Qingming Huang, "Multi-label Double Layers Learning for Cross-Modal Retrieval," Neurocomputing, Vol. 275, 2018, 1893-1902. 【pdf】

14.    Xiaofeng Mao, Shuhui Wang, Liying Zheng, Qingming Huang, "Semantic Invariant Cross-domain Image Generation with Generative Adversarial Networks," Neurocomputing, Vol. 293, 2018, 55-63. 【pdf】

15.    Jiaming Zhang, Shuhui Wang, Qingming Huang, "Location-Based Parallel Tag Completion for Geo-Tagged Social Image Retrieval," ACM Transactions on Intelligent Systems and Technology, vol. 8, no. 3, pp. 1-21, 2017. 【pdf】

16.    Siyuan Liu, Shuhui Wang, Qiang Qu, "Trajectory Mining," Encyclopedia of GIS, pp. 2310-2313, 2017.

17.    Liang Zhang, Bingpeng Ma, Guorong Li, Qingming Huang, Qi Tian, "Cross-Modal Retrieval Using Multi-Ordered Discriminative Structured Subspace Learning," IEEE Transaction on Multimedia(TMM2017), vol. 19, no. 6, pp. 1220-1233, 2017. 【pdf】

18.    Weiqing Min, Bingkun Bao, Shuhuan Mei, Yaohui Zhu, Yong Rui, Shuqiang Jiang, "You Are What You Eat:Exploring Multi-Modal and Multi-Attribute Information from Recipes for Cross-Region Food Analysis," IEEE Transaction on Multimedia(TMM2017), 2017.

19.    Weiqing Min, Shuqiang Jiang, Jitao Sang, Huayang Wang, Xinda Liu, Luis Herranz, "Being a Supercook:Joint Food Attributes and Multimodal Content Modeling for Recipe Retrieval and Exploration," IEEE Transaction on Multimedia(TMM2017), 2017. 【pdf】

20.    Yanhao Zhang, Lei Qin, Rongrong Ji, Sicheng Zhao, Qingming Huang, Jiebo Luo, "Exploring Coherent Motion Patterns Via Structured Trajectory Learning for Crowd Mood Modeling," IEEE Transactions on Circuits and Systems for Video Technology, vol. 27, no. 3, pp. 635-648, 2017.

会议论文

1.    Jiangyangbang Yan, Zhiyong Yang, Qianqian Xu, Xiaochun Cao, Qingming Huang, “When to Learn What: Deep Cognitive Subspace Clustering”, 26th ACM International Conference on Multimedia (ACMMM2018), Seoul, Korea, October 22-26, 2018. 【pdf】

2.    Liang Li, Shuhui Wang, Shuqiang Jiang, Qingming Huang, “Attentive Recurrent Neural Network Weak-supervised Multi-label Image Classification”, 26th ACM International Conference on Multimedia (ACMMM2018), Seoul, Korea, October 22-26, 2018. 【pdf】

3.    Qianqian Xu, Jiechao Xiong, Xinwei Sun, Zhiyong Yang, Xiaochun Cao, Qingming Huang,Yuan Yao, “A Margin-based MLE for Crowdsourced Partial Ranking”, 26th ACM International Conference on Multimedia (ACMMM2018), Seoul, Korea, October 22-26, 2018. 【pdf】

4.    Shuhui Wang, Yangyu Chen, Junbao Zhuo, Qingming Huang, Qi Tian, “Joint Global and Co-Attentive Representation Learning for Image-Sentence Retrieval”, 26th ACM International Conference on Multimedia (ACMMM2018), Seoul, Korea, October 22-26, 2018.

5.    Yiling Wu, Shuhui Wang, Qingming Huang, “Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval”, 26th ACM International Conference on Multimedia (ACMMM2018), Seoul, Korea, October 22-26, 2018. 【pdf】

6.    Qianqian Xu, Jiechao Xiong, Xi Chen, Qingming Huang, Yuan Yao, “HodgeRank with Information Maximization for Crowdsourced Pairwise Ranking Aggregation”, 32nd AAAI Conference on Artificial Intelligence (AAAI2018), New Orleans, Lousiana, United States, Feb 2-7, 2018. 【pdf】

7.    Zhiyong Yang, Qianqian Xu, Xiaochun Cao, Qingming Huang*, “From Common to Special: When Multi-Attribute Learning Meets Personalized”, 32nd AAAI Conference on Artificial Intelligence (AAAI2018), New Orleans, Lousiana, United States, Feb 2-7, 2018. 【pdf】

8.    Yongjian Xin, Shuhui Wang, Liang Li, Weigang Zhang, Qingming Huang, "Reverse Densely Connected Feature Pyramid Network for Object Detection," 14th Asian Conference on Computer Vision (ACCV2018), Perth, Australia, December 2-6, 2018. 【pdf】

9.    Dawei Du, Yuankai Qi, Hongyang Yu, Yifan Yang, Kaiwen Duan, Guorong Li, Weigang Zhang, Qingming Huang, Qi Tian, "The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking," 20th European Conference on Computer Vision (ECCV2018), Munich, Germany, Sept. 8-14, 2018. 【pdf】

10.    Yangyu Chen, Shuhui Wang, Weigang Zhang and Qingming Huang, "Less Is More: Picking Informative Frames for Video Captioning," 20th European Conference on Computer Vision (ECCV2018), Munich, Germany, Sept. 8-14, 2018. 【pdf】

11.    Shulin Li, Weigang Zhang, Guorong Li, Li Su, Qingming Huang, "Vehicle Detection in UAV Traffic Video Based on Convolution Neural Network," IEEE 1st International Conference on Multimedia Information Processing and Retrieval(ICESIP2018), Miami, FL, USA, April 10-12, 2018. 【pdf】

12.    Changsu Liao, Li Su, Weigang Zhang, Qingming Huang, "Semantic Manifold Alignment in Visual Feature Space for Zero-Shot Learning," IEEE International Conference on Multimedia and Expo(ICME2018), San Diego, USA, July 23-27, 2018. 【pdf】

13.    Kai Xu, Guorong Li, Huijuan Xu, Weigang Zhang, Qingming Huang, "Edge Guided Generation Network for Video Prediction," IEEE International Conference on Multimedia and Expo(ICME2018)San Diego, USA, July 23-27,2018. 【pdf】

14.    Xiaobin Liu, Shiliang Zhang, Qingming Huang, Wen Gao, "RAM: A Region-Aware Deep Model for Vehicle Re-Identification," IEEE International Conference on Multimedia and Expo(ICME2018), San Diego, USA, July 23-27,2018. 【pdf】

15.    Shuhui Wang, Liang Li, Chenxue Yang and Qingming Huang, "Regularized Topic-aware Latent Influence Propagation in Dynamic Relational Networks," MATES workshop VLDB, 2018. 【pdf】

16.    Chengpeng Chen, Xinhang Song, Shuqiang Jiang, "Focal Loss for Region Proposal Network," The Chinese Conference on Pattern Recognition and Computer Vision(PRCV2018), Guangzhou, China, 2018.11. 【pdf】

17.    Xinhang Song, Luis Herranz, Shuqiang Jiang, "Depth CNNs for RGB-D Scene Recognition: Learning from Scratch Better than Transferring from RGB-CNNs," AAAI Conference on Artificial Intelligence(AAAI2017), 2017. 【pdf】

18.    Junbao Zhuo, Shuhui Wang, Weigang Zhang, Qingming Huang, "Deep Unsupervised Convolutional Domain Adaptation," ACM Multimedia Conference(ACMMM2017), 2017. 【pdf】

19.    Liang Zhang, Bingpeng Ma, Guorong Li, Qingming Huang, Qi Tian, "Multi-Networks Joint Learning for Large-Scale Cross-Modal Retrieval," ACM Multimedia Conference(ACMMM2017), Mountain View, CA, USA, 2017. 【pdf】

20.    Weiqing Min, Shuqiang Jiang, Shuhui Wang, Jitao Sang, Shuhuan Mei, "A Delicious Recipe Analysis Framework for Exploring Multi-Modal Recipes with Various Aributes," ACM Multimedia Conference(ACMMM2017), 2017. 【pdf】


视觉信息处理和学习组
  • 单位地址:北京海淀区中关村科学院南路6号
  • 邮编:100190
  • 联系电话:010-62600514
  • Email:yi.cheng@vipl.ict.ac.cn
  • Valse

  • 深度学习大讲堂

版权所有 @ 中科院计算所视觉信息处理与学习组 京ICP备05002829号 京公网安备1101080060