Visual Information Processing and Learning Group, Institute of Computing Technology, Chinese Academy of Sciences



Research
Multimedia Computing and Multimodal Intelligence Group

Group leader: Shuqiang Jiang, Professor

Email:sqjiang [at] dot ict dot ac dot cn

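*  The group currently has 1 professor, 1 associate professor, 1 postdoctoral researcher, and more than 10 doctoral and master's students.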

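*  The group has undertaken or is undertaking more than twenty projects, including the National Natural Science Foundation of China (NSFC) Excellent Young Scientists Fund, NSFC Key Program projects, NSFC General Program projects, National 863 Program projects, Beijing Municipal Foundation key projects, Beijing science and technology projects, and industry collaboration projects.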

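*  Awards:

The group's search-based multi-object recognition technology won the ACM ICMR 2013 Best Demo Award;
its multi-sensor visual recognition technology won first place in the 2013 ImageCLEF Robot Vision competition;
its work on the joint understanding of images and language won first place in the ACM Multimedia 2016 Yahoo-Flickr Challenge on Caption Prediction.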
Datasets:

An instance-level image dataset for complex scenes. Homepage: http://vipl.ict.ac.cn/isia/instre/; paper: Shuang Wang, Shuqiang Jiang, INSTRE: A New Benchmark for Instance-Level Object Retrieval and Recognition. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Vol.11(3), pp.37:1-37:21, 2015
A multi-sensor hand-held object detection dataset. Homepage: http://vipl.ict.ac.cn/isia/HOD/; paper: Xiong Lv, Shuqiang Jiang, Luis Herranz, Shuang Wang, RGB-D Hand-Held Object Recognition Based on Heterogeneous Feature Fusion. Journal of Computer Science and Technology, Vol.30(2), pp.340-352, 2015
A multimodal food image dataset with geographic information. Homepage: http://vipl.ict.ac.cn/isia/datasets_dish/index.html; paper: Ruihan Xu, Luis Herranz, Shuqiang Jiang, Shuang Wang, Xinhang Song, Ramesh Jain, Geolocalized Modeling for Dish Recognition. IEEE Trans. Multimedia, Vol.17(8), pp.1187-1199, 2015
Research Topics

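*  Analysis, understanding, and retrieval of multimedia information such as images and video;
*  Multimodal association, fusion, and understanding across vision, language, knowledge bases, and various kinds of contextual information;
*  Multimodal intelligent interaction.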


Selected Publications

Journal Papers

1.    Weiqing Min, Shuqiang Jiang, Linhu Liu, Yong Rui and Ramesh Jain, "A Survey on Food Computing," ACM Computing Surveys (CSUR) (Accepted), 2019.

2.    Shuqiang Jiang, Weiqing Min, Linhu Liu, Zhengdong Luo, "Multi-Scale Multi-View Deep Feature Aggregation for Food Recognition," IEEE Transactions on Image Processing (Accepted) 2019.

3.    Shuqiang Jiang, Weiqing Min, Shuhuan Mei, "Hierarchy-Dependent Cross-Platform Multi-View Feature Learning for Venue Category Prediction," IEEE Transactions on Multimedia, 21(6): 1609–1619, 2019. 【pdf】

4.    Chengpeng Chen, Weiqing Min, Xue Li, Shuqiang Jiang, "Hybrid incremental learning of new data and new classes for hand-held object recognition," Journal of Visual Communication and Image Representation(JVCI), 58: 138-148, 2019 【pdf】

5.    Shuhuan Mei, Weiqing Min, Hua Duan, Shuqiang Jiang, "Instance-level object retrieval via deep region CNN," Multimedia Tools and Applications (MTAP), 78(10): 13247-13261, 2019. 【pdf】

6.    Weiqing Min, Bing-Kun Bao, Shuhuan Mei, Yaohui Zhu, Yong Rui, Shuqiang Jiang, "You Are What You Eat: Exploring Rich Recipe Information for Cross-Region Food Analysis," IEEE Trans. Multimedia, 20(4): 950-964, 2018. 【pdf】

7.    Chunjie Zhang, Guibo Zhu, Chao Liang, Yifan Zhang, Qingming Huang, Qi Tian, “Image Class Prediction by Joint Object, Context, and Background Modeling”, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 28, Issue 2, 2018, 428-438. 【pdf】

8.    Zhe Xue, Guorong Li*, Shuhui Wang, Weigang Zhang, Qingming Huang*, “Bilevel Multiview Latent Space Learning”, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 28, Issue 2, 2018, 327-341. 【pdf】

9.    Dawei Du, Longyin Wen, Honggang Qi, Qingming Huang, Qi Tian, and Siwei Lyu, “Iterative Graph Seeking for Object Tracking”, IEEE Transactions on Image Processing, Vol. 27, Issue 4, 2018, 1809-1821. 【pdf】

10.    Xinhang Song, Shuqiang Jiang, Luis Herranz, Chengpeng Chen, "Learning Effective RGB-D Representations for Scene Recognition," IEEE Transactions on Image Processing (TIP), doi:10.1109/TIP.2018.2872629. 【pdf】

11.    Yuankai Qi, Lei Qin, Jian Zhang, Shengping Zhang, Ming-Hsuan Yang, Qingming Huang, “Structure-aware Local Sparse Coding for Visual Tracking”, IEEE Transactions on Image Processing, Vol. 27, Issue 8, 2018, 3857-3869. 【pdf】

12.    Junbiao Pang, Jing Huang, Xue Yang, Zuyun Wang, Haitao Yu, Qingming Huang, Baocai Yin, “Discovering Fine-Grained Spatial Pattern From Taxi Trips: Where Point Process Meets Matrix Decomposition and Factorization”, IEEE Transactions on Intelligent Transportation Systems, Vol. 19, Issue 10, 2018, 3208 – 3219. 【pdf】

13.    Liang Zhang, Bingpeng Ma, Guorong Li, Qingming Huang, Qi Tian, “Generalized Semi-supervised and Structured Subspace Learning for Cross-modal Retrieval”, IEEE Transactions on Multimedia, Vol. 20, Issue 1, 2018, 128-141. 【pdf】

14.    Xiangyang Li, Shuqiang Jiang, "Bundled Object Context for Referring Expressions," IEEE Transactions on Multimedia (TMM), vol. 20, no. 10, pp. 2749-2760, 2018. 【pdf】

15.    Zhe Xue, Guorong Li, Qingming Huang, "Joint Multi-view Representation and Image Annotation via Optimal Predictive Subspace Learning," Information Sciences, Vol. 451-452, 2018, 180-194. 【pdf】

16.    Hongyang Yu, Lei Qin, Qingming Huang, Hongxun Yao, "Online Multiple Object Tracking via Exchanging Object Context," Neurocomputing. Vol. 292, 2018, 28-37. 【pdf】

17.    Jianfeng He, Bingpeng Ma, Shuhui Wang, Yugui Liu, Qingming Huang, "Multi-label Double Layers Learning for Cross-Modal Retrieval," Neurocomputing, Vol. 275, 2018, 1893-1902. 【pdf】

18.    Xiaofeng Mao, Shuhui Wang, Liying Zheng, Qingming Huang, "Semantic Invariant Cross-domain Image Generation with Generative Adversarial Networks," Neurocomputing, Vol. 293, 2018, 55-63. 【pdf】

19.    Jiaming Zhang, Shuhui Wang, Qingming Huang, "Location-Based Parallel Tag Completion for Geo-Tagged Social Image Retrieval," ACM Transactions on Intelligent Systems and Technology, vol. 8, no. 3, pp. 1-21, 2017. 【pdf】

20.    Siyuan Liu, Shuhui Wang, Qiang Qu, "Trajectory Mining," Encyclopedia of GIS, pp. 2310-2313, 2017.

Conference Papers

1.    Weiqing Min, Linhu Liu, Zhengdong Luo, Shuqiang Jiang, "Ingredient-Guided Cascaded Multi-Attention Network for Food Recognition," ACM International Conference on Multimedia (ACMMM), 2019. (Accepted)

2.    Xinhang Song, Bohan Wang, Gongwei Chen, Shuqiang Jiang, "MUCH: MUtual Coupling enHancement of scene recognition and dense captioning," ACM International Conference on Multimedia (ACMMM), 2019. (Accepted)

3.    Xinhang Song, Sixian Zhang, Yuyun Hua, Shuqiang Jiang, "Aberrance-aware gradient-sensitive attentions for scene recognition with RGB-D videos," ACM International Conference on Multimedia (ACMMM), 2019. (Accepted)

4.    Yongqing Zhu, Shuqiang Jiang, "Attention-based Densely Connected LSTM for Video Captioning," ACM International Conference on Multimedia (ACMMM), 2019. (Accepted)

5.    Yangbangyan Jiang, Zhiyong Yang, Qianqian Xu, Xiaochun Cao, Qingming Huang, “When to Learn What: Deep Cognitive Subspace Clustering”, 26th ACM International Conference on Multimedia (ACMMM2018), Seoul, Korea, October 22-26, 2018. 【pdf】

6.    Qianqian Xu, Jiechao Xiong, Xinwei Sun, Zhiyong Yang, Xiaochun Cao, Qingming Huang, Yuan Yao, “A Margin-based MLE for Crowdsourced Partial Ranking”, 26th ACM International Conference on Multimedia (ACMMM2018), Seoul, Korea, October 22-26, 2018. 【pdf】

7.    Yiling Wu, Shuhui Wang, Qingming Huang, “Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval”, 26th ACM International Conference on Multimedia (ACMMM2018), Seoul, Korea, October 22-26, 2018. 【pdf】

8.    Zhiyong Yang, Qianqian Xu, Xiaochun Cao, Qingming Huang*, “From Common to Special: When Multi-Attribute Learning Meets Personalized Opinions”, 32nd AAAI Conference on Artificial Intelligence (AAAI2018), New Orleans, Louisiana, United States, Feb 2-7, 2018. 【pdf】

9.    Yaohui Zhu, Shuqiang Jiang, "Deep Structured Learning for Visual Relationship Detection," Thirty-Second AAAI Conference on Artificial Intelligence (AAAI2018), New Orleans, Louisiana, USA, February 2-7, 2018. 【pdf】

10.    Yongjian Xin, Shuhui Wang, Liang Li, Weigang Zhang, Qingming Huang, "Reverse Densely Connected Feature Pyramid Network for Object Detection," 14th Asian Conference on Computer Vision (ACCV2018), Perth, Australia, December 2-6, 2018. 【pdf】

11.    Shulin Li, Weigang Zhang, Guorong Li, Li Su, Qingming Huang, "Vehicle Detection in UAV Traffic Video Based on Convolution Neural Network," IEEE 1st International Conference on Multimedia Information Processing and Retrieval (MIPR2018), Miami, FL, USA, April 10-12, 2018. 【pdf】

12.    Changsu Liao, Li Su, Weigang Zhang, Qingming Huang, "Semantic Manifold Alignment in Visual Feature Space for Zero-Shot Learning," IEEE International Conference on Multimedia and Expo(ICME2018), San Diego, USA, July 23-27, 2018. 【pdf】

13.    Kai Xu, Guorong Li, Huijuan Xu, Weigang Zhang, Qingming Huang, "Edge Guided Generation Network for Video Prediction," IEEE International Conference on Multimedia and Expo (ICME2018), San Diego, USA, July 23-27, 2018. 【pdf】

14.    Xiaobin Liu, Shiliang Zhang, Qingming Huang, Wen Gao, "RAM: A Region-Aware Deep Model for Vehicle Re-Identification," IEEE International Conference on Multimedia and Expo (ICME2018), San Diego, USA, July 23-27, 2018. 【pdf】

15.    Shuhui Wang, Liang Li, Chenxue Yang and Qingming Huang, "Regularized Topic-aware Latent Influence Propagation in Dynamic Relational Networks," MATES workshop VLDB, 2018. 【pdf】

16.    Chengpeng Chen, Xinhang Song, Shuqiang Jiang, "Focal Loss for Region Proposal Network," The Chinese Conference on Pattern Recognition and Computer Vision(PRCV2018), Guangzhou, China, 2018.11. 【pdf】

17.    Xinhang Song, Luis Herranz, Shuqiang Jiang, "Depth CNNs for RGB-D Scene Recognition: Learning from Scratch Better than Transferring from RGB-CNNs," AAAI Conference on Artificial Intelligence(AAAI2017), 2017. 【pdf】

18.    Liang Zhang, Bingpeng Ma, Guorong Li, Qingming Huang, Qi Tian, "Multi-Networks Joint Learning for Large-Scale Cross-Modal Retrieval," ACM Multimedia Conference(ACMMM2017), Mountain View, CA, USA, 2017. 【pdf】

19.    Weiqing Min, Shuqiang Jiang, Shuhui Wang, Jitao Sang, Shuhuan Mei, "A Delicious Recipe Analysis Framework for Exploring Multi-Modal Recipes with Various Attributes," ACM Multimedia Conference (ACMMM2017), 2017. 【pdf】

20.    Xinhang Song, Chengpeng Chen, Shuqiang Jiang, "RGB-D Scene Recognition with Object-to-Object Relation," ACM Multimedia Conference(ACMMM2017), 2017. 【pdf】

