Visual Information Processing and Learning Group, Institute of Computing Technology, Chinese Academy of Sciences
王树徽 (Shuhui Wang)

Title: Associate Professor

Research interests: multimedia analysis, machine learning, data mining

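Shuhui Wang received a B.Eng. degree from Tsinghua University in 2006 and a Ph.D. degree in engineering from the Institute of Computing Technology, Chinese Academy of Sciences (ICT, CAS) in July 2012, and completed postdoctoral work at ICT, CAS in October 2014. Wang's research covers multimedia content analysis and retrieval, big data theory and methods, and machine learning. Wang has published or had accepted more than 70 papers in top journals and conferences in multimedia analysis, computer vision, databases, and data mining, including the IEEE/ACM transactions TKDE, TMM, TIP, TCSVT, TKDD, and TIST, as well as ACM Multimedia, ICCV, CVPR, ECCV, SIGMOD, VLDB, ICDM, and CIKM, and holds 3 granted national patents. As principal investigator or key researcher, Wang has led or taken part in a number of national and provincial/ministerial projects, including National 973 Program projects, 863 Program projects, Key Program projects of the National Natural Science Foundation of China (NSFC), and NSFC General Program and Young Scientists Fund projects. Wang has served in organizing roles for international conferences such as ICME, PCM, and ICIMCS, and as a reviewer for leading international journals and conferences including TIP, TMM, TCSVT, TOMM, ACM MM, NIPS, CVPR, and IJCAI. Wang maintains long-standing contact, exchange, and collaboration with scholars and experts at many institutions in China and abroad. Awards include the CAS President's Award and the CCF Science and Technology Award.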



Academic Service
Journal Service
  • Reviewer for Information Sciences (Elsevier) and Pattern Recognition (Elsevier).
  • Reviewer for ACM-TKDD and ACM-TOMCCAP.
  • Reviewer for IEEE-TIP, IEEE-TMM, IEEE-TCSVT, IEEE-TCYB, IEEE-TBD and IEEE Access.
Conference Service
  • TPC member of IJCAI'18, AAAI'19, ACCV'18, PCM'18, PRCV'18, ChinaMM'18.
  • Publication Chair of PCM 2017.
  • PC Co-chair of the MASS workshop, with APWEB-WAIM 2017, Jul. 7, 2017.
  • PC Co-Chair, 1st International Workshop on Mobility Analytics for Spatio-temporal and Social Data (MATES), VLDB'17, Sept. 1, 2017.
  • Publication Chair, ACM International Conference on Internet Multimedia Computing and Service (ICIMCS'15), Aug. 17-21, 2015, Zhangjiajie, Hunan.
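Experience
Education
  • 2006.9 ~ 2012.7 Institute of Computing Technology, Chinese Academy of Sciences, Computer Application Technology, Ph.D. in Engineering (admitted without entrance examination)
  • 2002.9 ~ 2006.7 Tsinghua University, Electronic Information Engineering, B.Eng.
Research Experience
  • 2016~present: Multimodal active perception and understanding for open environments
  • 2011~present: Vision-language / cross-media correlation learning
  • 2009~2013: Image classification based on multi-feature fusion learning
  • 2006~2009: Understanding and retrieval of massive image and video data
  • 2006: Face classification based on dimensionality reduction learning
Research Topics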

1.   Heterogeneous multimedia big data analytics

Focus on recommendation systems, learning with noisy Web data and community discovery based on content and social context.

2.   Multimodal perception

Focus on knowledge/memory/language guided multimodal perception and representation, and human-like multimodal active perception.

3.   Vision and Multimedia Understanding

Focus on visual learning over a large number of categories, domain adaptation and multimodal semantic analytics.

4.   Multimodal reasoning and interaction

Focus on cross-modal representation learning, visual-linguistic retrieval, and interactive multimodal QA.
Research Projects

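1.   Object modeling and cross-domain analysis of heterogeneous big data

Project type: National Natural Science Foundation of China (NSFC) General Program
Duration: 2017.01-2020.12
Principal investigator: Shuhui Wang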

2.   Cross-media learning and content understanding

Project type: 973 Program project
Duration: 2012.1.1~2016.12.31
Principal investigator: Qingming Huang

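3.   Cross-media analysis based on multi-source information fusion and online community behavior modeling

Project type: National Natural Science Foundation of China (NSFC) Young Scientists Fund
Duration: 2014.01-2016.12
Principal investigator: Shuhui Wang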
Publications

Books

1.   Siyuan Liu, Shuhui Wang, Qiang Qu. Trajectory Mining. Book chapter in Encyclopedia of GIS, Springer, ISBN: 978-3-319-23519-6 (Online), 2017.

Papers

Major Publications (please see DBLP for the full publication list):

1. Yangyu Chen, Shuhui Wang, Weigang Zhang, Qingming Huang. Less is More: Picking Informative Frames for Video Captioning. ECCV, 2018. 

2. Shuhui Wang, Yangyu Chen, Junbao Zhuo, Qingming Huang, Qi Tian. Joint Global and Co-Attentive Representation Learning for Image-Sentence Retrieval. ACM Multimedia, 2018. (CCF-A conference, Oral, Acceptance rate: 8%)

3. Yiling Wu, Shuhui Wang, Qingming Huang. Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval. ACM Multimedia, 2018. (CCF-A conference)

4. Liang Li, Shuhui Wang, Shuqiang Jiang, Qingming Huang. Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification. ACM Multimedia, 2018. (CCF-A conference)

5. Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian: Multimodal Similarity Gaussian Process Latent Variable Model. IEEE Trans. Image Processing 26(9): 4168-4181 (2017). (CCF-A journal)

6. Jiaming Zhang, Shuhui Wang, Qingming Huang: Location-Based Parallel Tag Completion for Geo-Tagged Social Image Retrieval. ACM TIST 8(3): 38:1-38:21 (2017).

7. Siyuan Liu, Shuhui Wang: Trajectory Community Discovery and Recommendation by Multi-Source Diffusion Modeling. IEEE Trans. Knowl. Data Eng. 29(4): 898-911 (2017). (CCF-A journal)

8. Yiling Wu, Shuhui Wang, Qingming Huang: Online Asymmetric Similarity Learning for Cross-Modal Retrieval. CVPR 2017: 3984-3993. (CCF-A conference)

9. Shijie Yang, Liang Li, Shuhui Wang, Weigang Zhang, Qingming Huang: A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning. CVPR 2017: 7053-7061. (CCF-A conference)

10. Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian: Multimodal Gaussian Process Latent Variable Models with Harmonization. ICCV 2017: 5039-5047. (CCF-A conference)

12. Junbao Zhuo, Shuhui Wang, Weigang Zhang, Qingming Huang: Deep Unsupervised Convolutional Domain Adaptation. ACM Multimedia 2017: 261-269. (CCF-A conference)

13. Weiqing Min, Shuqiang Jiang, Shuhui Wang, Jitao Sang, Shuhuan Mei: A Delicious Recipe Analysis Framework for Exploring Multi-Modal Recipes with Various Attributes. ACM Multimedia 2017: 402-410. (CCF-A conference)

14. Lingyang Chu, Yanyan Zhang, Guorong Li, Shuhui Wang, Weigang Zhang, Qingming Huang: Effective Multimodality Fusion Framework for Cross-Media Topic Detection. IEEE Trans. Circuits Syst. Video Techn. 26(3): 556-569 (2016).

15. Yan Hua, Shuhui Wang, Siyuan Liu, Anni Cai, Qingming Huang: Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation. IEEE Trans. Multimedia 18(6): 1201-1216 (2016).

16. Shuhui Wang, Fuzhen Zhuang, Shuqiang Jiang, Qingming Huang, Qi Tian: Cluster-sensitive Structured Correlation Analysis for Web cross-modal retrieval. Neurocomputing 168: 747-760 (2015).

17. Lingyang Chu, Shuhui Wang, Siyuan Liu, Qingming Huang, Jian Pei: ALID: Scalable Dominant Cluster Detection. PVLDB 8(8): 826-837 (2015). (CCF-A conference)

18. Li Shen, Gang Sun, Qingming Huang, Shuhui Wang, Zhouchen Lin, Enhua Wu: Multi-Level Discriminative Dictionary Learning With Application to Large Scale Image Classification. IEEE Trans. Image Processing 24(10): 3109-3123 (2015). (CCF-A journal)

19. Siyuan Liu, Qiang Qu, Shuhui Wang: Rationality Analytics from Trajectories. TKDD 10(1): 10:1-10:22 (2015).

20. Siyuan Liu, Shuhui Wang, Feida Zhu: Structured Learning from Heterogeneous Behavior for Social Identity Linkage. IEEE Trans. Knowl. Data Eng. 27(7): 2005-2019 (2015). (CCF-A journal)

21. Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian: Similarity Gaussian Process Latent Variable Model for Multi-modal Data Analysis. ICCV 2015: 4050-4058. (CCF-A conference)

22. Shuhui Wang, Yiling Wu, Qingming Huang: Improving cross-modal correlation learning with hyperlinks. ICME 2015: 1-6.

23. Jiaming Zhang, Shuhui Wang, Qingming Huang: Location-Based Parallel Tag Completion for Geo-tagged Social Image Retrieval. ICMR 2015: 355-362.

24. Yan Hua, Shuhui Wang, Siyuan Liu, Qingming Huang, Anni Cai: TINA: Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation. ICDM 2014: 190-199.

25. Shuhui Wang, Zhenjun Wang, Shuqiang Jiang, Qingming Huang: Cross media topic analytics based on synergetic content and user behavior modeling. ICME 2014: 1-6.

26. Siyuan Liu, Shuhui Wang, Feida Zhu, Jinbo Zhang, Ramayya Krishnan: HYDRA: large-scale social identity linkage via heterogeneous behavior modeling. SIGMOD Conference 2014: 51-62. (CCF-A conference)

27. Lingyang Chu, Shuqiang Jiang, Shuhui Wang, Yanyan Zhang, Qingming Huang: Robust Spatial Consistency Graph Model for Partial Duplicate Image Retrieval. IEEE Trans. Multimedia 15(8): 1982-1996 (2013).

28. Siyuan Liu, Shuhui Wang, Kasthuri Jayarajah, Archan Misra, Ramayya Krishnan: TODMIS: mining communities from trajectories. CIKM 2013: 2109-2118.

29. Li Shen, Shuhui Wang, Gang Sun, Shuqiang Jiang, Qingming Huang: Multi-level Discriminative Dictionary Learning towards Hierarchical Visual Categorization. CVPR 2013: 383-390. (CCF-A conference)

30. Xin Jin, Fuzhen Zhuang, Shuhui Wang, Qing He, Zhongzhi Shi: Shared Structure Learning for Multiple Tasks with Multiple Views. ECML/PKDD (2) 2013: 353-368.

31. Shuhui Wang, Qingming Huang, Shuqiang Jiang, Qi Tian, Lei Qin: Nearest-neighbor method using multiple neighborhood similarities for social media data mining. Neurocomputing 95: 105-116 (2012).

32. Shuhui Wang, Qingming Huang, Shuqiang Jiang, Qi Tian: S3MKL: Scalable Semi-Supervised Multiple Kernel Learning for Real-World Image Applications. IEEE Trans. Multimedia 14(4): 1259-1274 (2012).

33. Shuhui Wang, Shuqiang Jiang, Qingming Huang, Qi Tian: Multi-feature metric learning with knowledge transfer among semantics and social tagging. CVPR 2012: 2240-2247. (CCF-A conference)

34. Shuhui Wang, Qingming Huang, Shuqiang Jiang, Qi Tian: Efficient lp-norm multiple feature metric learning for image categorization. CIKM 2011: 2077-2080.

35. Shuhui Wang, Shuqiang Jiang, Qingming Huang, Qi Tian: Multiple Kernel Learning with High Order Kernels. ICPR 2010: 2138-2141.

36. Shuhui Wang, Shuqiang Jiang, Qingming Huang, Qi Tian: S3MKL: scalable semi-supervised multiple kernel learning for image data mining. ACM Multimedia 2010: 163-172. (CCF-A conference)

37. Shuhui Wang, Qingming Huang, Shuqiang Jiang, Qi Tian: Nearest-neighbor classification using unlabeled data for real world image application. ACM Multimedia 2010: 1151-1154.


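Visual Information Processing and Learning Group
  • Address: No. 6 Kexueyuan South Road, Zhongguancun, Haidian District, Beijing
  • Postal code: 100190
  • Phone: 010-62600514
  • Email: yi.cheng@vipl.ict.ac.cn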