中科院计算所视觉信息处理与学习组
中科院计算所视觉信息处理与学习组
王树徽 (Shuhui Wang)

职 称:副研究员 (Associate Professor)

电子邮箱:wangshuhui@ict.ac.cn and shuhui.wang@vipl.ict.ac.cn

通讯地址:北京市海淀区科学院南路6号(Tel: 010-62600573)

研究方向:视觉及媒体内容理解与交互、机器学习、数据挖掘

王树徽,2006年于清华大学获得工学学士学位,2012年7月于中国科学院计算技术研究所获得工学博士学位,2014年10月从中国科学院计算所博士后出站。从事多媒体内容分析检索技术、大数据理论与方法、机器学习等方面的研究,已在诸如IEEE/ACM顶级汇刊TKDE、TMM、TIP、TCSVT、TKDD、TIST,以及ACM Multimedia、ICCV、CVPR、ECCV、SIGMOD、VLDB、ICDM,CIKM等多媒体分析,计算机视觉、数据库和数据挖掘领域的顶级期刊和会议上发表和录用学术论文70多篇,授权国家专利3项。作为项目负责人或研究骨干,承担和参与了多项国家和省部级项目的研究,包括国家973课题、863计划项目、国家自然基金重点项目,国家自然基金面上和青年项目等。担任ICME、PCM、ICIMCS等国际会议的会议组织工作,并担任TIP、TKDE、TMM、TCSVT、TOMM、ACMMM、NIPS、CVPR、AAAI、IJCAI等高水平国际期刊和会议的审稿人。长期与本领域国内外多个单位的学者与专家保持着广泛的接触、交流与合作。曾获中科院院长奖,CCF科学技术奖等奖励。



学术服务
刊物服务
  • Reviewer of Information Science (Elsevier) and Pattern Recognition (Elsevier).
  • Reviewer of ACM-TKDD and ACM-TOMCCAP
  • Reviewer of IEEE-TIP, IEEE-TMM, IEEE-TCSVT, IEEE-TCYB, IEEE-TBD and IEEE Access
会议服务
  • TPC member of IJCAI'18, AAAI'19, ACCV'18, PCM'18, PRCV'18, ChinaMM'18.
  • Publication Chair of PCM 2017.
  • PC Co-chair of the MASS workshop, with APWEB-WAIM 2017, Jul. 7, 2017.
  • PC Co-Chair, 1st International Workshop on Mobility Analytics for Spatio-temporal and Social Data (MATES),VLDB'17, Sept. 1, 2017.
  • Publication Chair, ACM International Conference on Internet Multimedia Computing and Service (ICIMCS'15), 17th-21th, Aug, 2015, Zhangjiajie, Hunan.
经历
教育经历
  • 2006.9 ~ 2012.7 中国科学院计算技术研究所 计算机应用技术 工学博士(免试)
  • 2002.9 ~ 2006.7 清华大学 电子信息工程 工学学士
学术经历
  • 2016~今, 面向开放环境的多模态主动感知与理解技术研究
  • 2011~今,视觉-语言/跨媒体关联学习技术研究
  • 2009~2013:基于多特征融合学习的图像分类技术研究
  • 2006~2009: 海量图像视频理解与检索技术研究
  • 2006: 基于降维学习的人脸分类技术研究
研究内容

1.   多源异构媒体大数据分析

包括:多媒体内容推荐,网络有噪声数据的机器学习方法、网络社群分析

2.   多模态主动感知与表达

感知增强,知识、记忆和语言驱动的类人多模态主动感知技术。

3.   视觉与跨媒体理解

知识补全与知识泛化,领域适应迁移学习,视觉物体、场景、事件检测与分类,多模协同的媒体主题分析

4.   多模态推理及交互

跨模态表征学习,视觉-语言检索,视觉内容概述,多模态问答及对话等。

科研项目

1.   异构大数据的对象建模及跨域分析技术研究

项目类型:国家自然科学基金面上项目
项目时间:2017.01-2020.12
项目负责人:王树徽

2.   跨媒体学习与内容理解

项目类型:973计划课题
项目时间:2012.1.1~2016.12.31
项目负责人:黄庆明

3.   基于多源信息融合和网络社群行为建模的跨媒体分析技术研究

项目类型:国家自然科学基金青年基金项目
项目时间:2014.01-2016.12
项目负责人:王树徽
著论

著作

1.   Siyuan Liu , Shuhui Wang, Qiang Qu. Trajectory Mining. Book chapter of Encyclopedia of GIS, Springer, ISBN: 978-3-319-23519-6 (Online), 2017. 【pdf】

论文

Major Publication (Please see DBLP for full publication list):



1.  Shuhui Wang, Liang Li, Chenxue Yang, Qingming Huang. Regularized Topic-aware Latent Influence Propagation in Dynamic Relational Networks. GeoInformatica, accepted.

2. Shijie Yang, Liang Li, Shuhui Wang, Weigang Zhang, Qingming Huang, Qi Tian. SkeletonNet: A Hybrid Network with a Skeleton-Embedding Process for Multi-view Image Representation Learning. IEEE Transactions on Multimedia, accepted.

3. Yiling Wu, Shuhui Wang, Guoli Song, Qingming Huang. Online Asymmetric Metric Learning with Multi-Layer Similarity Aggregation for Cross-Modal Retrieval. IEEE Transaction on Image Processing (CCF-A类期刊) , accepted. Code

4. Junbao Zhuo, Shuhui Wang, Shuhao Cui, Qingming Huang. Unsupervised Domain Adaptation by Semantic Discrepancy Minimization. In CVPR, 2019. (CCF-A类会议) PaperCode

5. Zhe Xue, Guorong Li, Shuhui Wang, Weigang Zhang, Qingming Huang. Bilevel Multiview Latent Space Learning. IEEE Trans. Circuits Syst. Video Techn. 28(2): 327-341, 2018.

6. Yangyu Chen, Shuhui Wang, Weigang Zhang, Qingming Huang. Less is More: Picking Informative Frames for Video Captioning. ECCV, 2018. 

7. Shuhui Wang, Yangyu Chen, Junbao Zhuo, Qingming Huang, Qi Tian. Joint Global and Co-Attentive Representation Learning for Image-Sentence Retrieval. ACM Multimedia, 2018. (CCF-A类会议,Oral, Acceptance rate: 8%)

8. Yiling Wu, Shuhui Wang, Qingming Huang. Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval. ACM Multimedia, 2018. (CCF-A类会议)

9. Liang Li, Shuhui Wang, Shuqiang Jiang, Qingming Huang. Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification. ACM Multimedia, 2018. (CCF-A类会议)

10. Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian: Multimodal Similarity Gaussian Process Latent Variable Model. IEEE Trans. Image Processing 26(9): 4168-4181 (2017). (CCF-A类期刊) Code

11. Jiaming Zhang, Shuhui Wang, Qingming Huang: Location-Based Parallel Tag Completion for Geo-Tagged Social Image Retrieval. ACM TIST 8(3): 38:1-38:21 (2017).

12. Siyuan Liu, Shuhui Wang: Trajectory Community Discovery and Recommendation by Multi-Source Diffusion Modeling. IEEE Trans. Knowl. Data Eng. 29(4): 898-911 (2017). (CCF-A类期刊)

13. Yiling Wu, Shuhui Wang, Qingming Huang: Online Asymmetric Similarity Learning for Cross-Modal Retrieval. CVPR 2017: 3984-3993. (CCF-A类会议)

14. Shijie Yang, Liang Li, Shuhui Wang, Weigang Zhang, Qingming Huang: A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning. CVPR 2017: 7053-7061. (CCF-A类会议)

15. Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian: Multimodal Gaussian Process Latent Variable Models with Harmonization. ICCV 2017: 5039-5047. (CCF-A类会议) Code

16. Junbao Zhuo, Shuhui Wang, Weigang Zhang, Qingming Huang: Deep Unsupervised Convolutional Domain Adaptation. ACM Multimedia 2017: 261-269. (CCF-A类会议)

17. Weiqing Min, Shuqiang Jiang, Shuhui Wang, Jitao Sang, Shuhuan Mei: A Delicious Recipe Analysis Framework for Exploring Multi-Modal Recipes with Various Attributes. ACM Multimedia 2017: 402-410. (CCF-A类会议)

18. Lingyang Chu, Yanyan Zhang, Guorong Li, Shuhui Wang, Weigang Zhang, Qingming Huang: Effective Multimodality Fusion Framework for Cross-Media Topic Detection. IEEE Trans. Circuits Syst. Video Techn. 26(3): 556-569 (2016).

19. Yan Hua, Shuhui Wang, Siyuan Liu, Anni Cai, Qingming Huang: Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation. IEEE Trans. Multimedia 18(6): 1201-1216 (2016).

20. Shuhui Wang, Fuzhen Zhuang, Shuqiang Jiang, Qingming Huang, Qi Tian: Cluster-sensitive Structured Correlation Analysis for Web cross-modal retrieval. Neurocomputing 168: 747-760 (2015).

21. Lingyang Chu, Shuhui Wang, Siyuan Liu, Qingming Huang, Jian Pei: ALID: Scalable Dominant Cluster Detection. PVLDB 8(8): 826-837 (2015). (CCF-A类会议)

22. Li Shen, Gang Sun, Qingming Huang, Shuhui Wang, Zhouchen Lin, Enhua Wu: Multi-Level Discriminative Dictionary Learning With Application to Large Scale Image Classification. IEEE Trans. Image Processing 24(10): 3109-3123 (2015). (CCF-A类期刊)

23. Siyuan Liu, Qiang Qu, Shuhui Wang: Rationality Analytics from Trajectories. TKDD 10(1): 10:1-10:22 (2015).

24. Siyuan Liu, Shuhui Wang, Feida Zhu: Structured Learning from Heterogeneous Behavior for Social Identity Linkage. IEEE Trans. Knowl. Data Eng. 27(7): 2005-2019 (2015). (CCF-A类期刊)

25. Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian: Similarity Gaussian Process Latent Variable Model for Multi-modal Data Analysis. ICCV 2015: 4050-4058. (CCF-A类会议)

26. Yan Hua, Shuhui Wang, Siyuan Liu, Qingming Huang, Anni Cai: TINA: Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation. ICDM 2014: 190-199.

27. Siyuan Liu, Shuhui Wang, Feida Zhu, Jinbo Zhang, Ramayya Krishnan: HYDRA: large-scale social identity linkage via heterogeneous behavior modeling. SIGMOD Conference 2014: 51-62. (CCF-A类会议)

28. Lingyang Chu, Shuqiang Jiang, Shuhui Wang, Yanyan Zhang, Qingming Huang: Robust Spatial Consistency Graph Model for Partial Duplicate Image Retrieval. IEEE Trans. Multimedia 15(8): 1982-1996 (2013).

29. Siyuan Liu, Shuhui Wang, Kasthuri Jayarajah, Archan Misra, Ramayya Krishnan: TODMIS: mining communities from trajectories. CIKM 2013: 2109-2118.

30. Li Shen, Shuhui Wang, Gang Sun, Shuqiang Jiang, Qingming Huang: Multi-level Discriminative Dictionary Learning towards Hierarchical Visual Categorization. CVPR 2013: 383-390. (CCF-A类会议)

31. Shuhui Wang, Qingming Huang, Shuqiang Jiang, Qi Tian, Lei Qin: Nearest-neighbor method using multiple neighborhood similarities for social media data mining. Neurocomputing 95: 105-116 (2012).

32. Shuhui Wang, Qingming Huang, Shuqiang Jiang, Qi Tian: S3MKL: Scalable Semi-Supervised Multiple Kernel Learning for Real-World Image Applications. IEEE Trans. Multimedia 14(4): 1259-1274 (2012).

33. Shuhui Wang, Shuqiang Jiang, Qingming Huang, Qi Tian: Multi-feature metric learning with knowledge transfer among semantics and social tagging. CVPR 2012: 2240-2247. (CCF-A类会议)

34. Shuhui Wang, Shuqiang Jiang, Qingming Huang, Qi Tian: S3MKL: scalable semi-supervised multiple kernel learning for image data mining. ACM Multimedia 2010: 163-172. (CCF-A类会议)





视觉信息处理和学习组
  • 单位地址:北京海淀区中关村科学院南路6号
  • 邮编:100190
  • 联系电话:010-62600514
  • Email:yi.cheng@vipl.ict.ac.cn
  • Valse

  • 深度学习大讲堂

版权所有 @ 中科院计算所视觉信息处理与学习组 京ICP备05002829号 京公网安备1101080060