Visual Information Processing and Learning Group (VIPL), Institute of Computing Technology, Chinese Academy of Sciences
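Shuhui Wang (王树徽)

Title: Associate Professor

Email: wangshuhui@ict.ac.cn and shuhui.wang@vipl.ict.ac.cn

Address: No. 6 Kexueyuan South Road, Haidian District, Beijing (Tel: 010-62600573)

Research interests: visual and media content understanding and interaction, machine learning, data mining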

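Shuhui Wang received a B.Eng. degree from Tsinghua University in 2006 and a Ph.D. in engineering from the Institute of Computing Technology (ICT), Chinese Academy of Sciences, in July 2012, and completed postdoctoral research at ICT in October 2014. Wang's research covers multimedia content analysis and retrieval, big data theory and methods, and machine learning. Wang has published or had accepted more than 70 papers in top journals and conferences in multimedia analysis, computer vision, databases, and data mining, including the IEEE/ACM transactions TKDE, TMM, TIP, TCSVT, TKDD, and TIST, and the conferences ACM Multimedia, ICCV, CVPR, ECCV, SIGMOD, VLDB, ICDM, and CIKM, and holds 3 granted national patents. Wang has served in organizing roles for international conferences including ICME, PCM, and ICIMCS, and as a reviewer for leading international journals and conferences including TIP, TKDE, TMM, TCSVT, TOMM, ACM MM, NIPS, CVPR, ICCV, AAAI, and IJCAI.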


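Students with a strong interest and relevant research background in frontier topics such as image and video understanding, image-text retrieval and content transformation and generation, cross-media analysis and reasoning, and cross-media knowledge engineering are welcome to apply for graduate study!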

Academic Service
Journal Service
  • Reviewer of Information Sciences (Elsevier) and Pattern Recognition (Elsevier).
  • Reviewer of ACM-TKDD and ACM-TOMCCAP.
  • Reviewer of IEEE-TIP, IEEE-TKDE, IEEE-TMM, IEEE-TCSVT, IEEE-TCYB, IEEE-TBD and IEEE Access.
Conference Service
  • Area Chair of ACM Multimedia 2019.
  • TPC member of ICCV'19, CVPR'19, AAAI'19, IJCAI'18, ACCV'18, PCM'18, PRCV'18, ChinaMM'18.
  • Publication Chair of PCM 2017.
  • PC Co-chair of the MASS workshop, with APWEB-WAIM 2017, Jul. 7, 2017.
  • PC Co-Chair, 1st International Workshop on Mobility Analytics for Spatio-temporal and Social Data (MATES), VLDB'17, Sept. 1, 2017.
  • Publication Chair, ACM International Conference on Internet Multimedia Computing and Service (ICIMCS'15), Aug. 17-21, 2015, Zhangjiajie, Hunan.
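Experience
Education
  • 2006.9 ~ 2012.7: Institute of Computing Technology, Chinese Academy of Sciences; Computer Application Technology; Ph.D. in Engineering (admitted without entrance examination)
  • 2002.9 ~ 2006.7: Tsinghua University; Electronic Information Engineering; B.Eng.
Research Experience
  • 2016~present: Research on multimodal active perception and understanding in open environments
  • 2011~present: Research on vision-language / cross-media correlation learning
  • 2009~2013: Research on image classification based on multi-feature fusion learning
  • 2006~2009: Research on large-scale image and video understanding and retrieval
  • 2006: Research on face classification based on dimensionality reduction learning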
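Research Topics

1.   Analysis of multi-source heterogeneous media big data

Includes: multimedia content recommendation, machine learning methods for noisy web data, and online community analysis.

2.   Multimodal active perception and representation

Includes: perception enhancement, and human-like multimodal active perception driven by knowledge, memory, and language.

3.   Visual and cross-media understanding

Includes: domain adaptation and transfer learning; classification of objects, scenes, and events in images and videos; multimodal collaborative semantic analysis of media topics; and cross-media knowledge graph construction and learning.

4.   Multimodal reasoning and interaction

Includes: cross-modal representation learning, knowledge reasoning and knowledge generalization, vision-language retrieval, visual content summarization and content generation, and multimodal question answering and dialogue.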

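Research Projects

1.   Research on Object Modeling and Cross-Domain Analysis Techniques for Heterogeneous Big Data

Project type: National Natural Science Foundation of China (NSFC), General Program
Project period: 2017.01-2020.12
Principal investigator: Shuhui Wang

2.   Cross-Media Learning and Content Understanding

Project type: National Basic Research Program of China (973 Program), sub-project
Project period: 2012.1.1~2016.12.31
Principal investigator: Qingming Huang

3.   Research on Cross-Media Analysis Techniques Based on Multi-Source Information Fusion and Online Community Behavior Modeling

Project type: National Natural Science Foundation of China (NSFC), Young Scientists Fund
Project period: 2014.01-2016.12
Principal investigator: Shuhui Wang
Publications

Books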

1.   Siyuan Liu, Shuhui Wang, Qiang Qu. Trajectory Mining. Book chapter of Encyclopedia of GIS, Springer, ISBN: 978-3-319-23519-6 (Online), 2017. [pdf]

Papers

Major Publications (please see DBLP for the full list):


1.  Yiling Wu, Shuhui Wang, Qingming Huang. Online Fast Adaptive Low-rank Similarity Learning for Cross-Modal Retrieval. IEEE Transactions on Multimedia (TMM), accepted.

2. Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian. Harmonized Multimodal Learning with Gaussian Process Latent Variable Models. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), accepted. Paper

3. Xuejing Liu, Liang Li, Shuhui Wang, Zheng-Jun Zha, Dechao Meng, Qingming Huang. Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding. ICCV, 2019. Code

4. Yiling Wu, Shuhui Wang, Guoli Song, Qingming Huang. Learning Fragment Self-Attention Embeddings for Image-Text Matching. ACM Multimedia, 2019. (oral) Code

5. Xuejing Liu, Liang Li, Shuhui Wang, Zhengjun Zha, Li Su, Qingming Huang. Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding. ACM Multimedia, 2019. (oral)

6. Shijie Yang, Liang Li, Shuhui Wang, Dechao Meng, Qingming Huang, Qi Tian. Structured Stochastic Recurrent Network for Linguistic Video Prediction. ACM Multimedia, 2019. (oral)

7. Shuhui Wang, Liang Li, Chenxue Yang, Qingming Huang. Regularized Topic-aware Latent Influence Propagation in Dynamic Relational Networks. GeoInformatica, 23(3): 329-352, 2019. Paper

8. Shijie Yang, Liang Li, Shuhui Wang, Weigang Zhang, Qingming Huang, Qi Tian. SkeletonNet: A Hybrid Network with a Skeleton-Embedding Process for Multi-view Image Representation Learning. IEEE Transactions on Multimedia, accepted.

9. Yiling Wu, Shuhui Wang, Guoli Song, Qingming Huang. Online Asymmetric Metric Learning with Multi-Layer Similarity Aggregation for Cross-Modal Retrieval. IEEE Transactions on Image Processing, vol. 28, no. 9, pp. 4299-4312, 2019. Code

10. Junbao Zhuo, Shuhui Wang, Shuhao Cui, Qingming Huang. Unsupervised Open Domain Recognition by Semantic Discrepancy Minimization. In CVPR, 2019. Paper, Code

11. Zhe Xue, Guorong Li, Shuhui Wang, Weigang Zhang, Qingming Huang. Bilevel Multiview Latent Space Learning. IEEE Trans. Circuits Syst. Video Techn. 28(2): 327-341, 2018.

12. Yangyu Chen, Shuhui Wang, Weigang Zhang, Qingming Huang. Less is More: Picking Informative Frames for Video Captioning. ECCV, 2018. Code

13. Shuhui Wang, Yangyu Chen, Junbao Zhuo, Qingming Huang, Qi Tian. Joint Global and Co-Attentive Representation Learning for Image-Sentence Retrieval. ACM Multimedia, 2018. (Oral, Acceptance rate: 8%)

14. Yiling Wu, Shuhui Wang, Qingming Huang. Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval. ACM Multimedia, 2018. 

15. Liang Li, Shuhui Wang, Shuqiang Jiang, Qingming Huang. Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification. ACM Multimedia, 2018. 

16. Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian: Multimodal Similarity Gaussian Process Latent Variable Model. IEEE Trans. Image Processing 26(9): 4168-4181 (2017). Code

17. Jiaming Zhang, Shuhui Wang, Qingming Huang: Location-Based Parallel Tag Completion for Geo-Tagged Social Image Retrieval. ACM TIST 8(3): 38:1-38:21 (2017).

18. Siyuan Liu, Shuhui Wang: Trajectory Community Discovery and Recommendation by Multi-Source Diffusion Modeling. IEEE Trans. Knowl. Data Eng. 29(4): 898-911 (2017). 

19. Yiling Wu, Shuhui Wang, Qingming Huang: Online Asymmetric Similarity Learning for Cross-Modal Retrieval. CVPR 2017: 3984-3993. 

20. Shijie Yang, Liang Li, Shuhui Wang, Weigang Zhang, Qingming Huang: A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning. CVPR 2017: 7053-7061. 

21. Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian: Multimodal Gaussian Process Latent Variable Models with Harmonization. ICCV 2017: 5039-5047. Code

22. Junbao Zhuo, Shuhui Wang, Weigang Zhang, Qingming Huang: Deep Unsupervised Convolutional Domain Adaptation. ACM Multimedia 2017: 261-269. 

23. Weiqing Min, Shuqiang Jiang, Shuhui Wang, Jitao Sang, Shuhuan Mei: A Delicious Recipe Analysis Framework for Exploring Multi-Modal Recipes with Various Attributes. ACM Multimedia 2017: 402-410. 

24. Lingyang Chu, Yanyan Zhang, Guorong Li, Shuhui Wang, Weigang Zhang, Qingming Huang: Effective Multimodality Fusion Framework for Cross-Media Topic Detection. IEEE Trans. Circuits Syst. Video Techn. 26(3): 556-569 (2016).

25. Yan Hua, Shuhui Wang, Siyuan Liu, Anni Cai, Qingming Huang: Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation. IEEE Trans. Multimedia 18(6): 1201-1216 (2016).

26. Lingyang Chu, Shuhui Wang, Siyuan Liu, Qingming Huang, Jian Pei: ALID: Scalable Dominant Cluster Detection. PVLDB 8(8): 826-837 (2015). 

27. Li Shen, Gang Sun, Qingming Huang, Shuhui Wang, Zhouchen Lin, Enhua Wu: Multi-Level Discriminative Dictionary Learning With Application to Large Scale Image Classification. IEEE Trans. Image Processing 24(10): 3109-3123 (2015). 

28. Siyuan Liu, Qiang Qu, Shuhui Wang: Rationality Analytics from Trajectories. TKDD 10(1): 10:1-10:22 (2015).

29. Siyuan Liu, Shuhui Wang, Feida Zhu: Structured Learning from Heterogeneous Behavior for Social Identity Linkage. IEEE Trans. Knowl. Data Eng. 27(7): 2005-2019 (2015). 

30. Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian: Similarity Gaussian Process Latent Variable Model for Multi-modal Data Analysis. ICCV 2015: 4050-4058. 

31. Yan Hua, Shuhui Wang, Siyuan Liu, Qingming Huang, Anni Cai: TINA: Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation. ICDM 2014: 190-199.

32. Siyuan Liu, Shuhui Wang, Feida Zhu, Jinbo Zhang, Ramayya Krishnan: HYDRA: large-scale social identity linkage via heterogeneous behavior modeling. SIGMOD Conference 2014: 51-62. 

33. Lingyang Chu, Shuqiang Jiang, Shuhui Wang, Yanyan Zhang, Qingming Huang: Robust Spatial Consistency Graph Model for Partial Duplicate Image Retrieval. IEEE Trans. Multimedia 15(8): 1982-1996 (2013).

34. Siyuan Liu, Shuhui Wang, Kasthuri Jayarajah, Archan Misra, Ramayya Krishnan: TODMIS: mining communities from trajectories. CIKM 2013: 2109-2118.

35. Li Shen, Shuhui Wang, Gang Sun, Shuqiang Jiang, Qingming Huang: Multi-level Discriminative Dictionary Learning towards Hierarchical Visual Categorization. CVPR 2013: 383-390. 

36. Shuhui Wang, Qingming Huang, Shuqiang Jiang, Qi Tian, Lei Qin: Nearest-neighbor method using multiple neighborhood similarities for social media data mining. Neurocomputing 95: 105-116 (2012).

37. Shuhui Wang, Qingming Huang, Shuqiang Jiang, Qi Tian: S3MKL: Scalable Semi-Supervised Multiple Kernel Learning for Real-World Image Applications. IEEE Trans. Multimedia 14(4): 1259-1274 (2012).

38. Shuhui Wang, Shuqiang Jiang, Qingming Huang, Qi Tian: Multi-feature metric learning with knowledge transfer among semantics and social tagging. CVPR 2012: 2240-2247. 

39. Shuhui Wang, Shuqiang Jiang, Qingming Huang, Qi Tian: S3MKL: scalable semi-supervised multiple kernel learning for image data mining. ACM Multimedia 2010: 163-172. 




