Visual Information Processing and Learning
Visual Information Processing and Learning

Lip Reading

Leader:Shuang Yang / Shiguang Shan (Professor)

Email:shuang.yang [at] ict dot ac dot cn; sgshan [at] ict dot ac dot cn

* The Lip-Reading (LR) Group has been founded since 2017, which mainly focuses on Lip Reading related tasks, including but not limited to Visual Speech Recognition, Visual Voice Activity Detection, Visual Key-Word Spotting, Talking Face Generation and so on.

* News:

2018.10: The LR Group has released the large-scale naturally distributed lip reading dataset LRW-1000. This dataset is not only the currently largest word-level lip reading dataset, but also the only one public Mandarin lip reading dataset. For more details, please refer to the data pape.

2018.4~2018.10: The LR Group has been invited by CCTV-1 to show the lip reading technology and system to the whole television audiences. For more details, please click here.

* Research Topics:

※ Visual Speech Recognition

※ Visual Key-Word Spotting

※ Talking Face Generation

※ Visual Voice Activity Detection

※ Audio-Visual Related Tasks

* Potential Applications:

※ Lip Password, Spoofing Detection, Command Recognition, Evaluation of Pronunciation, etc.


Journal Papers

Conference Papers

1.    Shuang Yang, Yuanhang Zhang, Dalu Feng, Mingmin Yang, Chenhao Wang, Jingyun Xiao, Keyu Long, Shiguang Shan, Xilin Chen, "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild," IEEE FG 2019 (Oral)

Visual Information Processing and Learning
  • Address :No.6 Kexueyuan South Road
  • Zhongguancun,Haidian District
  • Beijing,China
  • Postcode :100190
  • Tel : (8610)62600514
  • Valse

  • Big Lecture of DL

Copyright @ Visual Information Processing and Learning 京ICP备05002829号 京公网安备1101080060