LipType: A Silent Speech Recognizer Augmented with an Independent Repair Model

要旨

Speech recognition is unreliable in noisy places, compromises privacy and security when around strangers, and inaccessible to people with speech disorders. Lip reading can mitigate many of these challenges but the existing silent speech recognizers for lip reading are error prone. Developing new recognizers and acquiring new datasets is impractical for many since it requires enormous amount of time, effort, and other resources. To address these, first, we develop LipType, an optimized version of LipNet for improved speed and accuracy. We then develop an independent repair model that processes video input for poor lighting conditions, when applicable, and corrects potential errors in output for increased accuracy. We tested this model with both LipType and other speech and silent speech recognizers to demonstrate its effectiveness.

著者
Laxmi Pandey
University of California, Merced, Merced, California, United States
Ahmed Sabbir. Arif
University of California, Merced, Merced, California, United States
DOI

10.1145/3411764.3445565

論文URL

https://doi.org/10.1145/3411764.3445565

動画

会議: CHI 2021

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2021.acm.org/)

セッション: Vision and Sensing

[A] Paper Room 10, 2021-05-12 17:00:00~2021-05-12 19:00:00 / [C] Paper Room 10, 2021-05-13 09:00:00~2021-05-13 11:00:00 / [B] Paper Room 10, 2021-05-13 01:00:00~2021-05-13 03:00:00
Paper Room 10
13 件の発表
2021-05-12 17:00:00
2021-05-12 19:00:00
日本語まとめ
読み込み中…