Speech recognition is unreliable in noisy places, compromises privacy and security when used around strangers, and is inaccessible to people with speech disorders. Lip reading can mitigate many of these challenges, but existing silent speech recognizers for lip reading are error-prone. Developing new recognizers and acquiring new datasets is impractical for many, since it requires an enormous amount of time, effort, and other resources. To address these challenges, we first develop LipType, an optimized version of LipNet with improved speed and accuracy. We then develop an independent repair model that preprocesses video input to compensate for poor lighting conditions, when applicable, and corrects potential errors in the output for increased accuracy. We tested this model with both LipType and other speech and silent speech recognizers to demonstrate its effectiveness.
https://doi.org/10.1145/3411764.3445565
The ACM CHI Conference on Human Factors in Computing Systems (https://chi2021.acm.org/)