Visible Nuances: A Caption System to Visualize Paralinguistic Speech Cues for Deaf and Hard-of-Hearing Individuals

Captions help deaf and hard-of-hearing (DHH) individuals visually communicate voice information to better understand video content. In speech, the literal content and paralinguistic cues (e.g., pitch and nuance) work together to create real intention. However, current captions are limited in their capacity to deliver fine nuances because they cannot fully convey these paralinguistic cues. This paper proposes an audio-visualized caption system that automatically visualizes paralinguistic cues into various caption elements (thickness, height, font type and motion). A comparative study with 20 DHH participants demonstrates how our system supports DHH individuals to be better accessible to paralinguistic cues while watching videos. Particularly in the case of formal talks, they could accurately identify the speaker’s nuance more often compared to current captions, without any practice or training. Addressing some issues on legibility and familiarity, the proposed caption system has potentials to enrich DHH individuals’ video watching experience more as hearing people enjoy.

Gwangju Institute of Science and Technology, Gwangju, Korea, Republic of

https://doi.org/10.1145/3544548.3581130

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2023.acm.org/)

Room Y03+Y04

6 件の発表

開始日時2023-04-27 18:00:00

終了日時2023-04-27 19:30:00

お気に入り

あとで読む

コレクション

Visible Nuances: A Caption System to Visualize Paralinguistic Speech Cues for Deaf and Hard-of-Hearing Individuals

要旨

著者

論文URL

動画

会議: CHI 2023

セッション: Accessible Interaction Techniques B