Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers

要旨

In this paper we develop approaches to automatic speech recognition (ASR) development that suit the needs and functions of under-heard language speakers. Our novel contribution to HCI is to show how community-engagement can surface key technical and social issues and opportunities for more effective speech-based systems. We introduce a bespoke toolkit of technologies and showcase how we utilised the toolkit to engage communities of under-heard language speakers; and, through that engagement process, situate key aspects of ASR development in community contexts. The toolkit consists of (1) an information appliance to facilitate spoken-data collection on topics of community interest, (2) a mobile app to create crowdsourced transcripts of collected data, and (3) demonstrator systems to showcase ASR capabilities and to feed back research results to community members. Drawing on the sensibilities we cultivated through this research, we present a series of challenges to the orthodoxy of state-of-the-art approaches to ASR development.

著者
Thomas Reitmaier
Swansea University, Swansea, United Kingdom
Electra Wallington
University of Edinburgh, Edinburgh, United Kingdom
Ondrej Klejch
University of Edinburgh, Edinburgh, United Kingdom
Nina Markl
University of Edinburgh , Edinburgh , United Kingdom
Léa-Marie Lam-Yee-Mui
Université Paris-Saclay, CNRS, Laboratoire Interdisciplinaire des Sciences du Numérique, Orsay, France
Jennifer Pearson
Swansea University, Swansea, Wales, United Kingdom
Matt Jones
Swansea University, Swansea, United Kingdom
Peter Bell
University of Edinburgh , Edinburgh , United Kingdom
Simon Robinson
Swansea University, Swansea, United Kingdom
論文URL

https://doi.org/10.1145/3544548.3581385

動画

会議: CHI 2023

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2023.acm.org/)

セッション: Interaction modalities

Room Y01+Y02
6 件の発表
2023-04-25 23:30:00
2023-04-26 00:55:00