Opportunities and Challenges of Automatic Speech Recognition Systems for Low-Resource Language Speakers

要旨

Automatic Speech Recognition (ASR) researchers are turning their attention towards supporting low-resource languages, such as isiXhosa or Marathi, with only limited training resources. We report and reflect on collaborative research across ASR & HCI to situate ASR-enabled technologies to suit the needs and functions of two communities of low-resource language speakers, on the outskirts of Cape Town, South Africa and in Mumbai, India. We build on longstanding community partnerships and draw on linguistics, media studies and HCI scholarship to guide our research. We demonstrate diverse design methods to: remotely engage participants; collect speech data to test ASR models; and ultimately field-test models with users. Reflecting on the research, we identify opportunities, challenges, and use-cases of ASR, in particular to support pervasive use of WhatsApp voice messaging. Finally, we uncover implications for collaborations across ASR & HCI that advance important discussions at CHI surrounding data, ethics, and AI.

著者
Thomas Reitmaier
Swansea University, Swansea, United Kingdom
Electra Wallington
University of Edinburgh, Edinburgh, United Kingdom
Dani Kalarikalayil Raju
Studio Hasi, Mumbai, India
Ondrej Klejch
University of Edinburgh, Edinburgh, United Kingdom
Jennifer Pearson
Swansea University, Swansea, Wales, United Kingdom
Matt Jones
Swansea University, Swansea, United Kingdom
Peter Bell
University of Edinburgh , Edinburgh , United Kingdom
Simon Robinson
Swansea University, Swansea, United Kingdom
論文URL

https://dl.acm.org/doi/abs/10.1145/3491102.3517639

動画

会議: CHI 2022

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2022.acm.org/)

セッション: Technology for Developing Regions and Underserved Populations

394
4 件の発表
2022-05-04 18:00:00
2022-05-04 19:15:00