Cultivating Spoken Language Technologies for Unwritten Languages

要旨

We report on community-centered, collaborative research that weaves together HCI, natural language processing, linguistic, and design insights to develop spoken language technologies for unwritten languages. Across three visits to a Banjara farming community in India, we use participatory, technical, and creative methods to engage community members, collect spoken language photo annotations, and develop an information retrieval (IR) system. Drawing on orality theory, we interrogate assumptions and biases of current speech interfaces and create a simple application that leverages our IR system to match fluidly spoken queries with recorded annotations and surface corresponding photos. In-situ evaluations show how our novel approach returns reliable results and inspired the co-creation of media retrieval use-cases that are more appropriate in oral contexts. The very low (< 4h) spoken data requirements makes our approach adaptable to other contexts where languages are unwritten or have no digital language resources available.

受賞
Honorable Mention
著者
Thomas Reitmaier
Swansea University, Swansea, United Kingdom
Dani Kalarikalayil Raju
Studio Hasi, Mumbai, India
Ondrej Klejch
University of Edinburgh, Edinburgh, United Kingdom
Electra Wallington
University of Edinburgh, Edinburgh, United Kingdom
Nina Markl
University of Essex, Essex, United Kingdom
Jennifer Pearson
Swansea University, Swansea, Wales, United Kingdom
Matt Jones
Swansea University, Swansea, United Kingdom
Peter Bell
University of Edinburgh , Edinburgh , United Kingdom
Simon Robinson
Swansea University, Swansea, United Kingdom
論文URL

https://doi.org/10.1145/3613904.3642026

動画

会議: CHI 2024

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2024.acm.org/)

セッション: Indigeonus Communities and Cutural Heritage A

319
4 件の発表
2024-05-16 01:00:00
2024-05-16 02:20:00