Cultivating Spoken Language Technologies for Unwritten Languages

We report on community-centered, collaborative research that weaves together HCI, natural language processing, linguistic, and design insights to develop spoken language technologies for unwritten languages. Across three visits to a Banjara farming community in India, we use participatory, technical, and creative methods to engage community members, collect spoken language photo annotations, and develop an information retrieval (IR) system. Drawing on orality theory, we interrogate assumptions and biases of current speech interfaces and create a simple application that leverages our IR system to match fluidly spoken queries with recorded annotations and surface corresponding photos. In-situ evaluations show that our novel approach returns reliable results and that it inspired the co-creation of media retrieval use-cases that are more appropriate in oral contexts. The very low (< 4h) spoken data requirements make our approach adaptable to other contexts where languages are unwritten or have no digital language resources available.

Swansea University, Swansea, United Kingdom

Studio Hasi, Mumbai, India

University of Edinburgh, Edinburgh, United Kingdom

University of Essex, Essex, United Kingdom

Swansea University, Swansea, Wales, United Kingdom

Swansea University, Swansea, United Kingdom

University of Edinburgh, Edinburgh, United Kingdom

Swansea University, Swansea, United Kingdom

https://doi.org/10.1145/3613904.3642026

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2024.acm.org/)

319

4 presentations

Start time: 2024-05-16 01:00:00

End time: 2024-05-16 02:20:00

Award: Honorable Mention

Conference: CHI 2024

Session: Indigenous Communities and Cultural Heritage A
