Diversity in datasets is a key component of building responsible AI/ML. Despite this recognition, we know little about the diversity among the annotators involved in data production. We investigated approaches to annotator diversity through 16 semi-structured interviews and a survey with 44 AI/ML practitioners. While practitioners described nuanced understandings of annotator diversity, they rarely designed dataset production to account for diversity in the annotation process. This lack of action was explained through operational barriers: from the lack of visibility in the annotator hiring process to the conceptual difficulty of incorporating worker diversity. We argue that such operational barriers, together with the widespread resistance to accommodating annotator diversity, surface a prevailing logic in data practices in which neutrality, objectivity, and 'representationalist thinking' dominate. By understanding this logic as part of a regime of existence, we explore alternative ways of accounting for annotator subjectivity and diversity in data practices.
https://doi.org/10.1145/3544548.3580645
The ACM CHI Conference on Human Factors in Computing Systems (https://chi2023.acm.org/)