Designing Ground Truth and the Social Life of Labels

要旨

Ground-truth labeling is an important activity in machine learning. Many studies have examined how crowdworkers apply labels to records in machine learning datasets. However, there have been few studies that have examined the work of domain experts when their knowledge and expertise are needed to apply labels. We provide a grounded account of the work of labeling teams with domain experts, including the experiences of labeling, collaborative configurations and work-practices, and quality issues. We show three major patterns in the social design of ground truth data: Principled design, Iterative design, and Improvisational design. We interpret our results through theories of from Human Centered Data Science, and particularly work on human interventions in data science work through the design and creation of data.

著者
Michael Muller
IBM Research, Cambridge, Massachusetts, United States
Christine T.. Wolf
Independent Consultant, San Jose, California, United States
Josh Andres
IBM Research Australia, Melbourne, Victoria, Australia
Michael Desmond
IBM Research, Yorktown Heights, New York, United States
Narendra Nath Joshi
IBM, Cambridge, Massachusetts, United States
Zahra Ashktorab
IBM Research, Yorktown Heights, New York, United States
Aabhas Sharma
IBM Research, Cambridge, Massachusetts, United States
Kristina Brimijoin
Mrs., Hastings on Hudson, New York, United States
Qian Pan
IBM Research, Cambridge, Massachusetts, United States
Evelyn Duesterwald
IBM Research, Yorktown Heights, New York, United States
Casey Dugan
IBM Research, Cambridge, Massachusetts, United States
DOI

10.1145/3411764.3445402

論文URL

https://doi.org/10.1145/3411764.3445402

動画

会議: CHI 2021

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2021.acm.org/)

セッション: Technology Resistance / HCI and Distinct Populations / Queering Technologies

[B] Paper Room 03, 2021-05-14 01:00:00~2021-05-14 03:00:00 / [C] Paper Room 03, 2021-05-14 09:00:00~2021-05-14 11:00:00 / [A] Paper Room 03, 2021-05-13 17:00:00~2021-05-13 19:00:00
Paper Room 03
12 件の発表
2021-05-14 01:00:00
2021-05-14 03:00:00
日本語まとめ
読み込み中…