Expert Discussions Improve Comprehension of Difficult Cases in Medical Image Assessment

要旨

Medical data labeling workflows critically depend on accurate assessments from human experts. Yet human assessments can vary markedly, even among medical experts. Prior research has demonstrated benefits of labeler training on performance. Here we utilized two types of labeler training feedback: highlighting incorrect labels for difficult cases ("individual performance" feedback), and expert discussions from adjudication of these cases. We presented ten generalist eye care professionals with either individual performance alone, or individual performance and expert discussions from specialists. Compared to performance feedback alone, seeing expert discussions significantly improved generalists' understanding of the rationale behind the correct diagnosis while motivating changes in their own labeling approach; and also significantly improved average accuracy on one of four pathologies in a held-out test set. This work suggests that image adjudication may provide benefits beyond developing trusted consensus labels, and that exposure to specialist discussions can be an effective training intervention for medical diagnosis.

キーワード
Medical Images
Diagnosis
Adjudication
Labeler Training
著者
Mike Schaekermann
Google & University of Waterloo, Mountain View, CA, USA
Carrie J. Cai
Google, Mountain View, CA, USA
Abigail E. Huang
Google, Mountain View, CA, USA
Rory Sayres
Google, Mountain View, CA, USA
DOI

10.1145/3313831.3376290

論文URL

https://doi.org/10.1145/3313831.3376290

会議: CHI 2020

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2020.acm.org/)

セッション: Crowdsourcing & the value of discussion

Paper session
316C MAUI
5 件の発表
2020-04-29 23:00:00
2020-04-30 00:15:00
日本語まとめ
読み込み中…