Harnessing Biomedical Literature to Calibrate Clinicians' Trust in AI Decision Support Systems

Clinical decision support tools (DSTs), powered by Artificial Intelligence (AI), promise to improve clinicians' diagnostic and treatment decision-making. However, no AI model is always correct. DSTs must enable clinicians to validate each AI suggestion, convincing them to take the correct suggestions while rejecting its errors. While prior work often tried to do so by explaining AI's inner workings or performance, we chose a different approach: We investigated how clinicians validated each other's suggestions in practice (often by referencing scientific literature) and designed a new DST that embraces these naturalistic interactions. This design uses GPT-3 to draw literature evidence that shows the AI suggestions' robustness and applicability (or the lack thereof). A prototyping study with clinicians from three disease areas proved this approach promising. Clinicians' interactions with the prototype also revealed new design and research opportunities around (1) harnessing the complementary strengths of literature-based and predictive decision supports; (2) mitigating risks of de-skilling clinicians; and (3) offering low-data decision support with literature.

Cornell University, Ithaca, New York, United States

University of California, San Diego, San Diego, California, United States

Cornell University, Ithaca, New York, United States

Cornell Tech, New York, New York, United States

Weill Cornell Medicine, New York, New York, United States

https://doi.org/10.1145/3544548.3581393

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2023.acm.org/)

Room Y01+Y02

6 件の発表

開始日時2023-04-25 18:00:00

終了日時2023-04-25 19:30:00

お気に入り