Multimodal Healthcare AI: Identifying and Designing Clinically Relevant Vision-Language Applications for Radiology

要旨

Recent advances in AI combine large language models (LLMs) with vision encoders that bring forward unprecedented technical capabilities to leverage for a wide range of healthcare applications. Focusing on the domain of radiology, vision-language models (VLMs) achieve good performance results for tasks such as generating radiology findings based on a patient's medical image, or answering visual questions (e.g., ``Where are the nodules in this chest X-ray?''). However, the clinical utility of potential applications of these capabilities is currently underexplored. We engaged in an iterative, multidisciplinary design process to envision clinically relevant VLM interactions, and co-designed four VLM use concepts: Draft Report Generation, Augmented Report Review, Visual Search and Querying, and Patient Imaging History Highlights. We studied these concepts with 13 radiologists and clinicians who assessed the VLM concepts as valuable, yet articulated many design considerations. Reflecting on our findings, we discuss implications for integrating VLM capabilities in radiology, and for healthcare AI more generally.

著者
Nur Yildirim
Carnegie Mellon University, Pittsburgh, Pennsylvania, United States
Hannah Richardson
Microsoft Health Futures, Cambridge, United Kingdom
Maria Teodora Wetscherek
Cambridge University Hospitals NHS Foundation Trust, Cambridge, United Kingdom
Junaid Bajwa
Microsoft Health Futures, Cambridge, United Kingdom
Joseph Jacob
University College London, London, United Kingdom
Mark Ames. Pinnock
University College London, London, United Kingdom
Stephen Harris
University College London Hospital NHS Foundation Trust, London, United Kingdom
Daniel Coelho de Castro
Microsoft Health Futures, Cambridge, United Kingdom
Shruthi Bannur
Microsoft Health Futures, Cambridge, United Kingdom
Stephanie Hyland
Microsoft Health Futures, Cambridge, United Kingdom
Pratik Ghosh
Microsoft Health Futures, Cambridge, United Kingdom
Mercy Ranjit
Microsoft Health Futures, Bengaluru, India
Kenza Bouzid
Microsoft Health Futures, Cambridge, United Kingdom
Anton Schwaighofer
Microsoft Health Futures, Cambridge, United Kingdom
Fernando Pérez-García
Microsoft Health Futures, Cambridge, United Kingdom
Harshita Sharma
Microsoft Health Futures, Cambridge, United Kingdom
Ozan Oktay
Microsoft Health Futures, Cambridge, United Kingdom
Matthew Lungren
Microsoft Nuance, Palo Alto, California, United States
Javier Alvarez-Valle
Microsoft Health Futures, Cambridge, United Kingdom
Aditya Nori
Microsoft Health Futures, Cambridge, United Kingdom
Anja Thieme
Microsoft Health Futures, Cambridge, United Kingdom
論文URL

doi.org/10.1145/3613904.3642013

動画

会議: CHI 2024

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2024.acm.org/)

セッション: Health and AI B

315
5 件の発表
2024-05-16 01:00:00
2024-05-16 02:20:00