Interpreting Interpretability: Understanding Data Scientists' Use of Interpretability Tools for Machine Learning

要旨

Machine learning (ML) models are now routinely deployed in domains ranging from criminal justice to healthcare. With this newfound ubiquity, ML has moved beyond academia and grown into an engineering discipline. To that end, interpretability tools have been designed to help data scientists and machine learning practitioners better understand how ML models work. However, there has been little evaluation of the extent to which these tools achieve this goal. We study data scientists' use of two existing interpretability tools, the InterpretML implementation of GAMs and the SHAP Python package. We conduct a contextual inquiry (N=11) and a survey (N=197) of data scientists to observe how they use interpretability tools to uncover common issues that arise when building and evaluating ML models. Our results indicate that data scientists over-trust and misuse interpretability tools. Furthermore, few of our participants were able to accurately describe the visualizations output by these tools. We highlight qualitative themes for data scientists' mental models of interpretability tools. We conclude with implications for researchers and tool designers, and contextualize our findings in the social science literature.

受賞
Honorable Mention
キーワード
Interpretability
Machine learning
User-centric evaluation
著者
Harmanpreet Kaur
University of Michigan, Ann Arbor, MI, USA
Harsha Nori
Microsoft Research, Seattle, WA, USA
Samuel Jenkins
Microsoft Research, Redmond, WA, USA
Rich Caruana
Microsoft Research, Redmond, WA, USA
Hanna Wallach
Microsoft Research, New York City, NY, USA
Jennifer Wortman Vaughan
Microsoft Research, New York, NY, USA
DOI

10.1145/3313831.3376219

論文URL

https://doi.org/10.1145/3313831.3376219

会議: CHI 2020

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2020.acm.org/)

セッション: Coping with AI: not agAIn!

Paper session
316C MAUI
5 件の発表
2020-04-29 18:00:00
2020-04-29 19:15:00
日本語まとめ
読み込み中…