Evaluating the Interpretability of Generative Models by Interactive Reconstruction

For machine learning models to be most useful in numerous sociotechnical systems, many have argued that they must be human-interpretable. However, despite increasing interest in interpretability, there remains no firm consensus on how to measure it. This is especially true in representation learning, where interpretability research has focused on "disentanglement" measures only applicable to synthetic datasets and not grounded in human factors. We introduce a task to quantify the human-interpretability of generative model representations, where users interactively modify representations to reconstruct target instances. On synthetic datasets, we find performance on this task much more reliably differentiates entangled and disentangled models than baseline approaches. On a real dataset, we find it differentiates between representation learning methods widely believed but never shown to produce more or less interpretable models. In both cases, we ran small-scale think-aloud studies and large-scale experiments on Amazon Mechanical Turk to confirm that our qualitative and quantitative results agreed.

Harvard University, Cambridge, Massachusetts, United States

Harvard College, Cambridge, Massachusetts, United States

Harvard University, Cambridge, Massachusetts, United States

10.1145/3411764.3445296

https://doi.org/10.1145/3411764.3445296

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2021.acm.org/)

[B] Paper Room 02, 2021-05-14 01:00:00~2021-05-14 03:00:00 / [C] Paper Room 02, 2021-05-14 09:00:00~2021-05-14 11:00:00 / [A] Paper Room 02, 2021-05-13 17:00:00~2021-05-13 19:00:00

Paper Room 02

12 件の発表

開始日時2021-05-14 01:00:00

終了日時2021-05-14 03:00:00

読み込み中…

お気に入り

あとで読む

コレクション

要旨

受賞
Honorable Mention

著者

DOI

論文URL

動画

会議: CHI 2021

セッション: Computational AI Development and Explanation

日本語まとめ