Explanations, Fairness, and Appropriate Reliance in Human-AI Decision-Making

Abstract

In this work, we study the effects of feature-based explanations on the distributive fairness of AI-assisted decisions, focusing on the task of predicting occupations from short textual bios. We also investigate how any such effects are mediated by humans' fairness perceptions and their reliance on AI recommendations. Our findings show that explanations influence fairness perceptions, which, in turn, relate to humans' tendency to adhere to AI recommendations. However, these explanations do not enable humans to distinguish correct from incorrect AI recommendations; instead, they may affect reliance irrespective of the correctness of a recommendation. Depending on which features an explanation highlights, this can foster or hinder distributive fairness: when explanations highlight features that are task-irrelevant and evidently associated with the sensitive attribute, they prompt overrides of AI recommendations that align with gender stereotypes. Meanwhile, when explanations appear task-relevant, they induce reliance behavior that reinforces stereotype-aligned errors. These results imply that feature-based explanations are not a reliable mechanism for improving distributive fairness.
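For readers unfamiliar with the term, a feature-based explanation attributes a model's recommendation to individual input features, here, words in a bio. The sketch below is a minimal illustration of that general idea and not the authors' study pipeline: it assumes scikit-learn is available, trains a toy logistic-regression occupation classifier on invented bios, and ranks each word by its contribution to the predicted occupation. All data, labels, and function names are hypothetical.

```python
# Minimal, illustrative sketch of a feature-based explanation for
# occupation prediction from short bios. Not the authors' setup;
# the bios and occupations below are invented for illustration.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression

bios = [
    "She teaches algebra and geometry at a local high school.",
    "He teaches calculus and coaches the math club.",
    "She treats patients and performs surgery at the city hospital.",
    "He diagnoses patients and prescribes medication in his clinic.",
]
occupations = ["teacher", "teacher", "physician", "physician"]

vectorizer = CountVectorizer()
X = vectorizer.fit_transform(bios)
clf = LogisticRegression().fit(X, occupations)

def explain(bio, top_k=5):
    """Rank each word in `bio` by its contribution to the predicted class."""
    x = vectorizer.transform([bio])
    pred = clf.predict(x)[0]
    class_idx = list(clf.classes_).index(pred)
    # Binary LogisticRegression stores one coefficient row whose sign
    # points toward classes_[1], so flip it when classes_[0] is predicted.
    coef = clf.coef_[0] if class_idx == 1 else -clf.coef_[0]
    # Per-word contribution = word count in the bio * its coefficient.
    contributions = x.toarray()[0] * coef
    words = vectorizer.get_feature_names_out()
    ranked = sorted(zip(words, contributions), key=lambda wc: -wc[1])
    return pred, [(w, round(c, 3)) for w, c in ranked if c != 0][:top_k]

print(explain("She teaches physics and runs the school science fair."))
```

In this toy example, a word like "teaches" would surface as a task-relevant feature, while a word like "she" would be a task-irrelevant feature associated with the sensitive attribute, the distinction the abstract's findings hinge on.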

Award
Honorable Mention
Authors
Jakob Schoeffer
The University of Texas at Austin, Austin, Texas, United States
Maria De-Arteaga
The University of Texas at Austin, Austin, Texas, United States
Niklas Kühl
University of Bayreuth, Bayreuth, Germany
Paper URL

doi.org/10.1145/3613904.3642621

Conference: CHI 2024

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2024.acm.org/)

Session: Sensemaking with AI A

5 presentations
2024-05-16 01:00 – 02:20