Editable XAI: Toward Bidirectional Human-AI Alignment with Co-Editable Explanations of Interpretable Attributes

Abstract

While Explainable AI (XAI) helps users understand AI decisions, misaligned domain knowledge between users and models can lead to disagreement. This inconsistency hinders understanding, and because explanations are typically read-only, users lack the control to improve alignment. We propose making XAI editable, allowing users to write rules that improve their control and deepen their understanding through the generation effect of active learning. We developed CoExplain, which leverages a neural network for universal representation and symbolic rules for intuitive reasoning over interpretable attributes. CoExplain explains the neural network with a faithful proxy decision tree, parses user-written rules into an equivalent neural network graph, and collaboratively optimizes the decision tree. In a user study (N=43), both CoExplain and manually editable XAI improved user understanding and model alignment compared to read-only XAI, and CoExplain was easier to use, requiring fewer edits and less time. This work contributes Editable XAI for bidirectional human-AI alignment, improving both understanding and control.
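To make the pipeline concrete, below is a minimal, illustrative Python sketch of the loop the abstract describes: distill the neural network into a faithful proxy decision tree, apply a user-written rule, and re-fit the proxy so that explanation and model predictions agree. This is not the authors' implementation; the toy data, attribute names, and the apply_user_rule helper are hypothetical, and CoExplain's actual rule parsing (into an equivalent neural network graph) and collaborative optimization are more involved.

```python
# Illustrative sketch only; names and data are hypothetical, not CoExplain's API.
import numpy as np
from sklearn.neural_network import MLPClassifier
from sklearn.tree import DecisionTreeClassifier, export_text

rng = np.random.default_rng(0)

# Toy dataset over two interpretable attributes, e.g. "age" and "income".
X = rng.uniform(0, 1, size=(500, 2))
y = (X[:, 0] + X[:, 1] > 1.0).astype(int)

# 1. The opaque model: a small neural network.
net = MLPClassifier(hidden_layer_sizes=(16,), max_iter=2000, random_state=0)
net.fit(X, y)

# 2. Faithful proxy: train a shallow decision tree on the network's own
#    predictions (a standard distillation step; the paper's optimization
#    may differ).
proxy = DecisionTreeClassifier(max_depth=3, random_state=0)
proxy.fit(X, net.predict(X))
print(export_text(proxy, feature_names=["age", "income"]))

# 3. A user edit, expressed as a rule: "IF age < 0.2 THEN predict 0".
def apply_user_rule(X, y_hat):
    y_edit = y_hat.copy()
    y_edit[X[:, 0] < 0.2] = 0
    return y_edit

# 4. Re-fit the proxy on predictions constrained by the user's rule, so the
#    explanation reflects the edited behavior.
proxy.fit(X, apply_user_rule(X, net.predict(X)))
```

Fitting the tree to the network's predictions rather than the ground-truth labels is what makes it a faithfulness proxy: its fidelity is measured against the model being explained, not against the data.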

Authors
Haoyang Chen
National University of Singapore, Singapore, Singapore
Jingwen Bai
National University of Singapore, Singapore, Singapore
Fang Tian
National University of Singapore, Singapore, Singapore
Brian Y. Lim
National University of Singapore, Singapore, Singapore

Conference: CHI 2026

ACM CHI Conference on Human Factors in Computing Systems

Session: Personalization and Human-AI Alignment

P1 - Room 130
7 presentations
2026-04-14, 18:00–19:30