Transformer Explainer: Learning LLM Transformers with Interactive Visual Explanation and Experimentation

要旨

The Transformer architecture underpins modern large language models powering state-of-the-art text generation and AI applications. However, its complexity makes it difficult for non-experts to learn. Existing resources often lack interactivity, rely on static descriptions of simplified architectures, or fail to reflect models’ behavior with real data. To address this gap, we introduce Transformer Explainer, an interactive visualization tool for non-experts to learn Transformers. The tool integrates an overview illustrating the Transformer's data flow with on-demand explanations that gradually reveal mathematical details. Smooth transitions across abstraction levels highlight the interplay between high-level structures and low-level operations. Running a live GPT-2 instance directly in the browser, Transformer Explainer empowers learners to experiment with custom input and hyperparameters without setup, observing next-token predictions in real time. A 90-participant user study showed that our tool offered significant advantages in improving user understanding and engagement. Transformer Explainer has attracted over 490,000 users.

著者
Aeree Cho
Georgia Institute of Technology, Atlanta, Georgia, United States
Grace C.. Kim
Georgia Institute of Technology, Atlanta, Georgia, United States
Alexander Karpekov
Georgia Tech, Atlanta, Georgia, United States
Seongmin Lee
Georgia Tech, Atlanta, Georgia, United States
Alec Helbling
Georgia Institute of Technology, Atlanta, Georgia, United States
Benjamin Hoover
IBM Research AI, Cambridge, Massachusetts, United States
Zijie J.. Wang
Georgia Tech, Atlanta, Georgia, United States
Minsuk Kahng
Yonsei University, Seoul, Korea, Republic of
Duen Horng (Polo) Chau
Georgia Tech, Atlanta, Georgia, United States

会議: CHI 2026

ACM CHI Conference on Human Factors in Computing Systems

セッション: AI & Data Visualization

M2 - Room M211/212
6 件の発表
2026-04-15 18:00:00
2026-04-15 19:30:00