The Impact of Response Latency and Task Type on Human-LLM Interaction and Perception

要旨

Responsiveness in large language model (LLM) applications is widely assumed to be critical, yet the impact of latency on user behavior and perception of output quality has not been systematically explored. We report a controlled experiment varying time-to-first-token latency (2, 9, 20 seconds) across two taxonomy-driven knowledge task types (Creation and Advice). Log analyses reveal that user interaction behaviors were robust to latency, yet varied by task type: Creation tasks elicited more frequent prompting than Advice tasks. In contrast, participants who experienced 2-second latencies rated the LLM’s outputs less thoughtful and useful than those who experienced 9- or 20-second latencies. Participants attributed delays to AI deliberation, though long waits occasionally shifted this interpretation toward frustration or concerns about reliability. Overall, this work demonstrates that latency is not simply a cost to reduce but a tunable design variable with ethical implications. We offer design strategies for enhancing human-LLM interaction.

著者
Felicia Fang-Yi Tan
New York University, New York, New York, United States
Moritz Alexander. Messerschmidt
National University of Singapore, Singapore, Singapore
Wen Yin
New York University, New York, New York, United States
Oded Nov
New York University, New York, New York, United States

会議: CHI 2026

ACM CHI Conference on Human Factors in Computing Systems

セッション: AI & Timing Matters

P1 - Room 129
7 件の発表
2026-04-13 20:15:00
2026-04-13 21:45:00