The Impact of Response Latency and Task Type on Human-LLM Interaction and Perception

Responsiveness in large language model (LLM) applications is widely assumed to be critical, yet the impact of latency on user behavior and perception of output quality has not been systematically explored. We report a controlled experiment varying time-to-first-token latency (2, 9, 20 seconds) across two taxonomy-driven knowledge task types (Creation and Advice). Log analyses reveal that user interaction behaviors were robust to latency, yet varied by task type: Creation tasks elicited more frequent prompting than Advice tasks. In contrast, participants who experienced 2-second latencies rated the LLM’s outputs as less thoughtful and less useful than those who experienced 9- or 20-second latencies. Participants attributed delays to AI deliberation, though long waits occasionally shifted this interpretation toward frustration or concerns about reliability. Overall, this work demonstrates that latency is not simply a cost to reduce but a tunable design variable with ethical implications. We offer design strategies for enhancing human-LLM interaction.

New York University, New York, New York, United States

National University of Singapore, Singapore, Singapore

New York University, New York, New York, United States

ACM CHI Conference on Human Factors in Computing Systems

P1 - Room 129

7 presentations

Start time: 2026-04-13 20:15:00

End time: 2026-04-13 21:45:00

Conference: CHI 2026

Session: AI & Timing Matters