The integration of LLMs into GUI agents promises to revolutionize web browsing automation, yet the practical user experience remains challenging. This paper systematically characterizes user-reported issues with GUI agents along three dimensions: phenomena, influences, and user-centric mitigation. We adopted a two-phase method combining social media analysis (N=221 posts) and semi-structured interviews (N=21). Our findings reveal a taxonomy of complaints unique to GUI agents, including deficits in grounding abstract intent into concrete interface affordances, an inability to adapt to dynamic visual states, and the execution of erroneous actions. These phenomena lead to influences distinct from those of text-based hallucinations, ranging from task abandonment to security risks such as uncontrolled file system access. In response, users are forced to adopt ad-hoc mitigation strategies, including ecological sandboxing and cursor shadowing, to correct GUI agent behaviors. We contribute: (1) a comprehensive characterization of complaints specific to GUI agent interaction, (2) an analysis of how these phenomena degrade interaction integrity, and (3) design implications for creating consequence-aware agents.
ACM CHI Conference on Human Factors in Computing Systems