Agentic AI holds promise for usability testing, yet its role as an audio moderator in think-aloud protocols is not well understood. This study explores (1) how to design and develop an agentic audio moderator for think-aloud usability testing, and (2) how participants moderated by an agentic moderator differ from those moderated by a human in task performance, verbalization behaviors, user experience, and social perceptions of the moderator. Using a design-based research approach, we interviewed nine UX experts, iteratively developed an AI moderator, and evaluated it in a randomized controlled trial (N=60) with a note-taking application. No significant differences were observed between the AI and human moderators in task performance or verbalization behaviors, though the AI moderator received lower social perception ratings. This work contributes the first design-oriented evaluation of AI moderators in usability testing, offering implications for developing more acceptable and effective agentic audio moderators.
ACM CHI Conference on Human Factors in Computing Systems