"Rewind to the Jiggling Meat Part": Understanding Voice Control of Instructional Videos in Everyday Tasks


Voice interaction has long been envisioned as a way to make physical interaction hands-free, for example by allowing fine-grained control of instructional videos without physically disengaging from the task at hand. While significant engineering advances have brought us closer to this ideal, we do not fully understand the user requirements for voice interactions that should be supported in such contexts. This paper presents an ecologically valid Wizard-of-Oz elicitation study exploring realistic user requirements for ideal instructional video playback control while cooking. Through analysis of the commands issued and the actions performed during this non-linear and complex task, we identify (1) patterns of command formulation, (2) challenges for design, and (3) how task and voice-based commands are interwoven in real life. We discuss implications for the design and research of voice interactions for navigating instructional videos while performing complex tasks.

Yaxi Zhao
University of Toronto, Toronto, Ontario, Canada
Razan Jaber
Stockholm University, Stockholm, Sweden
Donald McMillan
Stockholm University, Stockholm, Sweden
Cosmin Munteanu
University of Toronto Mississauga, Mississauga, Ontario, Canada



Conference: CHI 2022

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2022.acm.org/)

Session: Voice, Conversation and Design

New Orleans Theater A
5 presentations
2022-05-03 20:00:00
2022-05-03 21:15:00