What Could Possibly Go Wrong When Interacting with Proactive Smart Speakers? A Case Study Using an ESM Application

Voice user interfaces (VUIs) have made their way into people's daily lives, from voice assistants to smart speakers. Although VUIs typically just react to direct user commands, increasingly, they incorporate elements of proactive behaviors. In particular, proactive smart speakers have the potential for many applications, ranging from healthcare to entertainment; however, their usability in everyday life is subject to interaction errors. To systematically investigate the nature of errors, we designed a voice-based Experience Sampling Method (ESM) application to run on proactive speakers. We captured 1,213 user interactions in a 3-week field deployment in 13 participants' homes. Through auxiliary audio recordings and logs, we identify substantial interaction errors and strategies that users apply to overcome those errors. We further analyze the interaction timings and provide insights into the time cost of errors. We find that, even for answering simple ESMs, interaction errors occur frequently and can hamper the usability of proactive speakers and user experience. Our work also identifies multiple facets of VUIs that can be improved in terms of the timing of speech.

University of Melbourne, Melbourne, Victoria, Australia

The University of Melbourne, Melbourne, Victoria, Australia

University of Melbourne, Melbourne, Victoria, Australia

https://dl.acm.org/doi/abs/10.1145/3491102.3517432

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2022.acm.org/)

290

4 件の発表

開始日時2022-05-03 01:15:00

終了日時2022-05-03 02:30:00

お気に入り

あとで読む

コレクション