FrownOnError: Interrupting Responses from Smart Speakers by Facial Expressions

要旨

In the conversations with smart speakers, misunderstandings of users' requests lead to erroneous responses. We propose FrownOnError, a novel interaction technique that enables users to interrupt the responses by intentional but natural facial expressions. This method leverages the human nature that the facial expression changes when we receive unexpected responses. We conducted a first user study (N=12) to understand users' intuitive reactions to the correct and incorrect responses. Our results reveal the significant difference in the frequency of occurrence and intensity of users' facial expressions between two conditions, and frowning and raising eyebrows are intuitive to perform and easy to control. Our second user study (N=16) evaluated the user experience and interruption efficiency of FrownOnError and the third user study (N=12) explored suitable conversation recovery strategies after the interruptions. Our results show that FrownOnError can be accurately detected (precision: 97.4\%, recall: 97.6\%), provides the most timely interruption compared to the baseline methods of wake-up word and button press, and is rated as most intuitive and easiest to be performed by users.

受賞
Honorable Mention
キーワード
Voice User Interface
Facial Expression
Conversation Interruption
著者
Yukang Yan
Tsinghua University, Beijing, China
Chun Yu
Tsinghua University; Ministry of Education, Beijing, China
Wengrui Zheng
Tsinghua University, Beijing, China
Ruining Tang
Tsinghua University, Beijing, China
Xuhai Xu
Tsinghua University, Beijing, China
Yuanchun Shi
Tsinghua University; Ministry of Education, Beijing, China
DOI

10.1145/3313831.3376810

論文URL

https://doi.org/10.1145/3313831.3376810

動画

会議: CHI 2020

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2020.acm.org/)

セッション: In dialogue with AI

Paper session
316C MAUI
5 件の発表
2020-04-29 01:00:00
2020-04-29 02:15:00
日本語まとめ
読み込み中…