Enabling Voice-Accompanying Hand-to-Face Gesture Recognition with Cross-Device Sensing

要旨

Gestures performed accompanying the voice are essential for voice interaction to convey complementary semantics for interaction purposes such as wake-up state and input modality. In this paper, we investigated voice-accompanying hand-to-face (VAHF) gestures for voice interaction. We targeted on hand-to-face gestures because such gestures relate closely with speech and yield significant acoustic features (e.g., impeding voice propagation). We conducted a user study to explore the design space of VAHF gestures, where we first gathered candidate gestures and then applied a structural analysis to them in different dimensions (e.g., contact position and type), outputting a total of 8 VAHF gestures with good usability and least confusion. To facilitate VAHF gesture recognition, we proposed a novel cross-device sensing method that leverages heterogeneous channels (vocal, ultrasound, and IMU) of data from commodity devices (earbuds, watches, and rings). Our recognition model achieved an accuracy of 97.3\% for recognizing 3 gestures and 91.5\% for recognizing 8 gestures \revision{(excluding the "empty" gesture)}, proving the high applicability. Quantitative analysis also shed light on the recognition capability of each sensor channel and their different combinations. In the end, we illustrated the feasible use cases and their design principles to demonstrate the applicability of our system in various scenarios.

受賞
Honorable Mention
著者
Zisu Li
The Hong Kong University of Science and Technology, Hong Kong SAR, China
Chen Liang
Tsinghua University, Beijing, Beijing, China
Yuntao Wang
Tsinghua University, Beijing, China
Yue Qin
Tsinghua University, Beijing, China
Chun Yu
Tsinghua University, Beijing, China
Yukang Yan
Tsinghua University, Beijing, China
Mingming Fan
The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, China
Yuanchun Shi
Tsinghua University, Beijing, China
論文URL

https://doi.org/10.1145/3544548.3581008

動画

会議: CHI 2023

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2023.acm.org/)

セッション: Haptic and sensing devices

Hall C
6 件の発表
2023-04-26 18:00:00
2023-04-26 19:30:00