NasoVoce: A Nose-Mounted Low-Audibility Speech Interface for Always-Available Speech Interaction

要旨

Silent and whispered speech offer promise for always-available voice interaction with AI, yet existing methods struggle to balance vocabulary size, wearability, silence, and noise robustness. We present NasoVoce, a nose-bridge–mounted interface that integrates a microphone and a vibration sensor. Positioned at the nasal pads of smart glasses, it unobtrusively captures both acoustic and vibration signals. The nasal bridge, close to the mouth, allows access to bone- and skin-conducted speech and enables reliable capture of low-volume utterances such as whispered speech. While the microphone captures high-quality audio, it is highly sensitive to environmental noise. Conversely, the vibration sensor is robust to noise but yields lower signal quality. By fusing these complementary inputs, NasoVoce generates high-quality speech robust against interference. Evaluation with Whisper Large-v2, PESQ, STOI, and MUSHRA ratings confirms improved recognition and quality. NasoVoce demonstrates the feasibility of a practical interface for always-available, continuous, and discreet AI voice conversations.

著者
Jun Rekimoto
Sony Computer Science Laboratories, Kyoto, Kyoto, Kyoto, Japan
Yu Nishimura
Sony Computer Science Laboratories, Tokyo, Japan
Bojian Yang
Sony Computer Science Laboratories, Tokyo, Japan

会議: CHI 2026

ACM CHI Conference on Human Factors in Computing Systems

セッション: Wearable, Audio and Novel Interactive Devices

P1 - Room 132
7 件の発表
2026-04-14 20:15:00
2026-04-14 21:45:00