CLARIS: Clear and Intelligible Speech from Whispered and Dysarthric Voices

Whispered and dysarthric speech hinder effective communication and undermine the reliability of voice-enabled systems. We present CLARIS, a compact speech-to-speech restoration system that turns such atypical input into clear, expressive speech. CLARIS requires no disorder-specific architectural tuning, generalizes across languages, and adapts quickly to new accents and speakers, enabling practical personalization. On whispered English, Hindi, and clinically challenging dysarthric speech, CLARIS delivers state-of-the-art intelligibility and naturalness, with listener studies confirming gains in quality, intelligibility, naturalness, and prosody. The system runs in real time, converting one second of input in about 30ms and enables inclusive, private, and personalized voice interaction. Audio samples are available at https://claris-w2s.github.io/CLARIS/

TCS Research, Pune, Maharashtra, India

CVIT, IIIT Hyderabad, Hyderabad, Telangana, India

TCS Research, Pune, Maharashtra, India

IIIT Hyderabad, Hyderabad, India

ACM CHI Conference on Human Factors in Computing Systems

P1 - Room 120

7 件の発表

開始日時2026-04-15 20:15:00

終了日時2026-04-15 21:45:00

お気に入り

あとで読む

コレクション

要旨

著者

会議: CHI 2026

セッション: Sound, Music, and Dance Accessibility