Performative Vocal Synthesis for Foreign Language Intonation Practice

Typical foreign language (L2) pronunciation training focuses mainly on individual sounds. Intonation, the patterns of pitch change across words or phrases is often neglected, despite its key role in word-level intelligibility and in the expression of attitudes and affect. This paper examines hand-controlled real-time vocal synthesis, known as Performative Vocal Synthesis (PVS), as an interaction technique for practicing L2 intonation in computer aided pronunciation training (CAPT). We evaluate a tablet-based interface where users gesturally control the pitch of a pre-recorded utterance by drawing curves on the touchscreen. 24 subjects (12 French learners, 12 British controls) imitated English phrases with their voice and the interface. Results of an acoustic analysis and expert perceptive evaluation showed that learners’ gestural imitations yielded more accurate results than vocal imitations of the fall-rise intonation pattern typically difficult for francophones, suggesting that PVS can help learners produce intonation patterns beyond the capabilities of their natural voice.

Léonard de Vinci Pôle Universitaire, Research Center, Paris La Défense, France

Sorbonne Nouvelle, Paris, France

Sorbonne Université, Paris, France

Sorbonne Nouvelle, Paris, France

CNRS Sorbonne Université, Paris, France

https://doi.org/10.1145/3544548.3581210

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2023.acm.org/)

Hall C

6 件の発表

開始日時2023-04-26 20:10:00

終了日時2023-04-26 21:35:00

お気に入り