Giving Robots a Voice: Human-in-the-Loop Voice Creation and Open-Ended Labeling

Abstract

Speech is a natural interface for humans to interact with robots. Yet, aligning a robot's voice to its appearance is challenging due to the rich vocabulary of both modalities. Previous research has explored a few labels to describe robots and tested them on a limited number of robots and existing voices. Here, we develop a robot-voice creation tool followed by large-scale behavioral human experiments (N=2,505). First, participants collectively tune robotic voices to match 175 robot images using an adaptive human-in-the-loop pipeline. Then, participants describe their impression of the robot or their matched voice using another human-in-the-loop paradigm for open-ended labeling. The elicited taxonomy is then used to rate robot attributes and to predict the best voice for an unseen robot. We offer a web interface to aid engineers in customizing robot voices, demonstrating the synergy between cognitive science and machine learning for engineering tools.
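The listing gives no implementation details, but the adaptive human-in-the-loop idea can be illustrated with a minimal sketch: successive participants each adjust one synthesis parameter of a candidate voice while viewing a robot image, and the chain of adjustments gradually settles on a matching voice. Everything below (the parameter names, ranges, and the `get_slider_setting` placeholder) is hypothetical and not the authors' actual pipeline.

```python
import random

# Hypothetical voice-synthesis parameters and ranges; the real pipeline's
# parameters and synthesis backend are not specified on this page.
PARAM_RANGES = {
    "pitch_shift_semitones": (-12.0, 12.0),
    "speaking_rate": (0.5, 2.0),
    "formant_shift": (0.8, 1.2),
}


def get_slider_setting(robot_image, params, param_name, lo, hi):
    """Placeholder for one experimental trial: a participant sees the robot
    image, moves a slider over [lo, hi] for a single parameter, and hears the
    re-synthesized voice. Here the human response is simulated with a random
    value so the sketch runs end to end."""
    return random.uniform(lo, hi)


def tune_voice(robot_image, n_iterations=10, seed=0):
    """Coordinate-wise human-in-the-loop tuning: in each iteration, one
    parameter at a time is handed to a (new) participant while the remaining
    parameters stay fixed at their current values."""
    random.seed(seed)
    # Start from a random point in the parameter space.
    params = {name: random.uniform(lo, hi) for name, (lo, hi) in PARAM_RANGES.items()}
    for _ in range(n_iterations):
        for name, (lo, hi) in PARAM_RANGES.items():
            params[name] = get_slider_setting(robot_image, params, name, lo, hi)
    return params


if __name__ == "__main__":
    # Example: tune a voice for one (hypothetical) robot image.
    print(tune_voice("robot_042.png"))
```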

Authors
Pol van Rijn
Max Planck Institute for Empirical Aesthetics, Frankfurt am Main, Germany
Silvan Mertes
Augsburg University, Augsburg, Germany
Kathrin Janowski
Augsburg University, Augsburg, Germany
Katharina Weitz
Augsburg University, Augsburg, Germany
Nori Jacoby
Max Planck Institute for Empirical Aesthetics, Frankfurt am Main, Germany
Elisabeth André
Augsburg University, Augsburg, Germany
Paper URL

doi.org/10.1145/3613904.3642038

Conference: CHI 2024

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2024.acm.org/)

Session: Human-Robot Interaction A

Room: 318B
4 presentations
2024-05-14 23:00:00 – 2024-05-15 00:20:00