VoicePilot: Harnessing LLMs as Speech Interfaces for Assistive Robotics

要旨

Physically assistive robots present an opportunity to significantly increase the well-being and independence of individuals with motor impairments or other forms of disability who are unable to complete activities of daily living. Speech interfaces, especially ones that utilize Large Language Models (LLMs), can enable individuals to effectively and naturally communicate high-level commands and nuanced preferences to robots. Frameworks for integrating LLMs as interfaces to robots for high level task planning and code generation have been proposed, but fail to incorporate human-centric considerations which are essential while developing assistive interfaces. In this work, we present a framework for incorporating LLMs as speech interfaces for physically assistive robots, constructed iteratively with 3 stages of testing involving a feeding robot, culminating in an evaluation with 11 older adults at an independent living facility. We use both quantitative and qualitative data from the final study to validate our framework and additionally provide design guidelines for using LLMs as speech interfaces for assistive robots. Videos, code, and supporting files are located on our project website\footnote{\url{https://sites.google.com/andrew.cmu.edu/voicepilot/}}

著者
Akhil Padmanabha
Carnegie Mellon University, Pittsburgh, Pennsylvania, United States
Jessie Yuan
Carnegie Mellon University, Pittsburgh, Pennsylvania, United States
Janavi Gupta
Carnegie Mellon University, Pittsburgh, Pennsylvania, United States
Zulekha Karachiwalla
Carnegie Mellon, Pittsburgh, Pennsylvania, United States
Carmel Majidi
Carnegie Mellon University, Pittsburgh, Pennsylvania, United States
Henny Admoni
Carnegie Mellon University, Pittsburgh, Pennsylvania, United States
Zackory Erickson
Carnegie Mellon University, Pittsburgh, Pennsylvania, United States
論文URL

https://doi.org/10.1145/3654777.3676401

動画

会議: UIST 2024

ACM Symposium on User Interface Software and Technology

セッション: 3. LLM: New applications

Westin: Allegheny 3
4 件の発表
2024-10-16 20:00:00
2024-10-16 21:00:00