Rambler: Supporting Writing With Speech via LLM-Assisted Gist Manipulation

要旨

Dictation enables efficient text input on mobile devices. However, writing with speech can produce disfluent, wordy, and incoherent text and thus requires heavy post-processing. This paper presents Rambler, an LLM-powered graphical user interface that supports gist-level manipulation of dictated text with two main sets of functions: gist extraction and macro revision. Gist extraction generates keywords and summaries as anchors to support the review and interaction with spoken text. LLM-assisted macro revisions allow users to respeak, split, merge, and transform dictated text without specifying precise editing locations. Together they pave the way for interactive dictation and revision that help close gaps between spontaneously spoken words and well-structured writing. In a comparative study with 12 participants performing verbal composition tasks, \tool outperformed the baseline of a speech-to-text editor + ChatGPT, as it better facilitates iterative revisions with enhanced user control over the content while supporting surprisingly diverse user strategies.

著者
Susan Lin
UC Berkeley, Berkeley, California, United States
Jeremy Warner
UC Berkeley, Berkeley, California, United States
J.D. Zamfirescu-Pereira
UC Berkeley, Berkeley, California, United States
Matthew G. Lee
UC Berkeley, Berkeley, California, United States
Sauhard Jain
University of California, Berkeley, Berkeley, California, United States
Shanqing Cai
Google, Mountain View, California, United States
Piyawat Lertvittayakumjorn
Google, Mountain View, California, United States
Michael Xuelin Huang
Google, Mountain View, California, United States
Shumin Zhai
Google, Mountain View, California, United States
Bjoern Hartmann
UC Berkeley, Berkeley, California, United States
Can Liu
City University of Hong Kong, Hong Kong, China
論文URL

https://doi.org/10.1145/3613904.3642217

動画

会議: CHI 2024

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2024.acm.org/)

セッション: Writing and AI B

310 Lili'u Theater
4 件の発表
2024-05-15 23:00:00
2024-05-16 00:20:00