WaitGPT: Monitoring and Steering Conversational LLM Agent in Data Analysis with On-the-Fly Code Visualization

要旨

Large language models (LLMs) support data analysis through conversational user interfaces, as exemplified in OpenAI's ChatGPT (formally known as Advanced Data Analysis or Code Interpreter). Essentially, LLMs produce code for accomplishing diverse analysis tasks. However, presenting raw code can obscure the logic and hinder user verification. To empower users with enhanced comprehension and augmented control over analysis conducted by LLMs, we propose a novel approach to transform LLM-generated code into an interactive visual representation. In the approach, users are provided with a clear, step-by-step visualization of the LLM-generated code in real time, allowing them to understand, verify, and modify individual data operations in the analysis. Our design decisions are informed by a formative study (N=8) probing into user practice and challenges. We further developed a prototype named WaitGPT and conducted a user study (N=12) to evaluate its usability and effectiveness. The findings from the user study reveal that WaitGPT facilitates monitoring and steering of data analysis performed by LLMs, enabling participants to enhance error detection and increase their overall confidence in the results.

著者
Liwenhan Xie
The Hong Kong University of Science and Technology, Hong Kong, China
Chengbo Zheng
Hong Kong University of Science and Technology, Hong Kong, Hong Kong
Haijun Xia
University of California, San Diego, San Diego, California, United States
Huamin Qu
The Hong Kong University of Science and Technology, Hong Kong, China
Chen Zhu-Tian
University of Minnesota-Twin Cities, Minneapolis, Minnesota, United States
論文URL

https://doi.org/10.1145/3654777.3676374

動画

会議: UIST 2024

ACM Symposium on User Interface Software and Technology

セッション: 3. LLM: New applications

Westin: Allegheny 3
4 件の発表
2024-10-16 20:00:00
2024-10-16 21:00:00