Dango: A Mixed-Initiative Data Wrangling System using Large Language Model

Data wrangling is a time-consuming and challenging task in the early stages of a data science pipeline. However, existing tools often fail to effectively interpret user intent. We propose Dango, a mixed-initiative multi-agent system that helps users generate data wrangling scripts. Compared to existing tools, Dango enhances user communication of intent by: (1) allowing users to demonstrate on multiple tables and use natural language prompts in a conversation interface, (2) enabling users to clarify their intent by answering LLM-posed multiple-choice clarification questions, and (3) providing multiple forms of feedback such as step-by-step NL explanations and data provenance to help users evaluate the data wrangling scripts. In a within-subjects, think-aloud study (n=38), the results show that Dango's features can significantly improve intent clarification, accuracy, and efficiency in data wrangling tasks.

Purdue University, West Lafayette, Indiana, United States

Huazhong University of Science and Technology, Wuhan, China

University of Iowa, Iowa City, Iowa, United States

Purdue University, West Lafayette, Indiana, United States

10.1145/3706598.3714135

https://dl.acm.org/doi/10.1145/3706598.3714135

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2025.acm.org/)

Annex Hall F206

7 件の発表

開始日時2025-04-30 01:20:00

終了日時2025-04-30 02:50:00

読み込み中…