3. AI as Copilot

Conference Name
UIST 2024
DiscipLink: Unfolding Interdisciplinary Information Seeking Process via Human-AI Co-Exploration
Abstract

Interdisciplinary studies often require researchers to explore literature in diverse branches of knowledge. Yet, navigating through the highly scattered knowledge from unfamiliar disciplines poses a significant challenge. In this paper, we introduce DiscipLink, a novel interactive system that facilitates collaboration between researchers and large language models (LLMs) in interdisciplinary information seeking (IIS). Based on users' topic of interest, DiscipLink initiates exploratory questions from the perspectives of potentially relevant fields of study, and users can further tailor these questions. DiscipLink then supports users in searching and screening papers under selected questions by automatically expanding queries with discipline-specific terminology, extracting themes from retrieved papers, and highlighting the connections between papers and questions. Our evaluation, comprising a within-subject comparative experiment and an open-ended exploratory study, reveals that DiscipLink can effectively support researchers in breaking down disciplinary boundaries and integrating scattered knowledge from diverse fields. The findings underscore the potential of LLM-powered tools in fostering information-seeking practices and bolstering interdisciplinary research.
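
The query-expansion step described above translates naturally into code. Below is a minimal, hypothetical Python sketch of that idea; the `llm` stub, the prompt wording, and the query format are illustrative assumptions, not DiscipLink's actual implementation.

```python
import json

def llm(prompt: str) -> str:
    """Stub for any chat-completion call; swap in a real LLM client here."""
    raise NotImplementedError  # hypothetical: the paper's model choice is not assumed

def expand_query(question: str, discipline: str, n_terms: int = 5) -> list[str]:
    """Ask the LLM which terms a given field uses for this topic, then
    derive one search query per term so retrieval covers that field's jargon."""
    prompt = (
        f"Give {n_terms} search terms that researchers in {discipline} "
        f"would use for the topic of this question, as a JSON array of "
        f"strings:\n{question}"
    )
    terms = json.loads(llm(prompt))
    return [f"{question} {term}" for term in terms]
```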

Authors
Chengbo Zheng
Hong Kong University of Science and Technology, Hong Kong, Hong Kong
Yuanhao Zhang
Hong Kong University of Science and Technology, Hong Kong, China
Zeyu Huang
The Hong Kong University of Science and Technology, New Territories, Hong Kong
Chuhan Shi
Southeast University, Nanjing, China
Minrui Xu
HKUST, Hong Kong, China
Xiaojuan Ma
Hong Kong University of Science and Technology, Hong Kong, Hong Kong
Paper URL

https://doi.org/10.1145/3654777.3676366

Improving Steering and Verification in AI-Assisted Data Analysis with Interactive Task Decomposition
Abstract

LLM-powered tools like ChatGPT Data Analysis have the potential to help users tackle the challenging task of data analysis programming, which requires expertise in data processing, programming, and statistics. However, our formative study (n=15) uncovered serious challenges in verifying AI-generated results and steering the AI (i.e., guiding the AI system to produce the desired output). We developed two contrasting approaches to address these challenges. The first (Stepwise) decomposes the problem into step-by-step subgoals with pairs of editable assumptions and code until task completion, while the second (Phasewise) decomposes the entire problem into three editable, logical phases: structured input/output assumptions, execution plan, and code. A controlled, within-subjects experiment (n=18) compared these systems against a conversational baseline. Users reported significantly greater control with the Stepwise and Phasewise systems, and found intervention, correction, and verification easier, compared to the baseline. The results suggest design guidelines and trade-offs for AI-assisted data analysis tools.
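
The Stepwise design pairs each subgoal with editable assumptions and code. A minimal sketch of that structure, assuming one shared execution namespace, might look like the following; the class names and fields are hypothetical, not the paper's implementation.

```python
from dataclasses import dataclass, field

@dataclass
class Step:
    """One subgoal: editable natural-language assumptions paired with code."""
    subgoal: str
    assumptions: str  # surfaced to the user for verification and editing
    code: str         # regenerated whenever the assumptions are edited

@dataclass
class StepwiseSession:
    task: str
    steps: list[Step] = field(default_factory=list)

    def run(self, env: dict) -> dict:
        """Execute accepted steps in order in one shared namespace, so each
        step can build on variables produced by the previous one."""
        for step in self.steps:
            exec(step.code, env)  # a real tool would sandbox this
        return env
```

Making the assumptions a first-class, editable field alongside the code is one plausible reading of where the reported gains in steering and verification come from.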

Authors
Majeed Kazemitabaar
University of Toronto, Toronto, Ontario, Canada
Jack Williams
Microsoft Research, Cambridge, United Kingdom
Ian Drosos
Microsoft Research, Cambridge, United Kingdom
Tovi Grossman
University of Toronto, Toronto, Ontario, Canada
Austin Z. Henley
Microsoft, Redmond, Washington, United States
Carina Negreanu
Microsoft Research, Cambridge, Cambridgeshire, United Kingdom
Advait Sarkar
Microsoft Research, Cambridge, United Kingdom
Paper URL

https://doi.org/10.1145/3654777.3676345

VizGroup: An AI-assisted Event-driven System for Collaborative Programming Learning Analytics
Abstract

Programming instructors often conduct collaborative learning activities, like Peer Instruction, to foster a deeper understanding in students and enhance their engagement with learning. These activities, however, may not always yield productive outcomes due to the diversity of students' mental models and their ineffective collaboration. In this work, we introduce VizGroup, an AI-assisted system that enables programming instructors to easily oversee students' real-time collaborative learning behaviors during large programming courses. VizGroup leverages Large Language Models (LLMs) to recommend event specifications for instructors so that they can simultaneously track and receive alerts about key correlation patterns between various collaboration metrics and ongoing coding tasks. We evaluated VizGroup with 12 instructors in a comparison study using a dataset collected from a Peer Instruction activity conducted in a large programming lecture. The results showed that VizGroup helped instructors effectively get an overview of, narrow down, and track nuances in students' behaviors.
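
An "event specification" of the kind VizGroup's LLM recommends can be pictured as a named predicate over live collaboration metrics that triggers an alert. The sketch below is a hypothetical illustration; the metric names, thresholds, and alert format are invented, not the system's actual schema.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class EventSpec:
    """A trackable pattern: alert the instructor when the predicate
    holds for a group's live collaboration metrics."""
    name: str
    predicate: Callable[[dict], bool]
    message: str

# Hypothetical per-group metrics, e.g. {"idle_sec": 120, "msg_count": 3}
SPECS = [
    EventSpec(
        name="stalled group",
        predicate=lambda m: m["idle_sec"] > 90 and m["msg_count"] < 5,
        message="Little coding activity and little discussion; may need help.",
    ),
]

def check(group_id: str, metrics: dict) -> None:
    """Evaluate every spec against one group's latest metrics."""
    for spec in SPECS:
        if spec.predicate(metrics):
            print(f"[{spec.name}] group {group_id}: {spec.message}")
```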

Authors
Xiaohang Tang
Virginia Tech, Blacksburg, Virginia, United States
Sam Wong
University of Washington, Seattle, Washington, United States
Kevin Pu
University of Toronto, Toronto, Ontario, Canada
Xi Chen
Virginia Tech, Blacksburg, Virginia, United States
Yalong Yang
Georgia Institute of Technology, Atlanta, Georgia, United States
Yan Chen
Virginia Tech, Blacksburg, Virginia, United States
Paper URL

https://doi.org/10.1145/3654777.3676347

Who did it? How User Agency is influenced by Visual Properties of Generated Images
Abstract

The increasing proliferation of AI and GenAI requires new interfaces tailored to where their specific affordances meet human requirements. As GenAI is capable of taking over tasks from users on an unprecedented scale, designing the experience of agency (if and how users experience control over the process and responsibility over the outcome) is crucial. As an initial step towards design guidelines for shaping agency, we present a study that explores how features of AI-generated images influence users' experience of agency. We use two measures: temporal binding to implicitly estimate pre-reflective agency, and magnitude estimation to assess user judgments of agency. We observe that abstract images lead to more temporal binding than images with semantic meaning. In contrast, the closer an image aligns with what a user might expect, the higher the agency judgment. When comparing the experiment results with objective metrics of image differences, we find that temporal binding results correlate with semantic differences, while agency judgments are better explained by local differences between images. This work contributes towards a future where agency is considered an important design dimension for GenAI interfaces.
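
For readers unfamiliar with the implicit measure: temporal binding is typically quantified as the compression of the perceived interval between an action and its outcome, with stronger compression read as a stronger pre-reflective sense of agency. The toy computation below illustrates that measure in general; it is not the authors' analysis pipeline, and the numbers are invented.

```python
def temporal_binding(actual_ms: list[float], estimated_ms: list[float]) -> float:
    """Mean interval compression (actual minus estimated): larger positive
    values mean action and outcome felt closer together in time, the
    classic implicit signature of a stronger sense of agency."""
    if len(actual_ms) != len(estimated_ms):
        raise ValueError("one estimate per trial is required")
    return sum(a - e for a, e in zip(actual_ms, estimated_ms)) / len(actual_ms)

# Toy data: 400 ms intervals judged as ~320 ms on average -> 80 ms of binding.
print(temporal_binding([400.0, 400.0], [330.0, 310.0]))  # 80.0
```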

Authors
Johanna K. Didion
University College Dublin, Dublin, Ireland
Krzysztof Wolski
MPI Informatik, Saarbrücken, Germany
Dennis Wittchen
Dresden University of Applied Sciences, Dresden, Saxony, Germany
David Coyle
University College Dublin, Dublin, Ireland
Thomas Leimkühler
MPI Informatik, Saarbrücken, Germany
Paul Strohmeier
Max Planck Institute for Informatics, Saarland Informatics Campus, Saarbrücken, Germany
Paper URL

https://doi.org/10.1145/3654777.3676335

FathomGPT: A Natural Language Interface for Interactively Exploring Ocean Science Data
Abstract

We introduce FathomGPT, an open-source system for the interactive investigation of ocean science data via a natural language interface. FathomGPT was developed in close collaboration with marine scientists to enable researchers and ocean enthusiasts to explore and analyze the FathomNet image database. FathomGPT provides a custom information retrieval pipeline that leverages OpenAI’s large language models to enable the creation of complex queries to retrieve images, taxonomic information, and scientific measurements; the mapping of common names and morphological features to scientific names; the generation of interactive charts on demand; and search by image or by specified patterns within an image. In designing FathomGPT, particular emphasis was placed on enhancing the user's experience by facilitating free-form exploration and optimizing response times. We present an architectural overview and implementation details of FathomGPT, along with a series of ablation studies that demonstrate the effectiveness of our approach to name resolution, fine-tuning, and prompt modification. Additionally, we present usage scenarios of interactive data exploration sessions and document feedback from ocean scientists and machine learning experts.
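
Since the abstract names OpenAI's models and a name-resolution step, here is a minimal sketch of what mapping a common name to a scientific name could look like with the OpenAI Python client. The prompt, model choice, and function name are assumptions for illustration, not FathomGPT's code.

```python
from openai import OpenAI

client = OpenAI()  # expects OPENAI_API_KEY in the environment

def resolve_name(description: str) -> str:
    """Map a common name or morphological description to a scientific
    name that can be matched against FathomNet taxonomy records."""
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # assumed model; the paper does not specify one
        messages=[
            {"role": "system",
             "content": "Reply with only the scientific (Latin) name of "
                        "the marine organism the user describes."},
            {"role": "user", "content": description},
        ],
    )
    return response.choices[0].message.content.strip()

# e.g. resolve_name("giant Pacific octopus") -> "Enteroctopus dofleini"
```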

Authors
Nabin Khanal
Purdue University, West Lafayette, Indiana, United States
Chun Meng Yu
Purdue University, West Lafayette, Indiana, United States
Jui-Cheng Chiu
Purdue University, West Lafayette, Indiana, United States
Anav Chaudhary
Purdue University, West Lafayette, Indiana, United States
Ziyue Zhang
Purdue University, West Lafayette, Indiana, United States
Kakani Katija
Monterey Bay Aquarium Research Institute, Moss Landing, California, United States
Angus G. Forbes
Purdue University, West Lafayette, Indiana, United States
Paper URL

https://doi.org/10.1145/3654777.3676462

VRCopilot: Authoring 3D Layouts with Generative AI Models in VR
Abstract

Immersive authoring provides an intuitive medium for users to create 3D scenes via direct manipulation in Virtual Reality (VR). Recent advances in generative AI have enabled the automatic creation of realistic 3D layouts. However, it is unclear how the capabilities of generative AI can be used in immersive authoring to support fluid interactions, user agency, and creativity. We introduce VRCopilot, a mixed-initiative system that integrates pre-trained generative AI models into immersive authoring to facilitate human-AI co-creation in VR. VRCopilot presents multimodal interactions to support rapid prototyping and iteration with AI, and intermediate representations such as wireframes to augment user controllability over the created content. Through a series of user studies, we evaluated the potential and challenges of manual, scaffolded, and automatic creation in immersive authoring. We found that scaffolded creation using wireframes enhanced user agency compared to automatic creation. We also found that manual creation via multimodal specification offers the highest sense of creativity and agency.
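
The wireframe intermediate representation can be pictured as a user-drawn bounding box that the generative model must respect, deciding only what fills it. The sketch below is a hypothetical rendering of that idea; the field names and the `suggest` callback are invented, not VRCopilot's data model.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Wireframe:
    """A user-drawn box constraining where, and how large, a generated
    object may be; the model only decides what fills it."""
    center: tuple[float, float, float]  # room coordinates, metres
    size: tuple[float, float, float]    # width, height, depth, metres
    label: str = ""                     # optional semantic hint, e.g. "sofa"

def fill_wireframe(w: Wireframe, suggest: Callable[[str, tuple], str]) -> dict:
    """Resolve one wireframe into a concrete scene object; `suggest`
    stands in for the pre-trained generative layout model."""
    asset = suggest(w.label, w.size)
    return {"asset": asset, "position": w.center, "extent": w.size}
```

Keeping placement and scale in the user's hands while delegating only the choice of object is one plausible reading of why scaffolded creation preserved agency in the study.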

Authors
Lei Zhang
University of Michigan, Ann Arbor, Michigan, United States
Jin Pan
University of Michigan, Ann Arbor, Michigan, United States
Jacob Gettig
University of Michigan, Ann Arbor, Michigan, United States
Steve Oney
University of Michigan, Ann Arbor, Michigan, United States
Anhong Guo
University of Michigan, Ann Arbor, Michigan, United States
Paper URL

https://doi.org/10.1145/3654777.3676451
