DiLLS: Interactive Diagnosis of LLM-based Multi-agent Systems via Layered Summary of Agent Behaviors

要旨

Large language model (LLM)-based multi-agent systems have demonstrated impressive capabilities in handling complex tasks. However, the complexity of agentic behaviors makes these systems difficult to understand. When failures occur, developers often struggle to identify root causes and to determine actionable paths for improvement. Traditional methods that rely on inspecting raw log records are inefficient, given both the large volume and complexity of data. To address this challenge, we propose a framework and an interactive system, DiLLS, designed to reveal and structure the behaviors of multi-agent systems. The key idea is to organize information across three levels of query completion: activities, actions, and operations. By probing the multi-agent system through natural language, DiLLS derives and organizes information about planning and execution into a structured, multi-layered summary. Through a user study, we show that DiLLS significantly improves developers’ effectiveness and efficiency in identifying, diagnosing, and understanding failures in LLM-based multi-agent systems.

著者
Rui Sheng
The Hong Kong University of Science and Technology, Hong Kong, China
Yukun Yang
Tongji University, Shanghai, Shanghai, China
Chuhan Shi
Southeast University, Nanjing, China
Yanna Lin
University of Waterloo, Waterloo, Ontario, Canada
Zixin Chen
The Hong Kong University of Science and Technology, Hong Kong, China
Huamin Qu
The Hong Kong University of Science and Technology, Hong Kong, China
Furui Cheng
ETH Zürich, Zürich, Switzerland

会議: CHI 2026

ACM CHI Conference on Human Factors in Computing Systems

セッション: Multi-Agent Reasoning Systems for Sensemaking and Planning

P1 - Room 134
7 件の発表
2026-04-17 18:00:00
2026-04-17 19:30:00