DiLLS: Interactive Diagnosis of LLM-based Multi-agent Systems via Layered Summary of Agent Behaviors

Large language model (LLM)-based multi-agent systems have demonstrated impressive capabilities in handling complex tasks. However, the complexity of agentic behaviors makes these systems difficult to understand. When failures occur, developers often struggle to identify root causes and to determine actionable paths for improvement. Traditional methods that rely on inspecting raw log records are inefficient, given both the large volume and complexity of data. To address this challenge, we propose a framework and an interactive system, DiLLS, designed to reveal and structure the behaviors of multi-agent systems. The key idea is to organize information across three levels of query completion: activities, actions, and operations. By probing the multi-agent system through natural language, DiLLS derives and organizes information about planning and execution into a structured, multi-layered summary. Through a user study, we show that DiLLS significantly improves developers’ effectiveness and efficiency in identifying, diagnosing, and understanding failures in LLM-based multi-agent systems.

The Hong Kong University of Science and Technology, Hong Kong, China

Tongji University, Shanghai, Shanghai, China

Southeast University, Nanjing, China

University of Waterloo, Waterloo, Ontario, Canada

The Hong Kong University of Science and Technology, Hong Kong, China

ETH Zürich, Zürich, Switzerland

ACM CHI Conference on Human Factors in Computing Systems

P1 - Room 134

7 presentations

Start time: 2026-04-17 18:00:00

End time: 2026-04-17 19:30:00

Conference: CHI 2026

Session: Multi-Agent Reasoning Systems for Sensemaking and Planning