Peeking Ahead of the Field Study: Exploring VLM Personas as Support Tools for Embodied Studies in HCI

Abstract

Field studies are irreplaceable but costly, time-consuming, and error-prone, requiring careful preparation. Inspired by rapid prototyping in manufacturing, we propose a fast, low-cost evaluation method that uses Vision-Language Model (VLM) personas to simulate outcomes comparable to field results. While LLMs show human-like reasoning and language capabilities, autonomous vehicle (AV)-pedestrian interaction requires spatial awareness, emotional empathy, and behavioral generation. This raises our research question: to what extent can VLM personas mimic human responses in field studies? We conducted two parallel studies: 1) a real-world study with 20 participants, and 2) a video study using 20 VLM personas, both on a street-crossing task. We compared their responses and interviewed five HCI researchers on potential applications. Results show that VLM personas mimic human response patterns (e.g., average crossing times of 5.25 s vs. 5.07 s) but lack behavioral variability and depth. They show promise for formative studies, field study preparation, and human data augmentation.

Award
Honorable Mention
Authors
Xinyue Gui
The University of Tokyo, Tokyo, Japan
Ding Xia
The University of Tokyo, Tokyo, Japan
Mark Colley
UCL Interaction Centre, London, United Kingdom
Yuan Li
Keio University, Fujisawa, Japan
Vishal Chauhan
The University of Tokyo, Tokyo, Japan
Anubhav Anubhav
The University of Tokyo, Tokyo, Japan
Zhongyi Zhou
Google, Tokyo, Japan
Ehsan Javanmardi
The University of Tokyo, Tokyo, Japan
Stela Hanbyeol Seo
Kyoto University, Kyoto, Japan
Chia-Ming Chang
National Taiwan University of Arts, Taipei, Taiwan
Manabu Tsukada
The University of Tokyo, Tokyo, Japan
Takeo Igarashi
The University of Tokyo, Tokyo, Japan
Video

Conference: CHI 2026

ACM CHI Conference on Human Factors in Computing Systems

Session: Human-Robot Interaction & Embodied Sensing

P1 - Room 134
7 presentations
2026-04-15, 18:00–19:30