Is It AI or Is It Me? Understanding Users’ Prompt Journey with Text-to-Image Generative AI Tools

Abstract

Generative Artificial Intelligence (AI) has witnessed unprecedented growth in text-to-image AI tools. Yet, much remains unknown about users' prompt journey with such tools in the wild. In this paper, we posit that designing human-centered text-to-image AI tools requires a clear understanding of how individuals intuitively approach crafting prompts, and what challenges they may encounter. To address this, we conducted semi-structured interviews with 19 existing users of a text-to-image AI tool. Our findings (1) offer insights into users’ prompt journey including structures and processes for writing, evaluating, and refining prompts in text-to-image AI tools and (2) indicate that users must overcome barriers to aligning AI to their intents, and mastering prompt crafting knowledge. From the findings, we discuss the prompt journey as an individual yet a social experience and highlight opportunities for aligning text-to-image AI tools and users’ intents.

Authors
Atefeh Mahdavi Goloujeh
Georgia Institute of Technology, Atlanta, Georgia, United States
Anne Sullivan
Georgia Institute of Technology, Atlanta, Georgia, United States
Brian Magerko
Georgia Tech, Atlanta, Georgia, United States
Paper URL

doi.org/10.1145/3613904.3642861

Conference: CHI 2024

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2024.acm.org/)

Session: Creativity: Visualizations and AI

318B
5 presentations
2024-05-13 23:00:00
2024-05-14 00:20:00