FusAIn: Composing Generative AI Visual Prompts Using Pen-based Interaction

Although current generative AI (GenAI) enables designers to create novel images, its focus on text-based and whole-image interaction limits expressive engagement with visual materials. Based on the design concept of deconstruction and reconstruction of digital visual attributes for visual prompts, we present FusAIn, a GenAI prompt composition tool that lets designers create personalized pens by loading them with objects or attributes such as color or texture. GenAI then fuses the pen's contents to create new images. Extracting and reusing inspirational material matches designers' existing work practices, making GenAI more contextualized for professional design. A study with 12 designers shows how FusAIn improves their ability to define visual details at different levels that are difficult to express with current GenAI prompts. Pen-based interaction lets them maintain fine-grained control over generated results, increasing GenAI image's editability and reusability. We discuss the benefits of "composition as prompts" and directions for future research.

Université Paris-Saclay, CNRS, Inria, Orsay, France

UMR 9189 CRIStAL, Lille, France

Université Paris-Saclay, CNRS, Inria, Orsay, France

10.1145/3706598.3714027

https://dl.acm.org/doi/10.1145/3706598.3714027

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2025.acm.org/)

G304

7 件の発表

開始日時2025-05-01 18:00:00

終了日時2025-05-01 19:30:00

読み込み中…

お気に入り