WorldSmith: A Multi-Modal Image Synthesis Tool for Fictional World Building

Abstract

Crafting a rich and unique environment is crucial for fictional world-building, but can be difficult to achieve since illustrating a world from scratch requires time and significant skill. We investigate the use of recent multi-modal image generation systems to enable users to iteratively visualize and modify elements of their fictional world using a combination of text input, sketching, and region-based filling. WorldSmith enables novice world builders to quickly visualize a fictional world with layered edits and hierarchical compositions. Through a formative study (4 participants) and a first-use study (13 participants), we demonstrate that WorldSmith offers more expressive interactions with prompt-based models. With this work, we explore how creatives can be empowered to leverage prompt-based generative AI as a tool in their creative process, beyond current "click-once" prompting UI paradigms.

Authors
Hai Dang
University of Bayreuth, Bayreuth, Germany
Frederik Brudy
Autodesk Research, Toronto, Ontario, Canada
George Fitzmaurice
Autodesk Research, Toronto, Ontario, Canada
Fraser Anderson
Autodesk Research, Toronto, Ontario, Canada
Paper URL

https://doi.org/10.1145/3586183.3606772

Conference: UIST 2023

ACM Symposium on User Interface Software and Technology

Session: Teamwork Triumphs: Collaborative Experiences

Venetian Room
6 presentations
2023-10-31 23:10:00
2023-11-01 00:30:00