WorldSmith: A Multi-Modal Image Synthesis Tool for Fictional World Building

Crafting a rich and unique environment is crucial for fictional world-building, but can be difficult to achieve since illustrating a world from scratch requires time and significant skill. We investigate the use of recent multi-modal image generation systems to enable users iteratively visualize and modify elements of their fictional world using a combination of text input, sketching, and region-based filling. WorldSmith enables novice world builders to quickly visualize a fictional world with layered edits and hierarchical compositions. Through a formative study (4 participants) and first-use study (13 participants) we demonstrate that WorldSmith offers more expressive interactions with prompt-based models. With this work, we explore how creatives can be empowered to leverage prompt-based generative AI as a tool in their creative process, beyond current "click-once" prompting UI paradigms.

University of Bayreuth, Bayreuth, Germany

Autodesk Research, Toronto, Ontario, Canada

https://doi.org/10.1145/3586183.3606772

ACM Symposium on User Interface Software and Technology

Venetian Room

6 件の発表

開始日時2023-10-31 23:10:00

終了日時2023-11-01 00:30:00

お気に入り

あとで読む

コレクション

要旨

著者

論文URL

動画

会議: UIST 2023

セッション: Teamwork Triumphs: Collaborative Experiences