XCreation: A Graph-Based Crossmodal Generative Creativity Support Tool

要旨

Creativity Support Tools (CSTs) aid in the efficient and effective composition of creative content, such as picture books. However, many existing CSTs only allow for mono-modal creation, whereas previous studies have become theoretically and technically mature to support multi-modal innovative creations. To overcome this limitation, we introduce XCreation, a novel CST that leverages generative AI to support cross-modal storybook creation. Nevertheless, directly deploying AI models to CSTs can still be problematic as they are mostly black-box architectures that are not comprehensible to human users. Therefore, we integrate an interpretable entity-relation graph to intuitively represent picture elements and their relations, improving the usability of the underlying generative structures. Our between-subject user study demonstrates that XCreation supports continuous plot creation with increased creativity, controllability, usability, and interpretability. XCreation is applicable to various scenarios, including interactive storytelling and picture book creation, thanks to its multimodal nature.

著者
Zihan Yan
MIT Media Lab, Cambridge, Massachusetts, United States
Chunxu Yang
UCLA, Los Angeles, California, United States
Qihao Liang
National University of Singapore, Singapore, Singapore
Xiang 'Anthony' Chen
UCLA, Los Angeles, California, United States
論文URL

https://doi.org/10.1145/3586183.3606826

動画

会議: UIST 2023

ACM Symposium on User Interface Software and Technology

セッション: Creative Visions: Creativity Support Tools

Venetian Room
6 件の発表
2023-10-31 19:50:00
2023-10-31 21:10:00