Promptify: Text-to-Image Generation through Interactive Prompt Exploration with Large Language Models

Text-to-image generative models have demonstrated remarkable capabilities in generating high-quality images based on textual prompts. However, crafting prompts that accurately capture the user's creative intent remains challenging. It often involves laborious trial-and-error procedures to ensure that the model interprets the prompts in alignment with the user's intention. To address these challenges, we present Promptify, an interactive system that supports prompt exploration and refinement for text-to-image generative models. Promptify utilizes a suggestion engine powered by large language models to help users quickly explore and craft diverse prompts. Our interface allows users to organize the generated images flexibly, and based on their preferences, Promptify suggests potential changes to the original prompt. This feedback loop enables users to iteratively refine their prompts and enhance desired features while avoiding unwanted ones. Our user study shows that Promptify effectively facilitates the text-to-image workflow, allowing users to create visually appealing images on their first attempt while requiring significantly less cognitive load than a widely-used baseline tool.

University of Toronto, Toronto, Ontario, Canada

Dalhousie University, Halifax, Nova Scotia, Canada

University of Toronto, Toronto, Ontario, Canada

https://doi.org/10.1145/3586183.3606725

ACM Symposium on User Interface Software and Technology

Gold Room

6 件の発表

開始日時2023-11-01 19:50:00

終了日時2023-11-01 21:10:00

お気に入り

あとで読む

コレクション