Creative Professionals and AI A

Conference
CHI 2024
Unlocking Creator-AI Synergy: Challenges, Requirements, and Design Opportunities in AI-Powered Short-Form Video Production
Abstract

The emergence of AI-Powered Short-Form Video Generators (ASVGs) has showcased the potential to streamline production time and foster creative ideas. Despite their widespread adoption, research has underexplored ASVGs, especially from creators’ perspectives. To evaluate the role of ASVGs as creator-centered collaborators, we conducted mixed-method research: (1) interviews (N = 17) and (2) a participatory design workshop (N = 12) with short-form video creators. In our interviews, we investigated creators’ production processes and challenges in creating short-form videos. In participatory workshops, short-form video creators envisioned AI-powered video tools, addressing their requirements and perceptions of AI collaboration. Our findings indicate that ASVGs can provide various advantages, including inspiration, swift access to video sources, and automated highlight generation. At the same time, we underscore concerns arising from AI collaboration, including potential creator identity dilution, reduced creative output, and information bubbles. We also discuss considerations for designing ASVGs so that they retain their creative value.

Authors
Jini Kim
Carnegie Mellon University, Pittsburgh, Pennsylvania, United States
Hajun Kim
NEOWIZ, Seongnam-si, Gyeonggi-do, Korea, Republic of
Paper URL

https://doi.org/10.1145/3613904.3642476

ReelFramer: Human-AI Co-Creation for News-to-Video Translation
Abstract

Short videos on social media are the dominant way young people consume content. News outlets aim to reach audiences through news reels---short videos conveying news---but struggle to translate traditional journalistic formats into short, entertaining videos. To translate news into social media reels, we support journalists in reframing the narrative. In literature, narrative framing is a high-level structure that shapes the overall presentation of a story. We identified three narrative framings for reels that adapt social media norms but preserve news value, each with a different balance of information and entertainment. We introduce ReelFramer, a human-AI co-creative system that helps journalists translate print articles into scripts and storyboards. ReelFramer supports exploring multiple narrative framings to find one appropriate to the story. AI suggests foundational narrative details, including characters, plot, setting, and key information. ReelFramer also supports visual framing; AI suggests character and visual detail designs before generating a full storyboard. Our studies show that narrative framing introduces the necessary diversity to translate various articles into reels, and establishing foundational details helps generate scripts that are more relevant and coherent. We also discuss the benefits of using narrative framing and foundational details in content retargeting.

Authors
Sitong Wang
Columbia University, New York, New York, United States
Samia Menon
Columbia University, New York, New York, United States
Tao Long
Columbia University, New York, New York, United States
Keren Henderson
Syracuse University, Syracuse, New York, United States
Dingzeyu Li
Adobe Research, Seattle, Washington, United States
Kevin Crowston
Syracuse University, Syracuse, New York, United States
Mark Hansen
Columbia University, New York, New York, United States
Jeffrey V. Nickerson
Stevens Institute of Technology, Hoboken, New Jersey, United States
Lydia B. Chilton
Columbia University, New York, New York, United States
Paper URL

https://doi.org/10.1145/3613904.3642868

Understanding Nonlinear Collaboration between Human and AI Agents: A Co-design Framework for Creative Design
Abstract

Creative design is a nonlinear process where designers generate diverse ideas in the pursuit of an open-ended goal and converge towards consensus through iterative remixing. In contrast, AI-powered design tools often employ a linear sequence of incremental and precise instructions to approximate design objectives. Such operations violate customary creative design practices and thus hinder AI agents' ability to complete creative design tasks. To explore better human-AI co-design tools, we first summarize human designers’ practices through a formative study with 12 design experts. Taking graphic design as a representative scenario, we formulate a nonlinear human-AI co-design framework and develop a proof-of-concept prototype, OptiMuse. We evaluate OptiMuse and validate the nonlinear framework through a comparative study. We notice a subconscious change in people's attitudes towards AI agents, shifting from perceiving them as mere executors to regarding them as opinionated colleagues. This shift effectively fostered the exploration and reflection processes of individual designers.

Authors
Jiayi Zhou
Zhejiang University, Hangzhou, Zhejiang, China
Renzhong Li
State Key Lab of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China
Junxiu Tang
Zhejiang University, Hangzhou, Zhejiang, China
Tan Tang
Zhejiang University, Hangzhou, Zhejiang, China
Haotian Li
The Hong Kong University of Science and Technology, Hong Kong, China
Weiwei Cui
Microsoft Research Asia, Beijing, China
Yingcai Wu
Zhejiang University, Hangzhou, Zhejiang, China
Paper URL

https://doi.org/10.1145/3613904.3642812

PlantoGraphy: Incorporating Iterative Design Process into Generative Artificial Intelligence for Landscape Rendering
Abstract

Landscape renderings are realistic images of landscape sites, allowing stakeholders to better perceive and evaluate design ideas. While recent advances in Generative Artificial Intelligence (GAI) enable automated generation of landscape renderings, the end-to-end methods are not compatible with common design processes, leading to insufficient alignment with design idealizations and limited cohesion of iterative landscape design. Informed by a formative study for comprehending design requirements, we present PlantoGraphy, an iterative design system that allows for interactive configuration of GAI models to accommodate human-centered design practice. A two-stage pipeline is incorporated: first, the concretization module transforms conceptual ideas into concrete scene layouts with a domain-oriented large language model; and second, the illustration module converts scene layouts into realistic landscape renderings with a layout-guided diffusion model fine-tuned through Low-Rank Adaptation. PlantoGraphy has undergone a series of performance evaluations and user studies, demonstrating its effectiveness in landscape rendering generation and the strong user recognition of its interactive functionality.

Authors
Rong Huang
The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, Guangdong, China
Haichuan Lin
The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, China
Chuanzhang Chen
Lappeenranta-Lahti University of Technology, Lahti, Finland
Kang Zhang
Hong Kong University of Science and Technology (Guangzhou), Guangzhou, Guangdong, China
Wei Zeng
The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, Guangdong, China
Paper URL

https://doi.org/10.1145/3613904.3642824

Fashioning Creative Expertise with Generative AI: Graphical Interfaces for Design Space Exploration Better Support Ideation Than Text Prompts
Abstract

This paper investigates the potential impact of deep generative models on the work of creative professionals. We argue that current generative modeling tools lack critical features that would make them useful creativity support tools, and introduce our own tool, generative.fashion, which was designed with theoretical principles of design space exploration in mind. Through qualitative studies with fashion design apprentices, we demonstrate how generative.fashion supported both divergent and convergent thinking, and compare it with a state-of-the-art text-based interface using Stable Diffusion. In general, the apprentices preferred generative.fashion, citing the features explicitly designed to support ideation. In two follow-up studies, we provide quantitative results that support and expand on these insights. We conclude that text-only prompts in existing models restrict creative exploration, especially for novices. Our work demonstrates that interfaces which are theoretically aligned with principles of design space exploration are essential for unlocking the full creative potential of generative AI.

Authors
Richard Lee Davis
EPFL, Lausanne, Switzerland
Thiemo Wambsganss
Bern University of Applied Sciences, Bern, Switzerland
Wei Jiang
École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
Kevin Gonyop Kim
ETH Zurich, Zurich, Switzerland
Tanja Käser
EPFL, Lausanne, Switzerland
Pierre Dillenbourg
EPFL, Lausanne, Switzerland
Paper URL

https://doi.org/10.1145/3613904.3642908
