GANSlider: How Users Control Generative Models for Images using Multiple Sliders with and without Feedforward Information

要旨

We investigate how multiple sliders with and without feedforward visualizations influence users' control of generative models. In an online study (N=138), we collected a dataset of people interacting with a generative adversarial network (StyleGAN2) in an image reconstruction task. We found that more control dimensions (sliders) significantly increase task difficulty and user actions. Visual feedforward partly mitigates this by enabling more goal-directed interaction. However, we found no evidence of faster or more accurate task performance. This indicates a tradeoff between feedforward detail and implied cognitive costs, such as attention. Moreover, we found that visualizations alone are not always sufficient for users to understand individual control dimensions. Our study quantifies fundamental UI design factors and resulting interaction behavior in this context, revealing opportunities for improvement in the UI design for interactive applications of generative models. We close by discussing design directions and further aspects.

著者
Hai Dang
University of Bayreuth, Bayreuth, Germany
Lukas Mecke
Bundeswehr University Munich, Munich, Germany
Daniel Buschek
University of Bayreuth, Bayreuth, Germany
論文URL

https://dl.acm.org/doi/abs/10.1145/3491102.3502141

動画

会議: CHI 2022

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2022.acm.org/)

セッション: Trust and Control in AI Systems

394
5 件の発表
2022-05-03 18:00:00
2022-05-03 19:15:00