Partiality and Misconception: Investigating Cultural Representativeness in Text-to-Image Models

要旨

Text-to-image (T2I) models enable users worldwide to create high-definition and realistic images through text prompts, where the underrepresentation and potential misinformation of images have raised growing concerns. However, few existing works examine cultural representativeness, especially involving whether the generated content can fairly and accurately reflect global cultures. Combining automated and human methods, we investigate this issue in multiple dimensions quantificationally and conduct a set of evaluations on three prevailing T2I models (DALL-E v2, Stable Diffusion v1.5 and v2.1). Introducing attributes of cultural cluster and subject, we provide a fresh interdisciplinary perspective to bias analysis. The benchmark dataset UCOGC is presented, which encompasses authentic images of unique cultural objects from global clusters. Our results reveal that the culture of a disadvantaged country is prone to be neglected, some specified subjects often present a stereotype or a simple patchwork of elements, and over half of cultural objects are mispresented.

著者
Lili Zhang
Hainan University, Haikou, China
Xi Liao
Hainan University, Haikou, China
Zaijia Yang
Hainan University, Haikou, China
Baihang Gao
Hainan University, Haikou, China
Chunjie Wang
Hainan University, Haikou, China
Qiuling Yang
Hainan University, Haikou, China
Deshun Li
Hainan University, Haikou, China
論文URL

doi.org/10.1145/3613904.3642877

動画

会議: CHI 2024

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2024.acm.org/)

セッション: Indigeonus Communities and Cutural Heritage B

313C
4 件の発表
2024-05-14 18:00:00
2024-05-14 19:20:00