From Provenance to Aberrations: Image Creator and Screen Reader User Perspectives on Alt Text for AI-Generated Images

AI-generated images are proliferating as a new visual medium. However, state-of-the-art image generation models do not output alternative (alt) text with their images, rendering them largely inaccessible to screen reader users (SRUs). Moreover, less is known about what information would be most desirable to SRUs in this new medium. To address this, we invited AI image creators and SRUs to evaluate alt text prepared from various sources and write their own alt text for AI images. Our mixed-methods analysis makes three contributions. First, we highlight creators’ perspectives on alt text, as creators are well-positioned to write descriptions of their images. Second, we illustrate SRUs’ alt text needs particular to the emerging medium of AI images. Finally, we discuss the promises and pitfalls of utilizing text prompts written as input for AI models in alt text generation, and areas where broader digital accessibility guidelines could expand to account for AI images.

Northeastern University, Boston, Massachusetts, United States

Google, Seattle, Washington, United States

Google DeepMind, Seattle, Washington, United States

Google Research, Boulder, Colorado, United States

Google, New York, New York, United States

https://doi.org/10.1145/3613904.3642325

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2024.acm.org/)

313B

5 件の発表

開始日時2024-05-14 23:00:00

終了日時2024-05-15 00:20:00

お気に入り