Treading the Transparency Tightrope: A Taxonomy of Risks and Benefits of Foundation Model Data Transparency for Transparency Advocates

要旨

Data powering AI is often opaque. Researchers, NGOs, and law and policy leaders have called for greater transparency about how data is used for training, fine-tuning, and evaluation. While data transparency is often championed as crucial, what it concretely enables is largely implicit. Similarly, the concerns developers seem to have about transparency go unstated. This lack of clarity has led some researchers to critique transparency demands as disconnected from the actual benefits—or risks—to specific stakeholders. We analyze documentation from four stakeholder groups to create a taxonomy of the risks and benefits of dataset transparency. Data transparency is perceived as either a risk or a benefit given a stakeholder's position, rather than wholesale. We also propose data availability and data documentation as two lenses through which to consider transparency. We discuss how best to strategically promote situational data transparency that takes into account the relationship between stakeholder position, transparency modality, and benefits/risks.

著者
Morgan Klaus. Scheuerman
Sony AI, Broomfield, Colorado, United States
Wiebke Hutiri
Sony AI, Zurich, Switzerland
Aida Rahmattalabi
Sony AI, Los Angeles, California, United States
Victoria Matthews
Sony AI, New York, New York, United States
Alice Xiang
Sony AI, Seattle, Washington, United States
Jerone Andrews
Sony AI, London, United Kingdom

会議: CHI 2026

ACM CHI Conference on Human Factors in Computing Systems

セッション: Trust and Transparency in Everyday Life

P1 - Room 128
6 件の発表
2026-04-13 20:15:00
2026-04-13 21:45:00