Treading the Transparency Tightrope: A Taxonomy of Risks and Benefits of Foundation Model Data Transparency for Transparency Advocates

Data powering AI is often opaque. Researchers, NGOs, and law and policy leaders have called for greater transparency about how data is used for training, fine-tuning, and evaluation. While data transparency is often championed as crucial, what it concretely enables is largely implicit. Similarly, the concerns developers seem to have about transparency go unstated. This lack of clarity has led some researchers to critique transparency demands as disconnected from the actual benefits—or risks—to specific stakeholders. We analyze documentation from four stakeholder groups to create a taxonomy of the risks and benefits of dataset transparency. Data transparency is perceived as either a risk or a benefit given a stakeholder's position, rather than wholesale. We also propose data availability and data documentation as two lenses through which to consider transparency. We discuss how best to strategically promote situational data transparency that takes into account the relationship between stakeholder position, transparency modality, and benefits/risks.

Sony AI, Broomfield, Colorado, United States

Sony AI, Zurich, Switzerland

Sony AI, Los Angeles, California, United States

Sony AI, New York, New York, United States

Sony AI, Seattle, Washington, United States

Sony AI, London, United Kingdom

ACM CHI Conference on Human Factors in Computing Systems

P1 - Room 128

6 件の発表

開始日時2026-04-13 20:15:00

終了日時2026-04-13 21:45:00

お気に入り

あとで読む

コレクション

Treading the Transparency Tightrope: A Taxonomy of Risks and Benefits of Foundation Model Data Transparency for Transparency Advocates

要旨

著者

会議: CHI 2026

セッション: Trust and Transparency in Everyday Life