Don't Look at the Data! How Differential Privacy Reconfigures the Practices of Data Science

要旨

Across academia, government, and industry, data stewards are facing increasing pressure to make datasets more openly accessible for researchers while also protecting the privacy of data subjects. Differential privacy (DP) is one promising way to offer privacy along with open access, but further inquiry is needed into the tensions between DP and data science. In this study, we conduct interviews with 19 data practitioners who are non-experts in DP as they use a DP data analysis prototype to release privacy-preserving statistics about sensitive data, in order to understand perceptions, challenges, and opportunities around using DP. We find that while DP is promising for providing wider access to sensitive datasets, it also introduces challenges into every stage of the data science workflow. We identify ethics and governance questions that arise when socializing data scientists around new privacy constraints and offer suggestions to better integrate DP and data science.

著者
Jayshree Sarathy
Harvard University, Cambridge, Massachusetts, United States
Sophia Song
UC Berkeley, Berkeley, California, United States
Audrey Haque
Harvard University, Cambridge, Massachusetts, United States
Tania Schlatter
Harvard University, Cambridge, Massachusetts, United States
Salil Vadhan
Harvard University, Cambridge, Massachusetts, United States
論文URL

https://doi.org/10.1145/3544548.3580791

動画

会議: CHI 2023

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2023.acm.org/)

セッション: Data practices, permissions, and consent

Room Y01+Y02
6 件の発表
2023-04-25 20:10:00
2023-04-25 21:35:00