multiverse: Multiplexing Alternative Data Analyses in R Notebooks

要旨

There are myriad ways to analyse a dataset. But which one to trust? In the face of such uncertainty, analysts may adopt multiverse analysis: running all reasonable analyses on the dataset. Yet this is cognitively and technically difficult with existing tools—how does one specify and execute all combinations of reasonable analyses of a dataset?—and often requires discarding existing workflows. We present multiverse, a tool for implementing multiverse analyses in R with expressive syntax supporting existing computational notebook workflows. multiverse supports building up a multiverse through local changes to a single analysis and optimises execution by pruning redundant computations. We evaluate how multiverse supports programming multiverse analyses using (a) principles of cognitive ergonomics to compare with two existing multiverse tools; and (b) case studies based on semi-structured interviews with researchers who have successfully implemented an end-to-end analysis using multiverse. We identify design tradeoffs (e.g. increased flexibility versus learnability), and suggest future directions for multiverse tool design.

受賞
Honorable Mention
著者
Abhraneel Sarma
Northwestern University, Evanston, Illinois, United States
Alex Kale
University of Washington, Seattle, Washington, United States
Michael Jongho. Moon
University of Toronto, Toronto, Ontario, Canada
Nathan Taback
University of Toronto, Toronto, Ontario, Canada
Fanny Chevalier
University of Toronto, Toronto, Ontario, Canada
Jessica Hullman
Northwestern University, Evanston, Illinois, United States
Matthew Kay
Northwestern University, Chicago, Illinois, United States
論文URL

https://doi.org/10.1145/3544548.3580726

動画

会議: CHI 2023

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2023.acm.org/)

セッション: Data Analyses and Representation

Hall F
6 件の発表
2023-04-25 18:00:00
2023-04-25 19:30:00