Wrex: A Unified Programming-by-Example Interaction for Synthesizing Readable Code for Data Scientists

要旨

Data wrangling is a difficult and time-consuming activity in computational notebooks, and existing wrangling tools do not fit the exploratory workflow for data scientists in these environments. We propose a unified interaction model based on programming-by-example that generates readable code for a variety of useful data transformations, implemented as a Jupyter notebook extension called Wrex. User study results demonstrate that data scientists are significantly more effective and efficient at data wrangling with Wrex over manual programming. Qualitative participant feedback indicates that Wrex was useful and reduced barriers in having to recall or look up the usage of various data transform functions. The synthesized code allowed data scientists to verify the intended data transformation, increased their trust and confidence in Wrex, and fit seamlessly within their cell-based notebook workflows. This work suggests that presenting readable code to professional data scientists is an indispensable component of offering data wrangling tools in notebooks.

受賞
Best Paper
キーワード
Computational Notebooks
Program Synthesis
Data Science
著者
Ian Drosos
University of California, San Diego, La Jolla, CA, USA
Titus Barik
Microsoft, Redmond, WA, USA
Philip J. Guo
University of California, San Diego, La Jolla, CA, USA
Robert DeLine
Microsoft, Redmond, WA, USA
Sumit Gulwani
Microsoft, Redmond, WA, USA
DOI

10.1145/3313831.3376442

論文URL

https://doi.org/10.1145/3313831.3376442

会議: CHI 2020

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2020.acm.org/)

セッション: Computational notebooks & tutorials

Paper session
312 NI'IHAU
5 件の発表
2020-04-30 01:00:00
2020-04-30 02:15:00
日本語まとめ
読み込み中…