DiLogics: Creating Web Automation Programs with Diverse Logics

要旨

Knowledge workers frequently encounter repetitive web data entry tasks, like updating records or placing orders. Web automation increases productivity, but translating tasks to web actions accurately and extending to new specifications is challenging. Existing tools can automate tasks that perform the same logical trace of UI actions (e.g., input text in each field in order), but do not support tasks requiring different executions based on varied input conditions. We present DiLogics, a programming-by-demonstration system that utilizes NLP to assist users in creating web automation programs that handle diverse specifications. DiLogics first semantically segments input data to structured task steps. By recording user demonstrations for each step, DiLogics generalizes the web macros to novel but semantically similar task requirements. Our evaluation showed that non-experts can effectively use DiLogics to create automation programs that fulfill diverse input instructions. DiLogics provides an efficient, intuitive, and expressive method for developing web automation programs satisfying diverse specifications.

著者
Kevin Pu
University of Toronto, Toronto, Ontario, Canada
Jim Yang
University of Toronto, Toronto, Ontario, Canada
Angel Yuan
University of Toronto, Toronto, Ontario, Canada
Minyi Ma
University of Toronto, Toronto, Ontario, Canada
Rui Dong
University of Michigan, Ann Arbor, Michigan, United States
Xinyu Wang
University of Michigan, Ann Arbor, Michigan, United States
Yan Chen
University of Toronto, Toronto, Ontario, Canada
Tovi Grossman
University of Toronto, Toronto, Ontario, Canada
論文URL

https://doi.org/10.1145/3586183.3606822

動画

会議: UIST 2023

ACM Symposium on User Interface Software and Technology

セッション: Code Craftsmanship: Programming Support Tools

Gold Room
7 件の発表
2023-11-01 01:00:00
2023-11-01 02:20:00