AQuA: Automated Question-Answering in Software Tutorial Videos with Visual Anchors

要旨

Tutorial videos are a popular help source for learning feature-rich software. However, getting quick answers to questions about tutorial videos is difficult. We present an automated approach for responding to tutorial questions. By analyzing 633 questions found in 5,944 video comments, we identified different question types and observed that users frequently described parts of the video in questions. We then asked participants (N=24) to watch tutorial videos and ask questions while annotating the video with relevant visual anchors. Most visual anchors referred to UI elements and the application workspace. Based on these insights, we built AQuA, a pipeline that generates useful answers to questions with visual anchors. We demonstrate this for Fusion 360, showing that we can recognize UI elements in visual anchors and generate answers using GPT-4 augmented with that visual information and software documentation. An evaluation study (N=16) demonstrates that our approach provides better answers than baseline methods.

著者
Saelyne Yang
Autodesk Research, Toronto, Ontario, Canada
Jo Vermeulen
Autodesk Research, Toronto, Ontario, Canada
George Fitzmaurice
Autodesk Research, Toronto, Ontario, Canada
Justin Matejka
Autodesk Research, Toronto, Ontario, Canada
論文URL

doi.org/10.1145/3613904.3642752

動画

会議: CHI 2024

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2024.acm.org/)

セッション: Supporting Programmers and Learners A

324
5 件の発表
2024-05-13 20:00:00
2024-05-13 21:20:00