Video2Action: Reducing Human Interactions in Action Annotation of App Tutorial Videos

Abstract

Tutorial videos of mobile apps have become a popular and compelling way for users to learn unfamiliar app features. To make these videos accessible to users, creators typically need to annotate the actions in the video, including which actions are performed and where to tap. However, this process can be time-consuming and labor-intensive. In this paper, we introduce Video2Action, a lightweight approach that automatically generates action scenes and predicts action locations in a video using image-processing and deep-learning methods. Automated experiments demonstrate Video2Action's good performance in acquiring actions from videos, and a user study shows the usefulness of the generated action cues in assisting video creators with action annotation.
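The abstract describes first segmenting the video into action scenes with image processing before predicting action locations. As a hedged illustration only, not the paper's actual pipeline, the sketch below shows one common way such a segmentation step could be approximated: frame differencing with OpenCV to find spans where the screen content is changing. The function name, thresholds, and grouping heuristic are all hypothetical.

```python
# Illustrative sketch (assumed, not the paper's method): segment a screen
# recording into candidate "action scenes" by frame differencing.
import cv2
import numpy as np

def candidate_action_scenes(video_path, diff_thresh=2.0, min_len=5):
    """Return (start_frame, end_frame) spans where the screen is changing."""
    cap = cv2.VideoCapture(video_path)
    prev, spans, start, idx = None, [], None, 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        if prev is not None:
            # Mean absolute pixel difference between consecutive frames.
            change = float(np.mean(cv2.absdiff(gray, prev)))
            if change > diff_thresh and start is None:
                start = idx  # screen started changing: a scene begins
            elif change <= diff_thresh and start is not None:
                if idx - start >= min_len:  # drop very short flickers
                    spans.append((start, idx))
                start = None
        prev = gray
        idx += 1
    cap.release()
    return spans
```

Each returned span marks a stretch of visual change (e.g., a screen transition after a tap), which could then be handed to a detection model that localizes the action; the actual approach in the paper may differ substantially.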

Authors
Sidong Feng
Monash University, Melbourne, Victoria, Australia
Chunyang Chen
Monash University, Melbourne, Victoria, Australia
Zhenchang Xing
CSIRO's Data61 and Australian National University, Acton, ACT, Australia
Paper URL

https://doi.org/10.1145/3586183.3606778

Video

Conference: UIST 2023

ACM Symposium on User Interface Software and Technology

Session: Masterful Media: Audio and Video Authoring Tools

Gold Room
6 presentations
2023-10-30 23:20:00 – 2023-10-31 00:40:00