Screen Recognition: Creating Accessibility Metadata for Mobile Applications from Pixels

Many accessibility features available on mobile platforms require applications (apps) to provide complete and accurate metadata describing user interface (UI) components. Unfortunately, many apps do not provide sufficient metadata for accessibility features to work as expected. In this paper, we explore inferring accessibility metadata for mobile apps from their pixels, as the visual interfaces often best reflect an app's full functionality. We trained a robust, fast, memory-efficient, on-device model to detect UI elements using a dataset of 77,637 screens (from 4,068 iPhone apps) that we collected and annotated. To further improve UI detections and add semantic information, we introduced heuristics (e.g., UI grouping and ordering) and additional models (e.g., recognize UI content, state, interactivity). We built Screen Recognition to generate accessibility metadata to augment iOS VoiceOver. In a study with 9 screen reader users, we validated that our approach improves the accessibility of existing mobile apps, enabling even previously inaccessible apps to be used.

Apple Inc, Seattle, Washington, United States

Apple Inc, Pittsburgh, Pennsylvania, United States

Apple Inc, Seattle, Washington, United States

Apple Inc, San Diego, California, United States

Apple Inc, Pittsburgh, Pennsylvania, United States

Apple Inc, Cupertino, California, United States

Apple Inc, Pittsburgh, Pennsylvania, United States

10.1145/3411764.3445186

https://doi.org/10.1145/3411764.3445186

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2021.acm.org/)

[A] Paper Room 01, 2021-05-10 17:00:00~2021-05-10 19:00:00 / [B] Paper Room 01, 2021-05-11 01:00:00~2021-05-11 03:00:00 / [C] Paper Room 01, 2021-05-11 09:00:00~2021-05-11 11:00:00

Paper Room 01

11 件の発表

開始日時2021-05-10 17:00:00

終了日時2021-05-10 19:00:00

読み込み中…

お気に入り

あとで読む

コレクション

Screen Recognition: Creating Accessibility Metadata for Mobile Applications from Pixels

要旨

受賞
Best Paper

著者

DOI

論文URL

動画

会議: CHI 2021

セッション: Accessible Content Creation

日本語まとめ

Screen Recognition: Creating Accessibility Metadata for Mobile Applications from Pixels

要旨

受賞Best Paper

著者

DOI

論文URL

動画

会議: CHI 2021

セッション: Accessible Content Creation

日本語まとめ

受賞
Best Paper