Front Row: Automatically Generating Immersive Audio Representations of Tennis Broadcasts for Blind Viewers

要旨

Blind and low-vision (BLV) people face challenges watching sports due to the lack of accessibility of sports broadcasts. Currently, BLV people rely on descriptions from TV commentators, radio announcers, or their friends to understand the game. These descriptions, however, do not allow BLV viewers to visualize the action by themselves. We present Front Row, a system that automatically generates an immersive audio representation of sports broadcasts, specifically tennis, allowing BLV viewers to more directly perceive what is happening in the game. Front Row first recognizes gameplay from the video feed using computer vision, then renders players’ positions and shots via spatialized (3D) audio cues. User evaluations with 12 BLV participants show that Front Row gives BLV viewers a more accurate understanding of the game compared to TV and radio, enabling viewers to form their own opinions on players' moods and strategies. We discuss future implications of Front Row and illustrate several applications, including a Front Row plug-in for video streaming platforms to enable BLV people to visualize the action in sports videos across the Web.

著者
Gaurav Jain
Columbia University, New York, New York, United States
Basel Hindi
Columbia University , New York, New York, United States
Connor Courtien
Hunter College, New York, New York, United States
Xin Yi Therese Xu
Pomona College, Claremont, California, United States
Conrad Wyrick
University of Florida, Gainesville, Florida, United States
Michael C. Malcolm
SUNY At Albany, Albany, New York, United States
Brian A.. Smith
Columbia University, New York, New York, United States
論文URL

https://doi.org/10.1145/3586183.3606830

動画

会議: UIST 2023

ACM Symposium on User Interface Software and Technology

セッション: Inclusive Interactions: Accessibility Techniques and Systems

Gold Room
6 件の発表
2023-10-31 18:00:00
2023-10-31 19:20:00