Eyes on Many: Evaluating Gaze, Hand, and Voice for Multi-Object Selection in Extended Reality

Abstract

Interacting with multiple objects simultaneously speeds up our interactions. A prerequisite to this interaction is selecting the objects, i.e., multi-object selection, which involves two steps: (1) toggling a multi-selection mode (mode-switching) and (2) selecting all the intended objects (subselection). In extended reality (XR), each step can be performed with the eyes, hands, or voice. To examine how these design choices affect user performance, we evaluated four mode-switching techniques (Semi-Pinch, Full-Pinch, Double-Pinch, and Voice) and three subselection techniques (Gaze+Dwell, Gaze+Pinch, and Gaze+Voice) in a user study. Results revealed that Double-Pinch paired with Gaze+Pinch yielded the highest overall performance, while Semi-Pinch achieved the lowest. Although Voice-based mode-switching showed benefits, Gaze+Voice subselection was less favored, as its repetitive vocal commands were perceived as tedious. Overall, these findings provide empirical insights and design recommendations for multi-selection techniques in XR.
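As an illustrative sketch (not part of the paper), the two-step structure described in the abstract can be modeled as a small state machine: a mode-switching event toggles multi-selection mode, after which subselection events accumulate targets until the mode is toggled off. All names below are hypothetical and stand in for whichever input (pinch, dwell, or voice) drives each step.

```python
from enum import Enum, auto


class Mode(Enum):
    SINGLE = auto()  # default single-object selection
    MULTI = auto()   # multi-selection mode is active


class MultiSelectController:
    """Hypothetical sketch of the two-step multi-object selection flow."""

    def __init__(self):
        self.mode = Mode.SINGLE
        self.selected = []  # objects accumulated while in MULTI mode

    def on_mode_switch(self):
        """Step 1: triggered by e.g. a double pinch or a spoken command."""
        if self.mode is Mode.SINGLE:
            self.mode = Mode.MULTI
            self.selected.clear()
        else:
            # Toggling off exits multi-selection and confirms the set.
            self.mode = Mode.SINGLE

    def on_subselect(self, gazed_object):
        """Step 2: triggered by e.g. gaze + pinch, gaze + dwell, or gaze + voice."""
        if self.mode is Mode.MULTI and gazed_object not in self.selected:
            self.selected.append(gazed_object)
        return list(self.selected)
```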

Authors
Mohammad Raihanul Bashar
Concordia University, Montreal, Quebec, Canada
Aunnoy K Mutasim
Simon Fraser University, Vancouver, British Columbia, Canada
Ken Pfeuffer
Aarhus University, Aarhus, Denmark
Anil Ufuk Batmaz
Concordia University, Montreal, Quebec, Canada

Conference: CHI 2026

ACM CHI Conference on Human Factors in Computing Systems

Session: Gaze as Input

P1 - Room 124
6 presentations
2026-04-15, 18:00–19:30