CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild

要旨

Non-intrusive, real-time analysis of the dynamics of the eye region allows us to monitor humans’ visual attention allocation and estimate their mental state during the performance of real-world tasks, which can potentially benefit a wide range of human-computer interaction (HCI) applications. While commercial eye-tracking devices have been frequently employed, the difficulty of customizing these devices places unnecessary constraints on the exploration of more efficient, end-to-end models of eye dynamics. In this work, we propose CLERA, a unified model for Cognitive Load and Eye Region Analysis, which achieves precise keypoint detection and spatiotemporal tracking in a joint-learning framework. Our method demonstrates significant efficiency and outperforms prior work on tasks including cognitive load estimation, eye landmark detection, and blink estimation. We also introduce a large-scale dataset of 30k human faces with joint pupil, eye-openness, and landmark annotation, which aims at supporting future HCI research on human factors and eye-related analysis.

著者
Li Ding
Umass Amherst, Amherst, Massachusetts, United States
Jack Terwilliger
University of California San Diego, La Jolla, California, United States
Aishni Parab
University of California, Los Angeles, Los Angeles, California, United States
Meng Wang
UMass Amherst, Amherst, Massachusetts, United States
Lex Fridman
MIT, Cambridge, Massachusetts, United States
Bruce Mehler
Massachusetts Institute of Technology, Cambridge, Massachusetts, United States
Bryan Reimer
MIT, Cambridge, Massachusetts, United States
動画

会議: CHI 2024

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2024.acm.org/)

セッション: Hand and Gaze

314
5 件の発表
2024-05-16 01:00:00
2024-05-16 02:20:00