WatchHand: Enabling Continuous Hand Pose Tracking On Off-the-Shelf Smartwatches

要旨

Tracking hand poses on wrist-wearables enables rich, expressive interactions, yet remains unavailable on commercial smartwatches, as prior implementations rely on external sensors or custom hardware, limiting their real-world applicability. To address this, we present WatchHand, the first continuous 3D hand pose tracking system implemented on off-the-shelf smartwatches using only their built-in speaker and microphone. WatchHand emits inaudible frequency-modulated continuous waves and captures their reflections from the hand. These acoustic signals are processed by a deep-learning model that estimates 3D hand poses for 20 finger joints. We evaluate WatchHand across diverse real-world conditions---multiple smartwatch models, wearing-hands, body postures, noise conditions, pose-variation protocols---and achieve a mean per-joint position error of 7.87 mm in cross-session tests with device remounting. Although performance drops for unseen users or gestures, the model adapts effectively with lightweight fine-tuning on small amounts of data. Overall, WatchHand lowers the barrier to smartwatch-based hand tracking by eliminating additional hardware while enabling robust, always-available interactions on millions of existing devices.

著者
Jiwan Kim
KAIST, Daejeon, Korea, Republic of
Chi-Jung Lee
Cornell University, Ithaca, New York, United States
Hohurn Jung
KAIST, Deajon, Korea, Republic of
Tianhong Catherine. Yu
Cornell University, Ithaca, New York, United States
Ruidong Zhang
Cornell University, Ithaca, New York, United States
Ian Oakley
KAIST, Daejeon, Korea, Republic of
Cheng Zhang
Cornell University, ITHACA, New York, United States

会議: CHI 2026

ACM CHI Conference on Human Factors in Computing Systems

セッション: Hand Pose & Gestures

P1 - Room 127
7 件の発表
2026-04-13 20:15:00
2026-04-13 21:45:00