WheelPose: Data Synthesis Techniques to Improve Pose Estimation Performance on Wheelchair Users

要旨

Existing pose estimation models perform poorly on wheelchair users due to a lack of representation in training data. We present a data synthesis pipeline to address this disparity in data collection and subsequently improve pose estimation performance for wheelchair users. Our configurable pipeline generates synthetic data of wheelchair users using motion capture data and motion generation outputs simulated in the Unity game engine. We validated our pipeline by conducting a human evaluation, investigating perceived realism, diversity, and an AI performance evaluation on a set of synthetic datasets from our pipeline that synthesized different backgrounds, models, and postures. We found our generated datasets were perceived as realistic by human evaluators, had more diversity than existing image datasets, and had improved person detection and pose estimation performance when fine-tuned on existing pose estimation models. Through this work, we hope to create a foothold for future efforts in tackling the inclusiveness of AI in a data-centric and human-centric manner with the data synthesis techniques demonstrated in this work. Finally, for future works to extend upon, we open source all code in this research and provide a fully configurable Unity Environment used to generate our datasets. In the case of any models we are unable to share due to redistribution and licensing policies, we provide detailed instructions on how to source and replace said models. All materials can be found at https://github.com/hilab-open-source/wheelpose.

著者
William Huang
Unversity of California, Los Angeles, Westwood, California, United States
Sam Ghahremani
University of California, Los Angeles, Los Angeles, California, United States
Siyou Pei
University of California, Los Angeles, Los Angeles, California, United States
Yang Zhang
University of California, Los Angeles, Los Angeles, California, United States
論文URL

doi.org/10.1145/3613904.3642555

動画

会議: CHI 2024

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2024.acm.org/)

セッション: Touch, Gesture and Posture

314
4 件の発表
2024-05-14 23:00:00
2024-05-15 00:20:00