The ORBIT India Dataset: Understanding the Challenges of Collecting a Disability-First AI Dataset in Low-Resource Environments

Computer vision systems are increasingly used by blind individuals to navigate their lives, helping, for example, locate objects such as doors or chairs. Yet these recognition systems do not work for many personal objects a blind user might want to find, such as keys or a special notebook. In response, efforts created personalized recognition systems, where individuals train their phones to identify and locate things, like a coffee mug or white cane, using example images/videos. However, these tools are trained on data from high-resource contexts, not necessarily reflecting India’s material culture. This paper discusses the contribution of the ORBIT-India dataset, which extends these tools to the Indian context, home of the world’s largest blind population. The ORBIT-India dataset comprises 105,243 images from 587 videos, representing 76 unique objects. We use this experience to examine dataset collection practices translated from high- to low-resource settings, providing recommendations to support cross-geography dataset collection.

Swansea University, Swansea, Wales, United Kingdom

Microsoft Research, Cambridge, United Kingdom

Microsoft Research EMEA, Boston , Massachusetts, United States

Microsoft Research, Sydney, Australia

Swansea University, Swansea, United Kingdom

Swansea University, Swansea, Wales, United Kingdom

Swansea University, Swansea, United Kingdom

ACM CHI Conference on Human Factors in Computing Systems

P1 - Room 111

6 件の発表

開始日時2026-04-13 20:15:00

終了日時2026-04-13 21:45:00

お気に入り

あとで読む

コレクション

The ORBIT India Dataset: Understanding the Challenges of Collecting a Disability-First AI Dataset in Low-Resource Environments

要旨

著者

会議: CHI 2026

セッション: Diversity and Inclusion