Using Open Data to Automatically Generate Localized Analogies

Numerical analogies (or "perspectives") that translate unfamiliar measurements into comparisons with familiar reference objects (e.g., "275,000 square miles is roughly as large as Texas") have been shown to aid readers' recall, estimation, and error detection for numbers. However, because familiar reference objects are culture-specific, analogies do not always generalize across audiences. Crowdsourcing perspectives has proven effective but is limited by scalability issues and a lack of crowdworking markets in many regions. In this research, we develop an automated technique for generating localized perspectives. We utilize several open data sources for relevance signals and develop a surprisingly simple model capable of localizing analogies to new audiences without any retraining from human judges. We validate the model by testing it in both a new domain and with a different linguistic audience residing in another country. We release the compiled dataset of 400,000 reference objects to the research community.

Microsoft Research, New York, New York, United States

Microsoft Research, NYC, New York, United States

https://doi.org/10.1145/3613904.3642638

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2024.acm.org/)

318B

5 件の発表

開始日時2024-05-13 20:00:00

終了日時2024-05-13 21:20:00

お気に入り

あとで読む

コレクション