Detecting Gender Stereotypes: Lexicon vs. Supervised Learning Methods

要旨

Biases in language influence how we interact with each other and society at large. Language affirming gender stereotypes is often observed in various contexts today, from recommendation letters and Wikipedia entries to fiction novels and movie dialogue. Yet to date, there is little agreement on the methodology to quantify gender stereotypes in natural language (specifically the English language). Common methodology (including those adopted by companies tasked with detecting gender bias) rely on a lexicon approach largely based on the original BSRI study from 1974.<br>In this paper, we reexamine the role of gender stereotype detection in the context of modern tools, by comparatively analyzing efficacy of lexicon-based approaches and end-to-end, ML-based approaches prevalent in state-of-the-art natural language processing systems. Our efforts using a large dataset show that even compared to an updated lexicon-based approach, end-to-end classification approaches are significantly more robust and accurate, even when trained by moderately sized corpora.

受賞
Honorable Mention
キーワード
Gender Bias
Gender Stereotypes
Machine Learning
Natural Language Processing
Lexicon
著者
Jenna Cryan
University of Chicago, Chicago, IL, USA
Shiliang Tang
University of California, Santa Barbara, Santa Barbara, CA, USA
Xinyi Zhang
University of California, Santa Barbara, Santa Barbara, CA, USA
Miriam Metzger
University of California, Santa Barbara, Santa Barbara, CA, USA
Haitao Zheng
University of Chicago, Chicago, IL, USA
Ben Y. Zhao
University of Chicago, chicago, IL, USA
DOI

10.1145/3313831.3376488

論文URL

https://doi.org/10.1145/3313831.3376488

会議: CHI 2020

The ACM CHI Conference on Human Factors in Computing Systems (https://chi2020.acm.org/)

セッション: Gender++

Paper session
317AB KAHO'OLAWE
5 件の発表
2020-04-30 23:00:00
2020-05-01 00:15:00
日本語まとめ
読み込み中…