Despite the widespread use of rule-based tools for online content moderation, human moderators still spend considerable time monitoring these tools to ensure they work as intended. Based on surveys and interviews with Reddit moderators who use AutoModerator, we identified the main challenges in reducing the false positives and false negatives of automated rules: moderators cannot estimate a rule's actual effect in advance, and they have difficulty determining how rules should be updated. To address these issues, we built ModSandbox, a novel virtual sandbox system that detects possible false positives and false negatives of a rule and visualizes which part of the rule is causing them. We conducted a comparative, between-subjects study with online content moderators to evaluate how ModSandbox helps improve automated rules. Results show that ModSandbox supports quickly finding possible false positives and false negatives of automated rules and guides moderators in revising rules to reduce future errors.
https://doi.org/10.1145/3544548.3581057
The ACM CHI Conference on Human Factors in Computing Systems (https://chi2023.acm.org/)