论文标题
衡量在线社区中反社会行为的普遍性
Measuring the Prevalence of Anti-Social Behavior in Online Communities
论文作者
论文摘要
随着人们对在线反社会行为(例如人身攻击和偏执)的关注,至关重要的是,准确地说明反社会行为的广泛性是至关重要的。在本文中,我们从经验上衡量了世界上最受欢迎的在线社区平台之一中反社会行为的普遍性。我们将这一目标运行是在Reddit上97个最受欢迎的社区中衡量未经修改的评论的比例,违反了八个广泛接受的平台规范。为了实现这一目标,我们为确定这些违规行为和一种自举抽样方法贡献了人类管道,以量化测量不确定性。我们发现,2016年的所有评论中有6.25%(95%的置信区间[5.36%,7.13%],而2020-2021的4.28%(95%CI [2.50%,6.26%])违反了这些规范。大多数反社会行为仍然没有改造:主持人在2016年仅删除了二十个违规评论中的一个,而十分之十的违反评论在2020年。个人攻击是违反规范的最普遍类别;色情和偏执是最有可能调节的,而政治上的炎症性评论和厌女症/庸俗的调节是最不可能得到的。本文提供了一种方法和一组经验结果,用于跟踪这些现象,因为社会实践(例如,适度)和技术实践(例如,设计)的发展。
With increasing attention to online anti-social behaviors such as personal attacks and bigotry, it is critical to have an accurate accounting of how widespread anti-social behaviors are. In this paper, we empirically measure the prevalence of anti-social behavior in one of the world's most popular online community platforms. We operationalize this goal as measuring the proportion of unmoderated comments in the 97 most popular communities on Reddit that violate eight widely accepted platform norms. To achieve this goal, we contribute a human-AI pipeline for identifying these violations and a bootstrap sampling method to quantify measurement uncertainty. We find that 6.25% (95% Confidence Interval [5.36%, 7.13%]) of all comments in 2016, and 4.28% (95% CI [2.50%, 6.26%]) in 2020-2021, are violations of these norms. Most anti-social behaviors remain unmoderated: moderators only removed one in twenty violating comments in 2016, and one in ten violating comments in 2020. Personal attacks were the most prevalent category of norm violation; pornography and bigotry were the most likely to be moderated, while politically inflammatory comments and misogyny/vulgarity were the least likely to be moderated. This paper offers a method and set of empirical results for tracking these phenomena as both the social practices (e.g., moderation) and technical practices (e.g., design) evolve.