论文标题

极端:Parler和Gab中仇恨言论的比较分析

Going Extreme: Comparative Analysis of Hate Speech in Parler and Gab

论文作者

Israeli, Abraham, Tsur, Oren

论文摘要

近年来,其用户群的大幅增长,诸如GAB和PARLER之类的社交平台(如GAB和PARLER)的烙印。这种受欢迎程度主要归因于Twitter,Facebook和Reddit等主流平台执行的更严格的节制。在这项工作中,我们提供了对Parler的仇恨语音的第一个大规模分析。 考虑到仇恨语言检测的一系列算法,证明了该领域转移学习的局限性,鉴于仇恨语音的传达方式的虚幻和不断变化的性质。为了提高分类精度,我们注释了10K Parler帖子,我们用来微调BERT分类器。然后,通过社交网络上的标签传播将单个帖子的分类用于将数百万用户分类。通过将用户分类为传播仇恨的倾向,我们发现仇恨贩子占16.1 \%的Parler Active用户,并且与其他用户群体相比,他们具有独特的特征。我们发现,仇恨贩子更加活跃,更中心和表达不同的情感,并传达了诸如愤怒和悲伤之类的各种情感。我们通过比较Parler中发现的趋势和GAB中发现的趋势进一步补充了我们的分析。 据我们所知,这是最早以定量方式和用户级别分析Parler中仇恨言论的作品,也是第一个提供给社区的注释数据集。

Social platforms such as Gab and Parler, branded as `free-speech' networks, have seen a significant growth of their user base in recent years. This popularity is mainly attributed to the stricter moderation enforced by mainstream platforms such as Twitter, Facebook, and Reddit. In this work we provide the first large scale analysis of hate-speech on Parler. We experiment with an array of algorithms for hate-speech detection, demonstrating limitations of transfer learning in that domain, given the illusive and ever changing nature of the ways hate-speech is delivered. In order to improve classification accuracy we annotated 10K Parler posts, which we use to fine-tune a BERT classifier. Classification of individual posts is then leveraged for the classification of millions of users via label propagation over the social network. Classifying users by their propensity to disseminate hate, we find that hate mongers make 16.1\% of Parler active users, and that they have distinct characteristics comparing to other user groups. We find that hate mongers are more active, more central and express distinct levels of sentiment and convey a distinct array of emotions like anger and sadness. We further complement our analysis by comparing the trends discovered in Parler and those found in Gab. To the best of our knowledge, this is among the first works to analyze hate speech in Parler in a quantitative manner and on the user level, and the first annotated dataset to be made available to the community.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源