论文标题
为什么这么炎症?自动检测炎症社交媒体用户的解释性
Why So Inflammatory? Explainability in Automatic Detection of Inflammatory Social Media Users
论文作者
论文摘要
仇恨言论和错误信息,分布在Facebook和Twitter等社交网络服务(SNS)上,在全球各国激怒了种族和政治暴力。我们认为,在全球南方的背景下,对这个问题的研究有限,并提出了解决问题的方法。先前的工作已经显示了如何使用用户级交互功能构建的机器学习模型可以有效地识别传播炎症内容的用户。尽管这种技术在低资源语言设置中是有益的,在低资源语言设置中,语言资源(例如地面真相数据和处理能力)尚不清楚这些相互作用特征如何促进模型性能。在这项工作中,我们调查并显示了传播炎症含量的用户与其他不使用的其他人之间的相互作用特征的显着差异,并应用了解释性工具来了解我们的训练有素的模型。我们发现,相互作用显着性较高的功能(例如帐户年龄和活动数量)显示出比具有较低互动意义的特征(例如名称长度以及用户在其生物上的位置)更高的解释能力。我们的工作扩展了研究方向,旨在了解低资源,高风险环境中炎症含量的性质,因为全球南方南方的社交媒体使用的增长超出了节制工作。
Hate speech and misinformation, spread over social networking services (SNS) such as Facebook and Twitter, have inflamed ethnic and political violence in countries across the globe. We argue that there is limited research on this problem within the context of the Global South and present an approach for tackling them. Prior works have shown how machine learning models built with user-level interaction features can effectively identify users who spread inflammatory content. While this technique is beneficial in low-resource language settings where linguistic resources such as ground truth data and processing capabilities are lacking, it is still unclear how these interaction features contribute to model performance. In this work, we investigate and show significant differences in interaction features between users who spread inflammatory content and others who do not, applying explainability tools to understand our trained model. We find that features with higher interaction significance (such as account age and activity count) show higher explanatory power than features with lower interaction significance (such as name length and if the user has a location on their bio). Our work extends research directions that aim to understand the nature of inflammatory content in low-resource, high-risk contexts as the growth of social media use in the Global South outstrips moderation efforts.