论文标题
这是一个已知的谎言:检测以前事实检查的主张
That is a Known Lie: Detecting Previously Fact-Checked Claims
论文作者
论文摘要
最近的“假新闻”的扩散引发了许多回应,最著名的是几项手动事实检查计划的出现。结果和随着时间的流逝,已经积累了大量事实检查的主张,这增加了社交媒体中的新主张或政治家的新声明可能已经被一些值得信赖的事实检查组织所事实检查,因为一个病毒式的索赔经常在社交媒体上回来,而在社交媒体上经常回来,以及重复他们最喜欢的言论,或者重复他们最喜欢的言论,又一次地又一次地又一次地又一次地又一次。由于手动事实检查非常耗时(并且完全自动的事实检查存在信誉问题),因此重要的是要节省这项工作并避免浪费时间对已经进行了事实检查的索赔的时间。有趣的是,尽管任务很重要,但迄今为止,研究界在很大程度上被研究所忽略了。在这里,我们的目标是弥合这一差距。特别是,我们制定了任务,并讨论了它与以前的工作的关系,但也有所不同。我们进一步创建了一个专门的数据集,我们将其发布给研究社区。最后,我们提出了学习到级的实验,这些实验证明了对最新检索和文本相似性方法的显着改进。
The recent proliferation of "fake news" has triggered a number of responses, most notably the emergence of several manual fact-checking initiatives. As a result and over time, a large number of fact-checked claims have been accumulated, which increases the likelihood that a new claim in social media or a new statement by a politician might have already been fact-checked by some trusted fact-checking organization, as viral claims often come back after a while in social media, and politicians like to repeat their favorite statements, true or false, over and over again. As manual fact-checking is very time-consuming (and fully automatic fact-checking has credibility issues), it is important to try to save this effort and to avoid wasting time on claims that have already been fact-checked. Interestingly, despite the importance of the task, it has been largely ignored by the research community so far. Here, we aim to bridge this gap. In particular, we formulate the task and we discuss how it relates to, but also differs from, previous work. We further create a specialized dataset, which we release to the research community. Finally, we present learning-to-rank experiments that demonstrate sizable improvements over state-of-the-art retrieval and textual similarity approaches.