论文标题
理解抽象摘要句子之间的对应关系
Understanding Points of Correspondence between Sentences for Abstractive Summarization
论文作者
论文摘要
融合包含不同内容的句子是一种非凡的人类能力,有助于创建内容丰富和简洁的摘要。对于现代抽象摘要的人来说,对于人类而言,这一简单的任务仍然具有挑战性,从而实质上限制了其在现实情况下的适用性。在本文中,我们通过引入信件点的概念,对从文档中得出的句子进行融合的调查进行了调查,这些句子的概念是将任何两个句子绑在一起的凝聚力设备。对应点的类型由文本凝聚力理论描述,涵盖代词和名义引用,重复及以后。我们创建一个包含文档,源和融合句子的数据集以及句子之间对应点的人类注释。我们的数据集弥合了Coreference分辨率和汇总之间的差距。它是公开共享的,以作为衡量句子融合系统成功的未来工作的基础。 (https://github.com/ucfnlp/points-of-correspessence)
Fusing sentences containing disparate content is a remarkable human ability that helps create informative and succinct summaries. Such a simple task for humans has remained challenging for modern abstractive summarizers, substantially restricting their applicability in real-world scenarios. In this paper, we present an investigation into fusing sentences drawn from a document by introducing the notion of points of correspondence, which are cohesive devices that tie any two sentences together into a coherent text. The types of points of correspondence are delineated by text cohesion theory, covering pronominal and nominal referencing, repetition and beyond. We create a dataset containing the documents, source and fusion sentences, and human annotations of points of correspondence between sentences. Our dataset bridges the gap between coreference resolution and summarization. It is publicly shared to serve as a basis for future work to measure the success of sentence fusion systems. (https://github.com/ucfnlp/points-of-correspondence)