利用图来改善抽象性多文件摘要

论文标题

利用图来改善抽象性多文件摘要

Leveraging Graph to Improve Abstractive Multi-Document Summarization

论文作者

Li, Wei, Xiao, Xinyan, Liu, Jiachen, Wu, Hua, Wang, Haifeng, Du, Junping

论文摘要

捕获文本单元之间关系的图表有很大的好处，可以从多个文档中检测显着信息并产生整体连贯的摘要。在本文中，我们开发了一个神经抽象的多文章摘要（MDS）模型，该模型可以利用文档的众所周知的图形表示，例如相似性图和话语图，以更有效地处理多个输入文档并产生抽象性摘要。我们的模型利用图表编码文档以捕获跨文档关系，这对于总结长文档至关重要。我们的模型还可以利用图形来指导摘要生成过程，这对于生成相干和简洁的摘要非常有益。此外，预训练的语言模型可以很容易地与我们的模型结合在一起，从而进一步改善了汇总性能。 WikiSum和Multinews数据集的经验结果表明，所提出的体系结构对几个强大的基准进行了重大改进。

Graphs that capture relations between textual units have great benefits for detecting salient information from multiple documents and generating overall coherent summaries. In this paper, we develop a neural abstractive multi-document summarization (MDS) model which can leverage well-known graph representations of documents such as similarity graph and discourse graph, to more effectively process multiple input documents and produce abstractive summaries. Our model utilizes graphs to encode documents in order to capture cross-document relations, which is crucial to summarizing long documents. Our model can also take advantage of graphs to guide the summary generation process, which is beneficial for generating coherent and concise summaries. Furthermore, pre-trained language models can be easily combined with our model, which further improve the summarization performance significantly. Empirical results on the WikiSum and MultiNews dataset show that the proposed architecture brings substantial improvements over several strong baselines.

下载PDF全文

下载文献需遵守相关版权规定

论文标题