Paper Title

Towards Structure-aware Paraphrase Identification with Phrase Alignment Using Sentence Encoders

Authors

Qiwei Peng, David Weir, Julie Weeds

Abstract

Previous works have demonstrated the effectiveness of utilising pre-trained sentence encoders based on their sentence representations for meaning comparison tasks. Though such representations are shown to capture hidden syntax structures, the direct similarity comparison between them exhibits weak sensitivity to word order and structural differences in given sentences. A single similarity score further makes the comparison process hard to interpret. Therefore, we here propose to combine sentence encoders with an alignment component by representing each sentence as a list of predicate-argument spans (where their span representations are derived from sentence encoders), and decomposing the sentence-level meaning comparison into the alignment between their spans for paraphrase identification tasks. Empirical results show that the alignment component brings in both improved performance and interpretability for various sentence encoders. After closer investigation, the proposed approach indicates increased sensitivity to structural difference and enhanced ability to distinguish non-paraphrases with high lexical overlap.
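The decomposition described in the abstract (sentence as a list of span representations, sentence comparison as span alignment) can be illustrated with a small sketch. The abstract does not say how spans are aligned or how aligned pairs are scored, so everything below is an assumption made for illustration: cosine similarity between span vectors, a best-match alignment in each direction, and a symmetric mean as the final score. The names `cosine`, `align_spans`, and `paraphrase_score` are hypothetical, and the toy vectors stand in for span representations that would come from a sentence encoder.

```python
import math
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def cosine(u: Vector, v: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def align_spans(
    spans_a: List[Vector], spans_b: List[Vector]
) -> List[Tuple[int, int, float]]:
    """Assumed alignment rule: pair each span in A with its most similar span in B.

    Returns (index_in_a, index_in_b, similarity) triples, which can be
    inspected to see which spans were matched.
    """
    alignment = []
    for i, u in enumerate(spans_a):
        scores = [cosine(u, v) for v in spans_b]
        j = max(range(len(scores)), key=scores.__getitem__)
        alignment.append((i, j, scores[j]))
    return alignment


def paraphrase_score(spans_a: List[Vector], spans_b: List[Vector]) -> float:
    """Assumed aggregation: mean best-match similarity, averaged over both directions."""
    if not spans_a or not spans_b:
        return 0.0
    a_to_b = [s for _, _, s in align_spans(spans_a, spans_b)]
    b_to_a = [s for _, _, s in align_spans(spans_b, spans_a)]
    return 0.5 * (sum(a_to_b) / len(a_to_b) + sum(b_to_a) / len(b_to_a))


# Toy span vectors (placeholders for encoder-derived span representations).
sentence_a = [[1.0, 0.0], [0.0, 1.0]]
sentence_b = [[0.0, 2.0], [3.0, 0.0]]

print(align_spans(sentence_a, sentence_b))
print(paraphrase_score(sentence_a, sentence_b))
```

Unlike a single sentence-level similarity score, the list returned by `align_spans` shows which span in one sentence was matched to which span in the other, which is the kind of interpretability the abstract attributes to the alignment component. A threshold on `paraphrase_score` would turn the score into a paraphrase decision; the threshold value is not given in the source and would have to be tuned.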
