论文标题
Deim:符合句子的有效的深层编码和交互模型
DEIM: An effective deep encoding and interaction model for sentence matching
论文作者
论文摘要
自然语言句子匹配是比较两个句子并确定它们之间的关系的任务。它在自然语言处理任务中具有广泛的应用,例如阅读理解,问答系统。主要方法是通过注意机制来计算文本表示和句子对之间的相互作用,该机制可以很好地提取句子对之间的语义信息。但是,这种方法在处理复杂的语义特征时无法获得令人满意的结果。为了解决此问题,我们建议基于深度编码和交互的句子匹配方法,以提取深层的语义信息。在编码器层中,我们在编码单个句子的过程中参考另一句话的信息,然后使用启发式算法来融合信息。在相互作用层中,我们使用双向注意机制和自我发项机制来获得深度的语义信息。在本文中,我们执行合并操作并将其输入MLP进行分类。我们在三个任务上评估了我们的模型:认识文本需要,释义识别和答案选择。我们在SNLI和Scitail数据集上进行了实验,以识别文本需要任务,释义识别任务的Quora数据集以及用于答案选择任务的Wikiqa数据集。实验结果表明,所提出的算法可以有效提取深层的语义特征,以验证该算法对句子匹配任务的有效性。
Natural language sentence matching is the task of comparing two sentences and identifying the relationship between them.It has a wide range of applications in natural language processing tasks such as reading comprehension, question and answer systems. The main approach is to compute the interaction between text representations and sentence pairs through an attention mechanism, which can extract the semantic information between sentence pairs well. However,this kind of method can not gain satisfactory results when dealing with complex semantic features. To solve this problem, we propose a sentence matching method based on deep encoding and interaction to extract deep semantic information. In the encoder layer,we refer to the information of another sentence in the process of encoding a single sentence, and later use a heuristic algorithm to fuse the information. In the interaction layer, we use a bidirectional attention mechanism and a self-attention mechanism to obtain deep semantic information.Finally, we perform a pooling operation and input it to the MLP for classification. we evaluate our model on three tasks: recognizing textual entailment, paraphrase recognition, and answer selection. We conducted experiments on the SNLI and SciTail datasets for the recognizing textual entailment task, the Quora dataset for the paraphrase recognition task, and the WikiQA dataset for the answer selection task. The experimental results show that the proposed algorithm can effectively extract deep semantic features that verify the effectiveness of the algorithm on sentence matching tasks.