论文标题
由原型引导的释义多样化的以任务为导向的对话响应产生
Diversifying Task-oriented Dialogue Response Generation with Prototype Guided Paraphrasing
论文作者
论文摘要
以任务为导向的对话系统(TDS)中的对话响应生成(DRG)的现有方法可以分为两类:基于模板和基于语料库。前者提前准备了响应模板的集合,并用系统操作填充插槽,以在运行时产生系统响应。后者通过考虑系统操作来通过令牌产生系统响应令牌。尽管基于模板的DRG提供了高精度和高度可预测的响应,但与基于(神经)基于语料库的方法相比,它们通常缺乏产生多样和自然反应的词。相反,尽管基于语料库的DRG方法能够产生自然响应,但我们不能保证它们的精度或可预测性。此外,当今基于语料库的DRG方法产生的响应的多样性仍然有限。我们建议通过引入基于原型的,释义的神经网络(称为p2-net)来结合基于模板和基于语料库的DRG的优点,该网络旨在以精度和多样性来提高响应的质量。 P2-NET没有从头开始生成响应,而是通过释义基于模板的响应来生成系统响应。为了确保响应的精度,P2-NET学会了将响应分为其语义,上下文影响和释义噪声,并使语义在释义过程中保持不变。为了介绍多样性,P2-NET随机将先前的对话话语视为原型,然后该模型可以从中提取口语样式信息。我们在自动评估和人类评估的多沃兹数据集上进行了广泛的实验。结果表明,P2-NET在保留响应语义的同时,在多样性方面取得了重大改善。
Existing methods for Dialogue Response Generation (DRG) in Task-oriented Dialogue Systems (TDSs) can be grouped into two categories: template-based and corpus-based. The former prepare a collection of response templates in advance and fill the slots with system actions to produce system responses at runtime. The latter generate system responses token by token by taking system actions into account. While template-based DRG provides high precision and highly predictable responses, they usually lack in terms of generating diverse and natural responses when compared to (neural) corpus-based approaches. Conversely, while corpus-based DRG methods are able to generate natural responses, we cannot guarantee their precision or predictability. Moreover, the diversity of responses produced by today's corpus-based DRG methods is still limited. We propose to combine the merits of template-based and corpus-based DRGs by introducing a prototype-based, paraphrasing neural network, called P2-Net, which aims to enhance quality of the responses in terms of both precision and diversity. Instead of generating a response from scratch, P2-Net generates system responses by paraphrasing template-based responses. To guarantee the precision of responses, P2-Net learns to separate a response into its semantics, context influence, and paraphrasing noise, and to keep the semantics unchanged during paraphrasing. To introduce diversity, P2-Net randomly samples previous conversational utterances as prototypes, from which the model can then extract speaking style information. We conduct extensive experiments on the MultiWOZ dataset with both automatic and human evaluations. The results show that P2-Net achieves a significant improvement in diversity while preserving the semantics of responses.