论文标题
自动识别由ISO 24617-2标准对话框ACT注释定义的通用交流功能
Automatic Recognition of the General-Purpose Communicative Functions defined by the ISO 24617-2 Standard for Dialog Act Annotation
论文作者
论文摘要
ISO 24617-2是对话ACT注释的标准,定义了一组层次结构化的通用交流功能。这些功能的自动识别虽然实际上没有探索,但与对话系统有关,因为它们提供了有关细分市场背后的意图以及应如何解释的线索。我们探讨了对话框中通用交流功能的识别,这是根据此标准注释的对话框的参考集。为此,我们建议对现有方法进行平面对话行为识别的改编,以使他们能够处理层次分类问题。更具体地说,我们建议使用具有层叠输出的层次结构网络和最大后验路径估计,以预测层次结构每个级别的沟通函数,保留路径中函数之间的依赖关系,并确定在哪个级别上停止。此外,由于对话框中的对话数量减少了,因此我们依靠转移学习过程来减少过度拟合并提高性能。我们的实验结果表明,分层方法的表现优于扁平的方法,并且其每个组件在识别通用交流功能方面都起着重要作用。
ISO 24617-2, the standard for dialog act annotation, defines a hierarchically organized set of general-purpose communicative functions. The automatic recognition of these functions, although practically unexplored, is relevant for a dialog system, since they provide cues regarding the intention behind the segments and how they should be interpreted. We explore the recognition of general-purpose communicative functions in the DialogBank, which is a reference set of dialogs annotated according to this standard. To do so, we propose adaptations of existing approaches to flat dialog act recognition that allow them to deal with the hierarchical classification problem. More specifically, we propose the use of a hierarchical network with cascading outputs and maximum a posteriori path estimation to predict the communicative function at each level of the hierarchy, preserve the dependencies between the functions in the path, and decide at which level to stop. Furthermore, since the amount of dialogs in the DialogBank is reduced, we rely on transfer learning processes to reduce overfitting and improve performance. The results of our experiments show that the hierarchical approach outperforms a flat one and that each of its components plays an important role towards the recognition of general-purpose communicative functions.