具有反事实推理和对抗性偏见学习的辩护立场检测模型

论文标题

具有反事实推理和对抗性偏见学习的辩护立场检测模型

Debiasing Stance Detection Models with Counterfactual Reasoning and Adversarial Bias Learning

论文作者

Yuan, Jianhua, Zhao, Yanyan, Qin, Bing

论文摘要

立场检测模型可能倾向于将文本部分中的数据集偏差视为捷径，因此无法充分学习目标和文本之间的相互作用。最新的辩论方法通常以较早的步骤将小型模型或大型模型学到的特征作为偏见特征，并提议排除在推断期间学习这些偏见特征的分支。但是，这些方法中的大多数都无法解散文本部分中``好'''姿势特征和``不良''偏差特征。在本文中，我们研究了如何减轻立场检测中的数据集偏差。受因果影响的促进，我们利用了一种新颖的反事实推理框架，这使我们能够将文本部分中的数据集偏差捕获为文本对立场的直接因果效应，并通过从总因果效应中减去直接文本效应来减少文本部分中的数据集偏差。我们将偏差的特征模拟与立场标签相关，但在中间立场推理子任务子任务中失败，并提出了一个对抗性偏见学习模块，以更准确地对偏差进行建模。为了验证我们的模型是否可以更好地模拟文本和目标之间的相互作用，我们在最近提出的测试集上测试了我们的模型，以评估各个方面对任务的理解。实验表明，我们提出的方法（1）可以更好地对偏差特征进行建模，并且（2）在原始数据集和大多数新建的测试集上都优于现有的偏见基线。

Stance detection models may tend to rely on dataset bias in the text part as a shortcut and thus fail to sufficiently learn the interaction between the targets and texts. Recent debiasing methods usually treated features learned by small models or big models at earlier steps as bias features and proposed to exclude the branch learning those bias features during inference. However, most of these methods fail to disentangle the ``good'' stance features and ``bad'' bias features in the text part. In this paper, we investigate how to mitigate dataset bias in stance detection. Motivated by causal effects, we leverage a novel counterfactual inference framework, which enables us to capture the dataset bias in the text part as the direct causal effect of the text on stances and reduce the dataset bias in the text part by subtracting the direct text effect from the total causal effect. We novelly model bias features as features that correlate with the stance labels but fail on intermediate stance reasoning subtasks and propose an adversarial bias learning module to model the bias more accurately. To verify whether our model could better model the interaction between texts and targets, we test our model on recently proposed test sets to evaluate the understanding of the task from various aspects. Experiments demonstrate that our proposed method (1) could better model the bias features, and (2) outperforms existing debiasing baselines on both the original dataset and most of the newly constructed test sets.

下载PDF全文

下载文献需遵守相关版权规定

论文标题