评估可信赖链接预测的知识图嵌入的校准

论文标题

评估可信赖链接预测的知识图嵌入的校准

Evaluating the Calibration of Knowledge Graph Embeddings for Trustworthy Link Prediction

论文作者

Safavi, Tara, Koutra, Danai, Meij, Edgar

论文摘要

关于知识图嵌入（KGE）模型做出的预测的可信度知之甚少。在本文中，我们通过研究KGE模型的校准或输出置信得分的程度来朝着这一方向迈出初步步骤，以反映预测知识图三元的预期正确性。我们首先根据标准的封闭世界假设（CWA）进行评估，其中知识图中尚未预测的三倍被认为是错误的，并表明在这个常见但狭窄的假设下，现有的校准技术对KGE有效。接下来，我们介绍了更现实但具有挑战性的开放世界假设（OWA），在获得地面真相标签之前，未观察到的预测不被视为真或错误。在这里，我们表明，现有的校准技术在OWA下的效率要比CWA少得多，并提供了这种差异的解释。最后，为了从从业者的角度来激励KGE的校准实用性，我们对人类协作进行了独特的案例研究，表明校准的预测可以改善知识图完成任务中的人类绩效。

Little is known about the trustworthiness of predictions made by knowledge graph embedding (KGE) models. In this paper we take initial steps toward this direction by investigating the calibration of KGE models, or the extent to which they output confidence scores that reflect the expected correctness of predicted knowledge graph triples. We first conduct an evaluation under the standard closed-world assumption (CWA), in which predicted triples not already in the knowledge graph are considered false, and show that existing calibration techniques are effective for KGE under this common but narrow assumption. Next, we introduce the more realistic but challenging open-world assumption (OWA), in which unobserved predictions are not considered true or false until ground-truth labels are obtained. Here, we show that existing calibration techniques are much less effective under the OWA than the CWA, and provide explanations for this discrepancy. Finally, to motivate the utility of calibration for KGE from a practitioner's perspective, we conduct a unique case study of human-AI collaboration, showing that calibrated predictions can improve human performance in a knowledge graph completion task.

下载PDF全文

下载文献需遵守相关版权规定

论文标题