论文标题
与上下文化的自我判断链接预测
Link Prediction with Contextualized Self-Supervision
论文作者
论文摘要
链接预测旨在推断网络/图表中的一对节点之间的链接存在。尽管应用了广泛的应用,但传统链接预测算法的成功受到了三个主要挑战(链接稀疏,节点属性噪声和动态变化)的阻碍,这些挑战被许多现实世界网络所面临。为了应对这些挑战,我们提出了一个上下文化的自我监督学习(CSSL)框架,该框架充分利用了链接预测的结构上下文预测。提出的CSSL框架学习了一个链接编码器,以从配对的节点嵌入中推断链接存在概率,这些嵌入是通过节点属性上的转换构建的。为了生成链接预测的信息节点嵌入,结构上下文预测被用作一项自我监管的学习任务,以提高链接预测性能。研究了两种类型的结构环境,即从随机步行和上下文子图收集的上下文节点。 CSSL框架可以以端到端的方式进行培训,并通过通过链接预测和自我监督的学习任务来监督模型参数的学习。所提出的CSSL是一个通用且灵活的框架,它可以同时处理属性和非属性网络,并且在换电和归纳性链接预测设置下运行。对七个现实世界基准网络进行的广泛实验和消融研究表明,在转化和感应性环境下,在不同类型的网络上,提出的基于自我实施的链接链路预测算法优于最先进的基线。拟议的CSSL还可以从大规模网络上的节点属性噪声和可扩展性方面产生竞争性能。
Link prediction aims to infer the link existence between pairs of nodes in networks/graphs. Despite their wide application, the success of traditional link prediction algorithms is hindered by three major challenges -- link sparsity, node attribute noise and dynamic changes -- that are faced by many real-world networks. To address these challenges, we propose a Contextualized Self-Supervised Learning (CSSL) framework that fully exploits structural context prediction for link prediction. The proposed CSSL framework learns a link encoder to infer the link existence probability from paired node embeddings, which are constructed via a transformation on node attributes. To generate informative node embeddings for link prediction, structural context prediction is leveraged as a self-supervised learning task to boost the link prediction performance. Two types of structural context are investigated, i.e., context nodes collected from random walks vs. context subgraphs. The CSSL framework can be trained in an end-to-end manner, with the learning of model parameters supervised by both the link prediction and self-supervised learning tasks. The proposed CSSL is a generic and flexible framework in the sense that it can handle both attributed and non-attributed networks, and operate under both transductive and inductive link prediction settings. Extensive experiments and ablation studies on seven real-world benchmark networks demonstrate the superior performance of the proposed self-supervision based link prediction algorithm over state-of-the-art baselines, on different types of networks under both transductive and inductive settings. The proposed CSSL also yields competitive performance in terms of its robustness to node attribute noise and scalability over large-scale networks.