论文标题
用于聚类签名网络的正则光谱方法
Regularized spectral methods for clustering signed networks
论文作者
论文摘要
我们研究了$ k $ - 道路聚类的问题。近年来,大量关注已致力于分析和建模符号图,其中节点之间的亲和力度量为正值或负值。最近,Cucuringu等。 [CDGT 2019]提出了一种光谱方法,即海绵(在负面的全本特征上签名为正面),该方法将聚类任务铸造为普遍的特征值问题,以优化适当定义的目标函数。这种方法是由社会平衡理论激励的,在这种理论中,聚类任务旨在将给定的网络分解为不相交的组,以便同一组中的个体通过尽可能多的积极边缘连接,而来自不同组的个人则主要通过负边缘连接。通过广泛的数值模拟,海绵被证明可以实现最先进的经验性能。在理论方面,[CDGT 2019]在签名的随机块模型(SSBM)的设置下分析了海绵和流行的签名的Laplacian方法,$ k = 2 $相等大小的群集,在该图中的制度中,该制度是中等密度的。 在这项工作中,我们基于[CDGT 2019]的结果在两个正面的海绵和签名的Laplacian的两个方面。首先,对于这两种算法,我们在[CDGT 2019]中将理论分析扩展到$ k \ geq 2 $不等大小的群集的一般环境中适度密集的制度。其次,我们介绍了两种方法的正则版本来处理稀疏图(标准光谱方法表现不佳的制度),并在相同的SSBM模型下提供理论保证。据我们所知,迄今为止,在聚类签名的图中尚未考虑正则光谱方法。我们通过有关合成数据的广泛数值实验来补充理论结果。
We study the problem of $k$-way clustering in signed graphs. Considerable attention in recent years has been devoted to analyzing and modeling signed graphs, where the affinity measure between nodes takes either positive or negative values. Recently, Cucuringu et al. [CDGT 2019] proposed a spectral method, namely SPONGE (Signed Positive over Negative Generalized Eigenproblem), which casts the clustering task as a generalized eigenvalue problem optimizing a suitably defined objective function. This approach is motivated by social balance theory, where the clustering task aims to decompose a given network into disjoint groups, such that individuals within the same group are connected by as many positive edges as possible, while individuals from different groups are mainly connected by negative edges. Through extensive numerical simulations, SPONGE was shown to achieve state-of-the-art empirical performance. On the theoretical front, [CDGT 2019] analyzed SPONGE and the popular Signed Laplacian method under the setting of a Signed Stochastic Block Model (SSBM), for $k=2$ equal-sized clusters, in the regime where the graph is moderately dense. In this work, we build on the results in [CDGT 2019] on two fronts for the normalized versions of SPONGE and the Signed Laplacian. Firstly, for both algorithms, we extend the theoretical analysis in [CDGT 2019] to the general setting of $k \geq 2$ unequal-sized clusters in the moderately dense regime. Secondly, we introduce regularized versions of both methods to handle sparse graphs -- a regime where standard spectral methods underperform -- and provide theoretical guarantees under the same SSBM model. To the best of our knowledge, regularized spectral methods have so far not been considered in the setting of clustering signed graphs. We complement our theoretical results with an extensive set of numerical experiments on synthetic data.