论文标题
在程度校正的随机块模型下的球形坐标上的光谱聚类
Spectral clustering on spherical coordinates under the degree-corrected stochastic blockmodel
论文作者
论文摘要
光谱聚类是网络图中社区检测的一种流行方法:从图表的矩阵表示开始,节点被聚集在从矩阵的截断光谱分解中获得的低维投影。正确估计社区数量和减少潜在空间的维度对于光谱聚类算法的良好性能至关重要。此外,许多现实世界图,例如在网络安全应用程序中研究的企业计算机网络,通常显示出共享性内分布的异质性。标准光谱聚类算法通常无法很好地捕获这种异质度分布。在本文中,提出了一种新型的光谱聚类算法,用于在程度校正的随机块模型下进行社区检测。所提出的方法基于光谱嵌入到球形坐标的转换,以及在转化空间中的新型建模假设。该方法允许对社区数量的数量同时选择和自动选择,以及具有不平衡节点度的图形嵌入的频谱嵌入的潜在维度。结果表明,在表示计算机网络时的竞争方法上的性能提高了。
Spectral clustering is a popular method for community detection in network graphs: starting from a matrix representation of the graph, the nodes are clustered on a low dimensional projection obtained from a truncated spectral decomposition of the matrix. Estimating correctly the number of communities and the dimension of the reduced latent space is critical for good performance of spectral clustering algorithms. Furthermore, many real-world graphs, such as enterprise computer networks studied in cyber-security applications, often display heterogeneous within-community degree distributions. Such heterogeneous degree distributions are usually not well captured by standard spectral clustering algorithms. In this article, a novel spectral clustering algorithm is proposed for community detection under the degree-corrected stochastic blockmodel. The proposed method is based on a transformation of the spectral embedding to spherical coordinates, and a novel modelling assumption in the transformed space. The method allows for simultaneous and automated selection of the number of communities and the latent dimension for spectral embeddings of graphs with uneven node degrees. Results show improved performance over competing methods in representing computer networks.