论文标题

聚类合奏符合低级张量近似

Clustering Ensemble Meets Low-rank Tensor Approximation

论文作者

Jia, Yuheng, Liu, Hui, Hou, Junhui, Zhang, Qingfu

论文摘要

本文探讨了聚类集合的问题,该集合集合旨在结合多个基础聚类以产生比单个单个的绩效更好的性能。现有的聚类集合方法通常构建共同关联矩阵,这表明样品之间的成对相似性,因为来自不同基础群集的结缔组织的加权线性组合,然后将所得的共同关联矩阵作为临时聚类algorith algorithm的输入,例如,例如。但是,共同关联矩阵可能以较差的基础聚类为主,导致性能较低。在本文中,我们提出了一种新型的低量张量近似方法,以从全球角度解决该问题。具体而言,通过检查是否将两个样品聚集到不同基础聚类下的相同群集,我们得出了一个连贯的链接矩阵,该矩阵包含样本之间有限但高度可靠的关系。然后,我们将相干链接矩阵和共同关联矩阵堆叠以形成三维张量,该张量是进一步探索的低级别属性,以传播与共同相关矩阵的连贯链接矩阵的信息,从而产生了精制的共同相结合矩阵。我们将提出的方法提出为凸的约束优化问题,并有效地解决了它。实验结果超过7个基准数据集表明,与12种最先进的方法相比,所提出的模型在聚类性能方面取得了突破。据我们所知,这是探索低排名张量在聚类合奏中的潜力的第一项工作,这与以前的方法根本不同。

This paper explores the problem of clustering ensemble, which aims to combine multiple base clusterings to produce better performance than that of the individual one. The existing clustering ensemble methods generally construct a co-association matrix, which indicates the pairwise similarity between samples, as the weighted linear combination of the connective matrices from different base clusterings, and the resulting co-association matrix is then adopted as the input of an off-the-shelf clustering algorithm, e.g., spectral clustering. However, the co-association matrix may be dominated by poor base clusterings, resulting in inferior performance. In this paper, we propose a novel low-rank tensor approximation-based method to solve the problem from a global perspective. Specifically, by inspecting whether two samples are clustered to an identical cluster under different base clusterings, we derive a coherent-link matrix, which contains limited but highly reliable relationships between samples. We then stack the coherent-link matrix and the co-association matrix to form a three-dimensional tensor, the low-rankness property of which is further explored to propagate the information of the coherent-link matrix to the co-association matrix, producing a refined co-association matrix. We formulate the proposed method as a convex constrained optimization problem and solve it efficiently. Experimental results over 7 benchmark data sets show that the proposed model achieves a breakthrough in clustering performance, compared with 12 state-of-the-art methods. To the best of our knowledge, this is the first work to explore the potential of low-rank tensor on clustering ensemble, which is fundamentally different from previous approaches.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源