论文标题
提出的评估群集异质性的方法
A Proposed Method for Assessing Cluster Heterogeneity
论文作者
论文摘要
评估足够的群集适合数据集并找到最佳数量的簇是一个困难的过程。建议成员矩阵和成员矩阵的程度来确定群集拟合的均匀性。还建议最大化群集数量滞后1的成员资格的比率,以优化数据集中的簇数量。还建议对均匀群集的成员资格程度的阈值因素。给出了群集模拟,以比较提出的方法与已建立的方法的比较程度。该方法可以应用于层次结构和K-均值聚类的输出。
Assessing how adequate clusters fit a dataset and finding an optimum number of clusters is a difficult process. A membership matrix and the degree of membership matrix is suggested to determine the homogeneity of a cluster fit. Maximisation of the ratio of the overall degree of membership at cluster number lag 1 is also suggested as a method to optimise the number of clusters in a dataset. A threshold factor upon the degree of membership is also suggested for homogeneous clusters. Cluster simulations were given to compare how well the proposed method compares against established methods. This method may be applied to the output of both hierarchical and k-means clustering.