Paper title
Estimation of the number of clusters on d-dimensional sphere
Authors
Abstract
Spherical data are data distributed on a sphere. Such data appear in various fields, including meteorology, biology, and natural language processing. However, methods for analyzing spherical data are not yet well developed. One important open problem is estimating the number of clusters in spherical data. To address this problem, I propose a new method called Spherical X-means (SX-means), which estimates the number of clusters on a d-dimensional sphere. SX-means is a model-based method that assumes the data are generated from a mixture of von Mises-Fisher distributions. This paper explains the proposed method and demonstrates its performance in estimating the number of clusters.
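For readers unfamiliar with the model named in the abstract, the standard von Mises-Fisher mixture density can be written as follows. This is a sketch of the textbook definition, not the paper's own notation: the symbols K, \pi_k, \mu_k, \kappa_k and the convention that the data lie on the unit sphere S^{d-1} in R^d are assumptions made here, and the paper may index the dimension differently.

```latex
% Assumed notation: x is a unit vector in R^d, K is the number of clusters,
% \pi_k are mixing weights, \mu_k are unit mean directions,
% \kappa_k > 0 are concentration parameters.
\begin{align}
  p(x) &= \sum_{k=1}^{K} \pi_k \, f(x \mid \mu_k, \kappa_k),
  \qquad \sum_{k=1}^{K} \pi_k = 1, \quad \pi_k \ge 0, \\
  f(x \mid \mu, \kappa) &= C_d(\kappa) \exp\!\left(\kappa \, \mu^{\top} x\right),
  \qquad \lVert x \rVert = \lVert \mu \rVert = 1, \\
  C_d(\kappa) &= \frac{\kappa^{d/2 - 1}}{(2\pi)^{d/2} \, I_{d/2 - 1}(\kappa)},
\end{align}
% I_\nu denotes the modified Bessel function of the first kind of order \nu.
```

Under this formulation, estimating the number of clusters amounts to choosing K, the number of mixture components.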