论文标题
用于尖峰分类的特征提取和聚类的统一模型
A Unified Model of Feature Extraction and Clustering for Spike Sorting
论文作者
论文摘要
Spike分类在理解脑代码中起不可替代的作用。传统的尖峰排序技术在峰值良好后分别执行特征提取和聚类。但是,它通常可能导致许多其他过程,并进一步导致较低和/或不稳定的结果,尤其是在数据集中有噪音和/或重叠的峰值时。为了解决这些问题,在本文中,我们提出了一个统一的优化模型,该模型集成了特征提取和用于尖峰排序的聚类。有趣的是,我们将用于峰值特征提取的主成分分析(PCA)进行,而不是对峰值进行集群的主要组件分析(PCA)进行聚类,而是将PCA和KM统一为一个优化模型,从而减少了其他迭代时间的其他过程。随后,通过嵌入k-means ++策略以初始化和求解过程中的比较更新规则,提出的模型可以很好地处理噪音和/或重叠的干扰。最后,将最佳的聚类有效性指数纳入了建议的模型中,我们得出了一种自动尖峰分类方法。关于合成和现实世界数据集的大量实验结果证实,我们提出的方法的表现优于相关的最新方法。
Spike sorting plays an irreplaceable role in understanding brain codes. Traditional spike sorting technologies perform feature extraction and clustering separately after spikes are well detected. However, it may often cause many additional processes and further lead to low-accurate and/or unstable results especially when there are noises and/or overlapping spikes in datasets. To address these issues, in this paper, we proposed a unified optimisation model integrating feature extraction and clustering for spike sorting. Interestingly, instead of the widely used combination strategies, i.e., performing the principal component analysis (PCA) for spike feature extraction and K-means (KM) for clustering in sequence, we unified PCA and KM into one optimisation model, which reduces additional processes with fewer iteration times. Subsequently, by embedding the K-means++ strategy for initialising and a comparison updating rule in the solving process, the proposed model can well handle the noises and/or overlapping interference. Finally, taking the best of the clustering validity indices into the proposed model, we derive an automatic spike sorting method. Plenty of experimental results on both synthetic and real-world datasets confirm that our proposed method outperforms the related state-of-the-art approaches.