Density Peak Clustering Algorithm Based on K-nearest Neighbors and Optimized Allocation Strategy

doi:10.13328/j.cnki.jos.006462


Journal of Software, Volume 33, Issue 4, 2022, pp. 1390-1411.


Author:
SUN Lin
College of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, China; Key Laboratory of Artificial Intelligence and Personalized Learning in Education of Henan Province, Xinxiang 453007, China
QIN Xiao-Ying
College of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, China
XU Jiu-Cheng
College of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, China
XUE Zhan-Ao
College of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, China

Abstract:

The density peak clustering (DPC) algorithm is a simple and effective clustering algorithm. In practical applications, however, DPC has difficulty selecting the correct cluster centers for datasets with large density differences among clusters or with multiple density peaks within a cluster. Furthermore, the point allocation method of DPC suffers from a domino effect: once a point is assigned to the wrong cluster, the lower-density points that follow it inherit the error. To address these issues, a density peak clustering algorithm based on K-nearest neighbors (KNN) and an optimized allocation strategy was proposed. First, the candidate cluster centers were determined using the KNN, the densities of points, and the boundary points. A path distance was defined to reflect the similarity between candidate cluster centers; based on it, a density factor and a distance factor were proposed to quantify how likely each candidate is to be a cluster center, and the cluster centers were then determined. Second, to improve the allocation precision of points, similarity measures were constructed from the shared nearest neighbors, the high-density nearest neighbor, the density difference, and the distance between KNN, and the concepts of neighborhood, similarity set, and similarity domain were proposed to assist in the allocation of points. The initial clustering results were determined according to the similarity domains and boundary points, and the intermediate clustering results were then obtained based on the cluster centers. Finally, according to the intermediate clustering results and the similarity set, the clusters were divided into multiple layers from the cluster centers to the cluster boundaries, and an allocation strategy was designed for each layer. To determine the allocation order of points within a specific layer, the positive value was presented based on the similarity domain and the positive domain, and each point was allocated to the dominant cluster in its positive domain. Thus, the final clustering results were obtained. The experimental results on 11 synthetic datasets and 27 real datasets demonstrate that the proposed algorithm achieves sound clustering performance in terms of purity, F-measure, accuracy, Rand index, adjusted Rand index, and normalized mutual information when compared with state-of-the-art DPC algorithms.
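
For context, the baseline DPC procedure that the abstract takes as its starting point can be sketched as follows. This is a minimal Python illustration of the standard density-peak scheme, not the algorithm proposed in the paper: the function name `dpc_baseline`, the parameter `k`, and the particular KNN-based density formula are assumptions made for the example. The final loop is where the domino effect arises, because each point simply copies the label of its nearest denser neighbor.

```python
import numpy as np

def dpc_baseline(X, n_clusters, k=5):
    """Baseline DPC sketch (illustrative only; NOT the paper's method)."""
    n = len(X)
    # Pairwise Euclidean distances between all points.
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)

    # Assumed KNN-based local density: a small mean distance to the
    # k nearest neighbors gives a high density value.
    knn_dist = np.sort(dist, axis=1)[:, 1:k + 1]  # column 0 is the point itself
    rho = np.exp(-knn_dist.mean(axis=1))

    # delta: distance to the nearest point of higher density.
    # parent: index of that point, reused in the allocation step.
    order = np.argsort(-rho, kind="stable")  # densest point first
    delta = np.zeros(n)
    parent = np.full(n, -1)
    delta[order[0]] = dist[order[0]].max()  # convention for the densest point
    for rank in range(1, n):
        i = order[rank]
        higher = order[:rank]
        j = higher[np.argmin(dist[i, higher])]
        delta[i], parent[i] = dist[i, j], j

    # Cluster centers: points with the largest gamma = rho * delta.
    gamma = rho * delta
    gamma[order[0]] = np.inf  # the densest point has no parent, so keep it a center
    centers = np.argsort(-gamma)[:n_clusters]
    labels = np.full(n, -1)
    labels[centers] = np.arange(n_clusters)

    # Allocation in decreasing density order: each remaining point inherits
    # the label of its parent, so one wrong label propagates downstream.
    for i in order:
        if labels[i] == -1:
            labels[i] = labels[parent[i]]
    return labels, centers
```

Calling `dpc_baseline(X, n_clusters=2)` on an `(n, 2)` array returns one label per point and the indices of the chosen centers. The paper's contributions, as described in the abstract, replace both the center selection and this one-step allocation with more elaborate strategies.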

Key words: density peak clustering; K-nearest neighbors (KNN); cluster center; positive value; allocation strategy

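Citation:

SUN Lin, QIN Xiao-Ying, XU Jiu-Cheng, XUE Zhan-Ao. Density peak clustering algorithm based on K-nearest neighbors and optimized allocation strategy. Journal of Software, 2022, 33(4): 1390-1411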


History

Received: January 10, 2021
Revised: July 16, 2021
Online: October 26, 2021
Published: April 06, 2022

