Optimized Density Peaks Clustering Algorithm Based on Dissimilarity Measure

doi:10.13328/j.cnki.jos.005813


Volume 31, Issue 11, 2020, pp. 3321-3333


Author:
DING Shi-Fei
School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China; Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
XU Xiao
School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China
WANG Yan-Ru
School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China
Fund Project: National Natural Science Foundation of China (61672522, 61379101); National Program on Key Basic Research Project of China (973) (2013CB329502)


Abstract:

Clustering by fast search and find of density peaks (DPC) is an efficient algorithm that quickly finds cluster centers based on local density and relative distance. DPC uses a decision graph to identify density peaks as cluster centers. It does not require the number of clusters to be specified in advance, and it can find clusters of arbitrary shape. However, DPC has three shortcomings. First, local density and relative distance are computed from a similarity matrix built on a simple distance metric, so DPC performs poorly on complex datasets, especially those with uneven density and higher dimensionality. Second, the measure of local density is not unified: different definitions are used for different datasets. Third, the cutoff distance d_c is chosen from the global distribution of the data alone and ignores local information, so a change in d_c affects the clustering results, especially on small datasets. To address these shortcomings, this study proposes an optimized density peaks clustering algorithm based on a dissimilarity measure (DDPC). DDPC introduces a mass-based dissimilarity measure to compute the similarity matrix, derives each sample's k-nearest-neighbor information from the new similarity matrix, and then redefines local density in terms of that k-nearest-neighbor information. Experimental results show that DDPC outperforms the optimized FKNN-DPC and DPC-KNN clustering algorithms and performs satisfactorily on datasets with uneven density and higher dimensionality. In addition, the local density measure is unified, which avoids the influence of d_c on the clustering results in the traditional DPC algorithm.
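For reference, the baseline DPC quantities mentioned in the abstract can be sketched as follows. This is a minimal illustration of the original DPC definitions (a cutoff-kernel local density and the distance to the nearest point of higher density), not the DDPC method proposed in this paper, whose mass-based dissimilarity and k-nearest-neighbor density are not specified in the abstract. The function names `dpc_scores` and `pick_centers` are hypothetical, and ranking points by the product of density and relative distance is an assumed stand-in for reading the decision graph by eye.

```python
import math


def dpc_scores(points, d_c):
    """Return (rho, delta) per point under the baseline DPC definitions."""
    n = len(points)
    # Pairwise Euclidean distances: the plain distance metric that the
    # abstract identifies as a weakness of the baseline algorithm.
    dist = [[math.dist(p, q) for q in points] for p in points]

    # Local density: number of other points closer than the cutoff d_c.
    rho = [
        sum(1 for j in range(n) if j != i and dist[i][j] < d_c)
        for i in range(n)
    ]

    # Relative distance: distance to the nearest point of strictly higher
    # density. A point with no denser point (including ties for the top
    # density) is given its largest distance to any other point.
    delta = []
    for i in range(n):
        higher = [dist[i][j] for j in range(n) if rho[j] > rho[i]]
        delta.append(min(higher) if higher else max(dist[i]))
    return rho, delta


def pick_centers(rho, delta, k):
    """Assumed heuristic: take the k points with the largest rho * delta."""
    gamma = [r * d for r, d in zip(rho, delta)]
    order = sorted(range(len(gamma)), key=lambda i: gamma[i], reverse=True)
    return order[:k]


if __name__ == "__main__":
    # Two small, well-separated groups; the middle point of each is densest.
    pts = [(0, 0), (0, 1), (1, 0), (0.5, 0.5),
           (10, 10), (10, 11), (11, 10), (10.5, 10.5)]
    rho, delta = dpc_scores(pts, d_c=0.8)
    print(rho)
    print(pick_centers(rho, delta, k=2))
```

Note how every density value in the sketch depends directly on `d_c`: changing the cutoff changes which neighbors are counted, which is the sensitivity the abstract attributes to the traditional algorithm.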

Key words: density peaks clustering; local-density; decision graph; dissimilarity measure; uneven density

Citation:

DING Shi-Fei, XU Xiao, WANG Yan-Ru. Optimized Density Peaks Clustering Algorithm Based on Dissimilarity Measure. Journal of Software, 2020, 31(11): 3321-3333


History

Received: April 19, 2018
Revised: July 25, 2018
Online: November 07, 2020
Published: November 06, 2020
