Spectral Clustering Algorithm Based on Adaptive Nyström Sampling for Big Data Analysis

doi:10.13328/j.cnki.jos.004643

微信服务号

微信订阅号

2025-5-15- 18

Home > Archive>Volume 25, Issue 9, 2014 >2037-2049. DOI:10.13328/j.cnki.jos.004643

PDF HTML XML Export Cite reminder

Spectral Clustering Algorithm Based on Adaptive Nyström Sampling for Big Data Analysis
DOI:
                        10.13328/j.cnki.jos.004643
                    
Author:
                        DING Shi-FeiDING Shi-Fei
School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China;Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, The Chinese Academy of Sciences, Beijing 100190, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
JIA Hong-JieJIA Hong-Jie
School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China;Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, The Chinese Academy of Sciences, Beijing 100190, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
SHI Zhong-ZhiSHI Zhong-Zhi
Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, The Chinese Academy of Sciences, Beijing 100190, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Spectral clustering is a flexible and effective clustering method for complex structure data sets. It is based on spectral graph theory and can produce satisfactory clustering results by mapping the data points into a low-dimensional space constituted by eigenvectors so that the data structure is optimized. But in the process of spectral clustering, the computational complexity of eigen-decomposition is usually O(n³), which limits the application of spectral clustering algorithm in big data problems. Nyström extension method uses partial points sampled from the data set and approximate calculation to simulate the real eigenspace. In this way, the computational complexity can be effectively reduced, which provides a new idea for big data spectral clustering algorithm. The selection of sampling strategy is essential for Nyström extension technology. In this paper, the design of an adaptive Nyström sampling method is presented. The sampling probability of every data point will be updated after each sampling pass, and a proof is given that the sampling error will decrease exponentially with the increase of sample times. Based on the adaptive Nyström sampling method, a spectral clustering algorithm for big data analysis is presented, and its feasibility and effectiveness is verified by experiments.

Key words:big data;spectral clustering;eigen-decomposition;Nyströ;m extension;adaptive sampling

Get Citation

丁世飞,贾洪杰,史忠植.基于自适应Nyström采样的大数据谱聚类算法.软件学报,2014,25(9):2037-2049

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:April 07,2014
Revised:May 14,2014
Adopted:
Online: September 09,2014
Published:

You are the first2044694Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History