Publication Type
Journal Article
Version
acceptedVersion
Publication Date
2-2020
Abstract
Density peak clustering (DPC) is a recently developed density-based clustering algorithm that achieves competitive performance in a non-iterative manner. DPC is capable of effectively handling clusters with single density peak (single center), i.e., based on DPC’s hypothesis, one and only one data point is chosen as the center of any cluster. However, DPC may fail to identify clusters with multiple density peaks (multi-centers) and may not be able to identify natural clusters whose centers have relatively lower local density. To address these limitations, we propose a novel clustering algorithm based on a hierarchical approach, named multi-center density peak clustering (McDPC). Firstly, based on a widely adopted hypothesis that the potential cluster centers are relatively far away from each other. McDPC obtains centers of the initial micro-clusters (named representative data points) whose minimum distance to the other higher-density data points are relatively larger. Secondly, the representative data points are autonomously categorized into different density levels. Finally, McDPC deals with micro-clusters at each level and if necessary, merges the micro-clusters at a specific level into one cluster to identify multi-center clusters. To evaluate the effectiveness of our proposed McDPC algorithm, we conduct experiments on both synthetic and real-world datasets and benchmark the performance of McDPC against other state-of-the-art clustering algorithms. We also apply McDPC to perform image segmentation and facial recognition to further demonstrate its capability in dealing with real-world applications. The experimental results show that our method achieves promising performance.
Keywords
Density peak clustering, Multi-center cluster, Image segmentation
Discipline
Databases and Information Systems | Programming Languages and Compilers | Software Engineering
Research Areas
Data Science and Engineering
Publication
Neural Computing and Applications
First Page
1
Last Page
19
ISSN
0941-0643
Identifier
10.1007/s00521-020-04754-5
Publisher
Springer (part of Springer Nature): Springer Open Choice Hybrid Journals
Citation
WANG, Yizhang; WANG, Di; ZHANG, Xiaofeng; PANG, Wei; MIAO, Chunyan; TAN, Ah-hwee; and ZHOU, You.
McDPC: Multi‐center density peak clustering. (2020). Neural Computing and Applications. 1-19.
Available at: https://ink.library.smu.edu.sg/sis_research/5186
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1007/s00521-020-04754-5
Included in
Databases and Information Systems Commons, Programming Languages and Compilers Commons, Software Engineering Commons