Publication Type
Journal Article
Version
acceptedVersion
Publication Date
1-2019
Abstract
Clustering is an important topic in machine learning and data mining. Recently, deep clustering, which learns feature representations for clustering tasks using deep neural networks, has attracted increasing attention for various clustering applications. Deep embedded clustering (DEC) is one of the state-of-theart deep clustering methods. However, DEC does not make use of prior knowledge to guide the learning process. In this paper, we propose a new scheme of semi-supervised deep embedded clustering (SDEC) to overcome this limitation. Concretely, SDEC learns feature representations that favor the clustering tasks and performs clustering assignments simultaneously. In contrast to DEC, SDEC incorporates pairwise constraints in the feature learning process such that data samples belonging to the same cluster are close to each other and data samples belonging to different clusters are far away from each other in the learned feature space. Extensive experiments on real benchmark data sets validate the effectiveness and robustness of the proposed method.
Keywords
Semi-supervised learning, Deep embedded clustering, Pairwise constraints
Discipline
Databases and Information Systems | Numerical Analysis and Scientific Computing
Research Areas
Data Science and Engineering
Publication
Neurocomputing
Volume
325
First Page
121
Last Page
130
ISSN
0925-2312
Identifier
10.1016/j.neucom.2018.10.016
Publisher
Elsevier
Citation
REN, Yazhou; HU, Kangrong; DAI, Xinyi; PAN, Lili; HOI, Steven C. H.; and XU, Zenglin.
Semi-supervised deep embedded clustering. (2019). Neurocomputing. 325, 121-130.
Available at: https://ink.library.smu.edu.sg/sis_research/4188
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1016/j.neucom.2018.10.016
Included in
Databases and Information Systems Commons, Numerical Analysis and Scientific Computing Commons