Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
11-2015
Abstract
We introduce the concept of Least Similar Nearest Neighbours (LeSiNN) and use LeSiNN to detect anomalies directly. Although there is an existing method which is a special case of LeSiNN, this paper is the first to clearly articulate the underlying concept, as far as we know. LeSiNN is the first ensemble method which works well with models trained using samples of one instance. LeSiNN has linear time complexity with respect to data size and the number of dimensions, and it is one of the few anomaly detectors which can apply directly to both numeric and categorical data sets. Our extensive empirical evaluation shows that LeSiNN is either competitive to or better than six state-of-the-art anomaly detectors in terms of detection accuracy and runtime.
Keywords
Least Similar Nearest Neighbours, kNN, Anomaly Detection, Ensemble
Discipline
Databases and Information Systems | Data Storage Systems
Research Areas
Data Science and Engineering
Publication
Proceedings of the 2015 IEEE International Conference on Data Mining Workshop (ICDMW), Atlantic City, New Jersey, November 14-17
First Page
1
Last Page
8
ISBN
2375-9259
Identifier
10.1109/ICDMW.2015.62
Publisher
{IEEE} Computer Society
City or Country
Atlantic City
Citation
PANG, Guansong; TING, Kai Ming; and ALBRECHT, David.
LeSiNN: Detecting anomalies by Identifying least similar nearest neighbours. (2015). Proceedings of the 2015 IEEE International Conference on Data Mining Workshop (ICDMW), Atlantic City, New Jersey, November 14-17. 1-8.
Available at: https://ink.library.smu.edu.sg/sis_research/7147
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.