Research Collection School Of Computing and Information Systems

Generative semi-supervised graph anomaly detection

Hezhe QIAO
Qingsong WEN
Xiaoli LI
Ee-peng LIM, Singapore Management UniversityFollow
Guansong PANG, Singapore Management UniversityFollow

Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

12-2024

Abstract

This work considers a practical semi-supervised graph anomaly detection (GAD) scenario, where part of the nodes in a graph are known to be normal, contrasting to the extensively explored unsupervised setting with a fully unlabeled graph. We reveal that having access to the normal nodes, even just a small percentage of normal nodes, helps enhance the detection performance of existing unsupervised GAD methods when they are adapted to the semi-supervised setting. However, their utilization of these normal nodes is limited. In this paper we propose a novel Generative GAD approach (namely GGAD) for the semi-supervised scenario to better exploit the normal nodes. The key idea is to generate pseudo anomaly nodes, referred to as outlier nodes, for providing effective negative node samples in training a discriminative one-class classifier. The main challenge here lies in the lack of ground truth information about real anomaly nodes. To address this challenge, GGAD is designed to leverage two important priors about the anomaly nodes – asymmetric local affinity and egocentric closeness – to generate reliable outlier nodes that assimilate anomaly nodes in both graph structure and feature representations. Comprehensive experiments on six real-world GAD datasets are performed to establish a benchmark for semi-supervised GAD and show that GGAD substantially outperforms state-of-the-art unsupervised and semi-supervised GAD methods with varying numbers of training normal nodes.

Keywords

Graph anomaly detection, GAD, Generative GAD, Anomaly nodes

Discipline

Databases and Information Systems | Information Security

Research Areas

Data Science and Engineering; Intelligent Systems and Optimization

Areas of Excellence

Digital transformation

Publication

Proceedings of 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024) : Vancouver, Canada, December 10-15

First Page

Last Page

Identifier

10.48550/ARXIV.2402.11887

City or Country

Vancouver

Citation

QIAO, Hezhe; WEN, Qingsong; LI, Xiaoli; LIM, Ee-peng; and PANG, Guansong. Generative semi-supervised graph anomaly detection. (2024). Proceedings of 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024) : Vancouver, Canada, December 10-15. 1-29.
Available at: https://ink.library.smu.edu.sg/sis_research/9763

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.48550/ARXIV.2402.11887

Download

Included in

Databases and Information Systems Commons, Information Security Commons

COinS

Research Collection School Of Computing and Information Systems

Generative semi-supervised graph anomaly detection

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Areas of Excellence

Publication

First Page

Last Page

Identifier

City or Country

Citation

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Generative semi-supervised graph anomaly detection

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Areas of Excellence

Publication

First Page

Last Page

Identifier

City or Country

Citation

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links