Research Collection School Of Computing and Information Systems

Learning feature inversion for multi-class anomaly detection under general-purpose COCO-AD benchmark

Publication Type

Journal Article

Version

acceptedVersion

Publication Date

4-2026

Abstract

Anomaly detection (AD) is often focused on detecting anomaly areas for industrial quality inspection and medical lesion examination. However, due to the specific scenario targets, the data scale for AD is relatively small, and evaluation metrics are still deficient compared to classic vision tasks, such as object detection and semantic segmentation. To fill these gaps, this work first constructs a large-scale and general-purpose COCO-AD dataset by extending COCO to the AD field. This enables fair evaluation and sustainable development for different methods on this challenging benchmark. Moreover, current metrics such as AU-ROC have nearly reached saturation on simple datasets, which prevents a comprehensive evaluation of different methods. Inspired by the metrics in the segmentation field, we further propose several more practical threshold-dependent AD-specific metrics, i.e., mF1 .2 .8 , mAcc.2 .8 , mIoU.2 .8 , and mIoU-max. Motivated by GAN inversion’s high-quality reconstruction capability, we propose a simple but more powerful InvAD framework to achieve high-quality feature reconstruction. Our method improves the effectiveness of reconstruction-based methods on popular MVTec AD, VisA, and our newly proposed COCO-AD datasets under a multi-class unsupervised setting, where only a single detection model is trained to detect anomalies from different classes. Extensive ablation experiments have demonstrated the effectiveness of each component of our InvAD. Full codes and models are available at https://github.com/zhangzjn/ader.

Discipline

Computer Sciences | Graphics and Human Computer Interfaces

Research Areas

Intelligent Systems and Optimization

Publication

International Journal of Computer Vision

Volume

134

Issue

First Page

Last Page

ISSN

0920-5691

Identifier

10.1007/s11263-026-02809-z

Publisher

Springer

Citation

ZHANG, Jiangning; WANG, Chengjie; LI, Xiangtai; TIAN, Guanzhong; XUE, Zhucun; LIU, Yong; PANG, Guansong; and TAO, Dacheng. Learning feature inversion for multi-class anomaly detection under general-purpose COCO-AD benchmark. (2026). International Journal of Computer Vision. 134, (5), 1-23.
Available at: https://ink.library.smu.edu.sg/sis_research/11094

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.1007/s11263-026-02809-z

Download

Included in

Graphics and Human Computer Interfaces Commons

COinS

Research Collection School Of Computing and Information Systems

Learning feature inversion for multi-class anomaly detection under general-purpose COCO-AD benchmark

Publication Type

Version

Publication Date

Abstract

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Learning feature inversion for multi-class anomaly detection under general-purpose COCO-AD benchmark

Author

Publication Type

Version

Publication Date

Abstract

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links