Learning feature inversion for multi-class anomaly detection under general-purpose COCO-AD benchmark
Publication Type
Journal Article
Version
acceptedVersion
Publication Date
4-2026
Abstract
Anomaly detection (AD) is often focused on detecting anomaly areas for industrial quality inspection and medical lesion examination. However, due to the specific scenario targets, the data scale for AD is relatively small, and evaluation metrics are still deficient compared to classic vision tasks, such as object detection and semantic segmentation. To fill these gaps, this work first constructs a large-scale and general-purpose COCO-AD dataset by extending COCO to the AD field. This enables fair evaluation and sustainable development for different methods on this challenging benchmark. Moreover, current metrics such as AU-ROC have nearly reached saturation on simple datasets, which prevents a comprehensive evaluation of different methods. Inspired by the metrics in the segmentation field, we further propose several more practical threshold-dependent AD-specific metrics, i.e., mF1 .2 .8 , mAcc.2 .8 , mIoU.2 .8 , and mIoU-max. Motivated by GAN inversion’s high-quality reconstruction capability, we propose a simple but more powerful InvAD framework to achieve high-quality feature reconstruction. Our method improves the effectiveness of reconstruction-based methods on popular MVTec AD, VisA, and our newly proposed COCO-AD datasets under a multi-class unsupervised setting, where only a single detection model is trained to detect anomalies from different classes. Extensive ablation experiments have demonstrated the effectiveness of each component of our InvAD. Full codes and models are available at https://github.com/zhangzjn/ader.
Discipline
Computer Sciences | Graphics and Human Computer Interfaces
Research Areas
Intelligent Systems and Optimization
Publication
International Journal of Computer Vision
Volume
134
Issue
5
First Page
1
Last Page
23
ISSN
0920-5691
Identifier
10.1007/s11263-026-02809-z
Publisher
Springer
Citation
ZHANG, Jiangning; WANG, Chengjie; LI, Xiangtai; TIAN, Guanzhong; XUE, Zhucun; LIU, Yong; PANG, Guansong; and TAO, Dacheng.
Learning feature inversion for multi-class anomaly detection under general-purpose COCO-AD benchmark. (2026). International Journal of Computer Vision. 134, (5), 1-23.
Available at: https://ink.library.smu.edu.sg/sis_research/11094
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1007/s11263-026-02809-z