Publication Type

Journal Article

Version

publishedVersion

Publication Date

January 2025

Abstract

The rapid development of deep learning has driven significant progress in image semantic segmentation, a fundamental task in computer vision. Semantic segmentation algorithms often depend on the availability of pixel-level labels (i.e., masks of objects), which are expensive, time-consuming, and labor-intensive to produce. Weakly supervised semantic segmentation (WSSS) is an effective way to avoid such labeling: it utilizes only partial or incomplete annotations and provides a cost-effective alternative to fully supervised semantic segmentation. In this article, we focus on WSSS with image-level labels, the most challenging form of WSSS. Our work has two parts. First, we conduct a comprehensive survey of traditional methods, primarily those presented at premier research conferences, and categorize them into four groups according to the level at which they operate: pixel-wise, image-wise, cross-image, and external data. Second, we investigate the applicability of visual foundation models, such as the Segment Anything Model (SAM), in the context of WSSS. We scrutinize SAM in two intriguing scenarios: text prompting and zero-shot learning. We provide insights into the potential and challenges of deploying visual foundation models for WSSS, facilitating future developments in this exciting research area. Our code is available at https://github.com/zhaozhengChen/SAM_WSSS.
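
To make the SAM scenario above concrete, below is a minimal sketch of one way SAM could turn image-level supervision into pseudo-masks: high-activation pixels from a class activation map (CAM) serve as point prompts to SAM's predictor, and the best-scoring mask becomes the pseudo-label. This is an illustration under assumptions, not the authors' pipeline; the checkpoint path, the 0.7 activation threshold, and the randomly generated image/CAM inputs are placeholders.

    # Sketch: prompt SAM with high-confidence CAM points to build a
    # pseudo-mask for WSSS (illustrative only, not the paper's method).
    import numpy as np
    from segment_anything import sam_model_registry, SamPredictor

    # Placeholder inputs; in practice these come from the dataset and
    # a classifier's class activation map for one image-level class.
    image = np.random.randint(0, 256, (512, 512, 3), dtype=np.uint8)
    cam = np.random.rand(512, 512).astype(np.float32)

    # Load a SAM backbone (checkpoint path is an assumption).
    sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
    predictor = SamPredictor(sam)
    predictor.set_image(image)

    # Take up to five of the most activated pixels as foreground prompts
    # (0.7 is an assumed threshold on the normalized activation).
    ys, xs = np.where(cam > 0.7 * cam.max())
    idx = np.random.choice(len(xs), size=min(5, len(xs)), replace=False)
    points = np.stack([xs[idx], ys[idx]], axis=1).astype(np.float32)  # (N, 2) as (x, y)
    labels = np.ones(len(idx), dtype=int)  # 1 marks a foreground point

    # SAM proposes several candidate masks; keep the highest-scoring one.
    masks, scores, _ = predictor.predict(
        point_coords=points,
        point_labels=labels,
        multimask_output=True,
    )
    pseudo_mask = masks[np.argmax(scores)]

In a full WSSS pipeline, such a pseudo_mask would then supervise an ordinary segmentation network, which is the common final stage of the methods the survey covers.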

Keywords

Weakly supervised, semantic segmentation, Segment Anything Model

Discipline

Artificial Intelligence and Robotics | Graphics and Human Computer Interfaces

Research Areas

Intelligent Systems and Optimization

Areas of Excellence

Digital transformation

Publication

ACM Computing Surveys

Volume

57

Issue

5

First Page

1

Last Page

29

ISSN

0360-0300

Identifier

10.1145/3707447

Publisher

Association for Computing Machinery (ACM)

Additional URL

https://doi.org/10.1145/3707447
