Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

10-2024

Abstract

Zero-shot object counting (ZOC) aims to enumerate objects in images using only the names of object classes during testing, without the need for manual annotations. However, a critical challenge in current ZOC methods lies in their inability to identify high-quality exemplars effectively. This deficiency hampers scalability across diverse classes and undermines the development of strong visual associations between the identified classes and image content. To this end, we propose the Visual Association-based Zero-shot Object Counting (VA-Count) framework. VACount consists of an Exemplar Enhancement Module (EEM) and a Noise Suppression Module (NSM) that synergistically refine the process of class exemplar identification while minimizing the consequences of incorrect object identification. The EEM utilizes advanced vision-language pretaining models to discover potential exemplars, ensuring the framework’s adaptability to various classes. Meanwhile, the NSM employs contrastive learning to differentiate between optimal and suboptimal exemplar pairs, reducing the negative effects of erroneous exemplars. VA-Count demonstrates its effectiveness and scalability in zero-shot contexts with superior performance on two object counting datasets.

Keywords

Zero-shot object counting, Object classification, Object counting framework, Class exemplar identification

Discipline

Artificial Intelligence and Robotics | Graphics and Human Computer Interfaces

Research Areas

Intelligent Systems and Optimization

Publication

Proceedings of the 18th European Conference on Computer Vision (ECCV 2024) : Milan, Italy, September 29 - October 4

Identifier

10.1007/978-3-031-72652-1_22

Publisher

ECCV

City or Country

Italy

Citation

ZHU, Huilin; YUAN, Jingling; YANG, Zhengwei; GUO, Yu; WANG, Zheng; ZHONG, Xian; and HE, Shengfeng. Zero-shot object counting with good exemplars. (2024). Proceedings of the 18th European Conference on Computer Vision (ECCV 2024) : Milan, Italy, September 29 - October 4.
Available at: https://ink.library.smu.edu.sg/sis_research/9768

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.1007/978-3-031-72652-1_22

Download

Included in

Artificial Intelligence and Robotics Commons, Graphics and Human Computer Interfaces Commons

COinS

Research Collection School Of Computing and Information Systems

Zero-shot object counting with good exemplars

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Identifier

Publisher

City or Country

Citation

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Zero-shot object counting with good exemplars

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Identifier

Publisher

City or Country

Citation

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links