Multi-granularity distribution alignment for cross-domain crowd counting
Publication Type
Journal Article
Publication Date
12-2025
Abstract
Unsupervised domain adaptation enables the transfer of knowledge from a labeled source domain to an unlabeled target domain, and its application in crowd counting is gaining momentum. Current methods typically align distributions across domains to address inter-domain disparities at a global level. However, these methods often struggle with significant intra-domain gaps caused by domain-agnostic factors such as density, surveillance angles, and scale, leading to inaccurate alignment and unnecessary computational burdens, especially in large-scale training scenarios. To address these challenges, we propose the Multi-Granularity Optimal Transport (MGOT) distribution alignment framework, which aligns domain-agnostic factors across domains at different granularities. The motivation behind multi-granularity is to capture fine-grained domain-agnostic variations within domains. Our method proceeds in three phases: first, clustering coarse-grained features based on intra-domain similarity; second, aligning the granular clusters using an optimal transport framework and constructing a mapping from cluster centers to finer patch levels between domains; and third, re-weighting the aligned distribution for model refinement in domain adaptation. Extensive experiments across twelve cross-domain benchmarks show that our method outperforms existing state-of-the-art methods in adaptive crowd counting. The code will be available at https://github.com/HopooLinZ/MGOT.
Discipline
Graphics and Human Computer Interfaces
Research Areas
Intelligent Systems and Optimization
Publication
IEEE Transactions on Image Processing
Volume
34
First Page
3648
Last Page
3662
ISSN
1057-7149
Identifier
10.1109/TIP.2025.3571312
Publisher
Institute of Electrical and Electronics Engineers
Citation
ZHONG, Xian; QIU, Lingyue; ZHU, Huilin; YUAN, Jingling; HE, Shengfeng; and WANG, Zheng.
Multi-granularity distribution alignment for cross-domain crowd counting. (2025). IEEE Transactions on Image Processing. 34, 3648-3662.
Available at: https://ink.library.smu.edu.sg/sis_research/10805