Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
4-2025
Abstract
Semi-supervised learning (SSL), exemplified by FixMatch (Sohn et al., 2020), has shown significant generalization advantages over supervised learning (SL), particularly in the context of deep neural networks (DNNs). However, it is still unclear, from a theoretical standpoint, why FixMatch-like SSL algorithms generalize better than SL on DNNs. In this work, we present the first theoretical justification for the enhanced test accuracy observed in FixMatch-like SSL applied to DNNs by taking convolutional neural networks (CNNs) on classification tasks as an example. Our theoretical analysis reveals that the semantic feature learning processes in FixMatch and SL are rather different. In particular, FixMatch learns all the discriminative features of each semantic class, while SL only randomly captures a subset of features due to the well-known lottery ticket hypothesis. Furthermore, we show that our analysis framework can be applied to other FixMatch-like SSL methods, e.g., FlexMatch, FreeMatch, Dash, and SoftMatch. Inspired by our theoretical analysis, we develop an improved variant of FixMatch, termed Semantic-Aware FixMatch (SA-FixMatch). Experimental results corroborate our theoretical findings and the enhanced generalization capability of SA-FixMatch.
Discipline
Artificial Intelligence and Robotics
Research Areas
Intelligent Systems and Optimization
Areas of Excellence
Digital transformation
Publication
Proceedings of the Thirteenth International Conference on Learning Representations, Singapore, 2025 April 24-28
First Page
1
Last Page
38
City or Country
Singapore
Citation
LI, Jingyang; PAN, Jiachun; TAN, Vincent; TOH, Kim-chuan; and ZHOU, Pan.
Towards understanding why FixMatch generalizes better than supervised learning. (2025). Proceedings of the Thirteenth International Conference on Learning Representations, Singapore, 2025 April 24-28. 1-38.
Available at: https://ink.library.smu.edu.sg/sis_research/10461
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://openreview.net/forum?id=25kAzqzTrz