Publication Type
Journal Article
Version
acceptedVersion
Publication Date
4-2025
Abstract
In the rapidly evolving landscape of neural network security, the resilience of neural networks against bit-flip attacks (i.e., an attacker maliciously flips an extremely small amount of bits within its parameter storage memory system to induce harmful behavior), has emerged as a relevant area of research. Existing studies suggest that quantization may serve as a viable defense against such attacks. Recognizing the documented susceptibility of real-valued neural networks to such attacks and the comparative robustness of quantized neural networks (QNNs), in this work, we introduce BFAVerifier, the first verification framework designed to formally verify the absence of bit-flip attacks against QNNs or to identify all vulnerable parameters in a sound and rigorous manner. BFAVerifier comprises two integral components: an abstraction-based method and an MILP-based method. Specifically, we first conduct a reachability analysis with respect to symbolic parameters that represent the potential bit-flip attacks, based on a novel abstract domain with a sound guarantee. If the reachability analysis fails to prove the resilience of such attacks, then we encode this verification problem into an equivalent MILP problem which can be solved by off-the-shelf solvers. Therefore, BFAVerifier is sound, complete, and reasonably efficient. We conduct extensive experiments, which demonstrate its effectiveness and efficiency across various activation functions, quantization bit-widths, and adversary capabilities.
Keywords
Bit-Flip Attacks, Formal Verification, Quantized Neural Networks, Robustness
Discipline
OS and Networks | Software Engineering
Research Areas
Software and Cyber-Physical Systems
Publication
Proceedings of the ACM on Programming Languages
Volume
9
Issue
OOPSLAI
First Page
1
Last Page
31
Identifier
10.1145/3720471
Publisher
Association for Computing Machinery (ACM)
Citation
ZHANG, Yedi; HUANG, Lei; GAO, Pengfei; SONG, Fu; SUN, Jun; and DONG, Jin Song.
Verification of bit-flip attacks against quantized neural networks. (2025). Proceedings of the ACM on Programming Languages. 9, (OOPSLAI), 1-31.
Available at: https://ink.library.smu.edu.sg/sis_research/10159
Copyright Owner and License
Authors
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1145/3720471