Publication Type

Journal Article

Version

acceptedVersion

Publication Date

9-2026

Abstract

Machine unlearning has emerged as a key mechanism for enabling the “right to be forgotten” in neural network models, allowing the selective removal of specific training data upon request. Existing approaches typically rely on retraining models with the remaining data, which is computationally expensive and difficult to verify, especially when deployed models are distributed or resource-constrained. To address this challenge, our prior conference work introduced PRUNE, a patching-based framework that formulates unlearning as a neural network repair problem. PRUNE achieves targeted forgetting by learning lightweight patch networks that redirect model predictions on the data to be unlearned while preserving performance on the remaining data. In this extended journal version, we make three major advances: (1) we formally define a threat model that characterizes dishonest behaviors of model owners and the corresponding privacy risks; (2) we extend PRUNE to support class-level unlearning, enabling removal of all samples from a target category; and (3) we present additional experiments showing that PRUNE resists membership inference attacks, demonstrating its robustness to privacy leakage. Extensive evaluations on multiple classification benchmarks confirm that PRUNE achieves certifiable unlearning with high efficiency, minimal performance degradation, and strong verifiability.
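
To make the patching idea concrete, below is a minimal, hypothetical PyTorch sketch of a patching-based unlearning scheme of the kind the abstract describes. The `PatchedModel` and `patch_loss` names, the residual-correction-on-logits architecture, and the redirect-to-uniform forgetting objective are illustrative assumptions for this sketch, not the paper's actual PRUNE design.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PatchedModel(nn.Module):
    """Frozen base classifier plus a small trainable patch network.

    Hypothetical sketch: only the patch is trained, so the update stays
    lightweight compared with retraining the full model.
    """
    def __init__(self, base: nn.Module, num_classes: int, hidden: int = 32):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # keep the deployed model unchanged
        self.patch = nn.Sequential(
            nn.Linear(num_classes, hidden),
            nn.ReLU(),
            nn.Linear(hidden, num_classes),
        )

    def forward(self, x):
        logits = self.base(x)
        return logits + self.patch(logits)  # residual correction of outputs

def patch_loss(model, x_forget, x_retain, y_retain, lam: float = 1.0):
    """Redirect forget-set predictions while preserving retained behavior.

    Assumed objective: push forget-set outputs toward a uniform (uninformative)
    distribution and keep standard cross-entropy low on the remaining data.
    """
    logits_f = model(x_forget)
    uniform = torch.full_like(logits_f, 1.0 / logits_f.size(-1))
    forget_term = F.kl_div(
        F.log_softmax(logits_f, dim=-1), uniform, reduction="batchmean"
    )
    retain_term = F.cross_entropy(model(x_retain), y_retain)
    return retain_term + lam * forget_term

# Usage (hypothetical data tensors): optimize only the patch parameters, e.g.
#   opt = torch.optim.Adam(model.patch.parameters(), lr=1e-3)
#   loss = patch_loss(model, x_forget, x_retain, y_retain)
#   loss.backward(); opt.step()
```

Freezing the base model and training only the small patch is what makes this family of approaches efficient relative to retraining; the specific patch architecture and forgetting objective used by PRUNE are given in the paper itself.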

Keywords

Machine Learning, Machine Unlearning, Privacy Leakage, Data Privacy

Discipline

Information Security | OS and Networks

Publication

Neural Networks

Volume

201

First Page

1

Last Page

16

ISSN

0893-6080

Identifier

10.1016/j.neunet.2026.108897

Publisher

Elsevier

Copyright Owner and License

Authors

Additional URL

https://doi.org/10.1016/j.neunet.2026.108897
