Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
11-2024
Abstract
Because the typical retraining paradigm is unacceptably time- and resource-consuming, researchers are turning to model editing as a way to modify model behavior directly, ideally in a manner that supports both consecutive and batch editing scenarios. Despite these practical expectations, existing model editing methods fail to realize all of them. Furthermore, the memory demands of sequential model editing approaches tend to be prohibitive, frequently requiring an external memory that grows over time. To cope with these challenges, we propose CoachHooK, a model editing method that simultaneously supports sequential and batch editing. CoachHooK is memory-friendly, as it needs only a small amount of memory to store several hook layers whose size remains unchanged over time. Experimental results demonstrate the superiority of our method over other batch-supportive model editing methods under both single-round and consecutive batch editing scenarios. Extensive analyses further verify the stability of CoachHooK over a number of consecutive editing steps.
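For readers unfamiliar with the general notion of a hook layer referenced in the abstract, the sketch below is a conceptual illustration only, not the paper's CoachHooK algorithm: it wraps a frozen linear layer and adds a fixed-size learned correction to its output, so the extra memory does not grow as more edits are applied. All names, shapes, and the form of the correction are assumptions made for illustration.

```python
# Conceptual sketch (not the paper's CoachHooK method): a "hook layer" that
# wraps an existing linear layer and adds a small, fixed-size correction to
# its output. Shapes and names here are illustrative assumptions.
import torch
import torch.nn as nn

class HookLayer(nn.Module):
    def __init__(self, base_layer: nn.Linear):
        super().__init__()
        self.base = base_layer                      # frozen original weights
        out_dim = base_layer.out_features
        # Fixed-size correction term; its size does not change with more edits.
        self.delta = nn.Parameter(torch.zeros(out_dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Original computation plus the learned correction.
        return self.base(x) + self.delta

# Usage: wrap one projection of a (hypothetical) transformer block.
base = nn.Linear(768, 3072)
hooked = HookLayer(base)
out = hooked(torch.randn(2, 768))   # shape: (2, 3072)
```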
Keywords
Retraining paradigm, Model editing, Sequential and batch editing
Discipline
Artificial Intelligence and Robotics
Areas of Excellence
Digital transformation
Publication
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024): Miami, Florida, USA, November 12-16
First Page
13817
Last Page
13833
Publisher
Association for Computational Linguistics
City or Country
Miami, USA
Citation
LI, Shuaiyi; DENG, Yang; CAI, Deng; LU, Hongyuan; CHEN, Liang; and LAM, Wai.
Consecutive batch model editing with HooK layers. (2024). Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024): Miami, Florida, USA, November 12-16. 13817-13833.
Available at: https://ink.library.smu.edu.sg/sis_research/9679
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.