Publication Type
Journal Article
Version
publishedVersion
Publication Date
7-2026
Abstract
Learning-based dynamic fault localization techniques play a crucial role in the field of software engineering. These techniques dynamically execute test cases to meticulously extract useful knowledge from the execution information in the program, with the aim of identifying fault locations by leveraging machine learning, deep learning, and large language models. Currently, there is already a flourishing body of research that is intensely focused on learning-based dynamic fault localization. Research literature can be categorized into two main aspects for learning-based dynamic fault localization: data-based enhancements (i.e., the datasets) and model-based enhancements (i.e., the suspiciousness algorithms). Thus, we conduct an extensive literature review on learning-based dynamic fault localization from the aspects of the data task and the model task. Among them, each task is divided into multiple sub-tasks in a systematic manner to comprehensively discuss the details. In addition, we analyze and summarize the datasets and metrics that have been widely used to evaluate the effectiveness of the proposed techniques in recent years, so that researchers can have an intuitive perception of them. We also discuss the present challenges and the directions for future research.
Keywords
Fault localization, machine learning, deep learning, large language models, survey
Discipline
Artificial Intelligence and Robotics | Software Engineering
Publication
ACM Computing Surveys
Volume
58
Issue
9
First Page
1
Last Page
39
ISSN
0360-0300
Identifier
10.1145/3787202
Publisher
Association for Computing Machinery (ACM)
Citation
LIU, Chunyan; LEI, Yan; XIE, Huan; WANG, Jinping; YU, Yue; and LO, David.
Survey on learning-based dynamic fault localization: From traditional machine learning to large language models. (2026). ACM Computing Surveys. 58, (9), 1-39.
Available at: https://ink.library.smu.edu.sg/sis_research/11097
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1145/3787202