Cross-project defect prediction via ASTToken2Vec and BLSTM-based neural network
Publication Type
Conference Proceeding Article
Publication Date
7-2019
Abstract
Cross-project defect prediction (CPDP) as a means to focus quality assurance of software projects was under heavy investigation in recent years. In this paper, we propose a novel CPDP approach via deep learning. In particular, we model each program module via simplified abstract syntax tree (S-AST). For each node in S-AST, only the project-independent node type is remained and other project-specific information (such as name of variable and method) is ignored, so that the modeling method is project-independent and suitable for CPDP issue. Then we extract token sequences from program modules modeled as S-AST. In addition, to construct meaningful vector representations for token sequences, we propose a novel unsupervised embedding method ASTToken2Vec, which learns semantic information from S-AST's natural structure. Finally, we use BLSTM (bi-directional long short-term memory) based neural network to automatically learn semantic features from vectorized token sequences and construct CPDP models. In our empirical studies, 10 real large-scale open source Java projects are chosen as our empirical subjects. Final results show that our proposed CPDP approach can perform significantly better than 5 state-of-the-art CPDP baselines in terms of AUC.
Discipline
OS and Networks | Software Engineering
Research Areas
Software and Cyber-Physical Systems
Publication
Proceedings of the 2019 International Joint Conference on Neural Networks, Budapest, Hungary, July 14-19
ISBN
9781728119854
Identifier
10.1109/IJCNN.2019.8852135
Publisher
IEEE
City or Country
Budapest, Hungary
Citation
LI, Hao; LI, Xiaohong; CHEN, Xiang; XIE, Xiaofei; MU, Yanzhou; and FENG, Zhiyong.
Cross-project defect prediction via ASTToken2Vec and BLSTM-based neural network. (2019). Proceedings of the 2019 International Joint Conference on Neural Networks, Budapest, Hungary, July 14-19.
Available at: https://ink.library.smu.edu.sg/sis_research/7094