Publication Type

Journal Article

Version

publishedVersion

Publication Date

4-2023

Abstract

Do Deep Learning (DL) techniques actually help to improve the performance of duplicate bug report detection? Prior studies suggest that they do, if the duplicate bug report detection task is treated as a binary classification problem. However, in realistic scenarios, the task is often viewed as a ranking problem, which predicts potential duplicate bug reports by ranking based on similarities with existing historical bug reports. There is little empirical evidence to support that DL can be effectively applied to detect duplicate bug reports in the ranking scenario. Therefore, in this paper, we investigate whether well-known DL-based methods outperform classic information retrieval (IR) based methods on the duplicate bug report detection task. In addition, we argue that both IR- and DL-based methods suffer from incompletely evaluating the similarity between bug reports, resulting in the loss of important information. To address this problem, we propose a new method that combines IR and DL techniques to compute textual similarity more comprehensively. Our experimental results show that the DL-based method itself does not yield high performance compared to IR-based methods. However, our proposed combined method improves on the MAP metric of classic IR-based methods by a median of 7.09%–11.34% and a maximum of 17.228%–28.97%.

Keywords

Duplicate bug report detection, Deep learning, Information retrieval, Similarity measure, Realistic evaluation

Discipline

Software Engineering

Research Areas

Software and Cyber-Physical Systems

Publication

Journal of Systems and Software

Volume

198

First Page

Last Page

ISSN

0164-1212

Identifier

10.1016/j.jss.2023.111607

Publisher

Elsevier

Citation

JIANG, Yuan; SU, Xiaohong; TREUDE, Christoph; SHANG, Chao; and WANG, Tiantian. Does deep learning improve the performance of duplicate bug report detection? An empirical study. (2023). Journal of Systems and Software. 198, 1-26.
Available at: https://ink.library.smu.edu.sg/sis_research/8785

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.1016/j.jss.2023.111607

Download

Included in

Software Engineering Commons

COinS

Research Collection School Of Computing and Information Systems

Does deep learning improve the performance of duplicate bug report detection? An empirical study

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Does deep learning improve the performance of duplicate bug report detection? An empirical study

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links