Publication Type
Journal Article
Version
acceptedVersion
Publication Date
9-2023
Abstract
In Stack Overflow, developers may not clarify and summarize the critical problems in the question titles due to a lack of domain knowledge or poor writing skills. Previous studies mainly focused on automatically generating the question titles by analyzing the posts’ problem descriptions and code snippets. In this study, we aim to improve title quality from the perspective of question title reformulation and propose a novel approach QETRA motivated by the findings of our formative study. Specifically, by mining modification logs from Stack Overflow, we first extract title reformulation pairs containing the original title and the reformulated title. Then we resort to multi-task learning by formalizing title reformulation for each programming language as separate but related tasks. Later we adopt a pre-trained model T5 to automatically learn the title reformulation patterns. Automated evaluation and human study both show the competitiveness of QETRA after compared with six state-of-the-art baselines. Moreover, our ablation study results also confirm that our studied question title reformulation task is more practical than the direct question title generation task for generating high-quality titles. Finally, we develop a browser plugin based on QETRA to facilitate the developers to perform title reformulation. Our study provides a new perspective for studying the quality of post titles and can further generate high-quality titles.
Keywords
Stack Overflow mining, question post quality assurance, question title reformulation, modification logs, deeplearning
Discipline
Artificial Intelligence and Robotics | Software Engineering
Research Areas
Intelligent Systems and Optimization
Publication
IEEE Transactions on Software Engineering
Volume
49
Issue
9
First Page
4390
Last Page
4410
ISSN
0098-5589
Identifier
10.1109/TSE.2023.3292399
Publisher
Institute of Electrical and Electronics Engineers
Citation
LIU, Ke; CHEN, Xiang; CHEN, Chunyang; XIE, Xiaofei; and CUI, Zhanqi.
Automated question title reformulation by mining modifcation logs from Stack Overflow. (2023). IEEE Transactions on Software Engineering. 49, (9), 4390-4410.
Available at: https://ink.library.smu.edu.sg/sis_research/8225
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1109/TSE.2023.3292399