Publication Type
Journal Article
Version
submittedVersion
Publication Date
2-2025
Abstract
Stack Overflow is one of the most influential Software Question & Answer (SQA) websites, hosting millions of programming-related questions and answers. Tags play a critical role in efficiently organizing the content on Stack Overflow and are vital to support various site operations, such as querying relevant content. Poorly chosen tags often lead to issues such as tag ambiguity and tag explosion. Therefore, a precise and accurate automated tag recommendation technique is needed. Inspired by the recent success of pre-trained models (PTMs) in natural language processing (NLP), we present PTM4Tag+, a tag recommendation framework for Stack Overflow posts that utilizes PTMs in language modeling. PTM4Tag+ is implemented with a triplet architecture, which considers three key components of a post, i.e., Title, Description, and Code, with independent PTMs. We utilize a number of popular pre-trained models, including BERT-based models (e.g., BERT, RoBERTa, CodeBERT, BERTOverflow, and ALBERT) and encoder-decoder models (e.g., PLBART, CoTexT, and CodeT5). Our results show that leveraging CodeT5 under the PTM4Tag+ framework achieves the best performance among the eight considered PTMs and outperforms the state-of-the-art Convolutional Neural Network-based approach by a substantial margin in terms of average Precision@k, Recall@k, and F1-score@k (k ranges from 1 to 5). Specifically, CodeT5 improves the performance of F1-score@1-5 by 8.8%, 12.4%, 15.3%, 16.4%, and 16.6%, respectively. Moreover, to address the concern with inference latency, we experimented with PTM4Tag+ using smaller PTMs (i.e., DistilBERT, DistilRoBERTa, CodeBERT-small, and CodeT5-small). We find that although smaller PTMs cannot outperform larger PTMs, they still maintain over 93.96% of the performance on average while reducing the mean inference time by more than 47.2%.
Keywords
Pre-trained models, Stack Overflow, Tag recommendation, Transformer
Discipline
Software Engineering
Research Areas
Software and Cyber-Physical Systems
Publication
Empirical Software Engineering
Volume
30
Issue
1
First Page
1
Last Page
45
ISSN
1382-3256
Identifier
10.1007/s10664-024-10576-z
Publisher
Springer
Citation
HE, Junda; XU, Bowen; YANG, Zhou; HAN, DongGyun; YANG, Chengran; LIU, Jiakun; ZHAO, Zhipeng; and David LO.
PTM4Tag+: Tag recommendation of Stack Overflow posts with pre-trained models. (2025). Empirical Software Engineering, 30(1), 1-45.
Available at: https://ink.library.smu.edu.sg/sis_research/9846
Copyright Owner and License
Authors CC-BY
Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.
Additional URL
https://doi.org/10.1007/s10664-024-10576-z