Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
9-2023
Abstract
Stack Overflow, the world's largest software Q&A (SQA) website, is facing a significant traffic drop due to the emergence of generative AI techniques. ChatGPT is banned by Stack Overflow after only 6 days from its release. The main reason provided by the official Stack Overflow is that the answers generated by ChatGPT are of low quality. To verify this, we conduct a comparative evaluation of human-written and ChatGPT-generated answers. Our methodology employs both automatic comparison and a manual study. Our results suggest that human-written and ChatGPT-generated answers are semantically similar, however, human-written answers outperform ChatGPT-generated ones consistently across multiple aspects, specifically by 10% on the overall score. We release the data, analysis scripts, and detailed results at https://github.com/maxxbw54/GAI4SQA.
Keywords
ChatGPT, Generative AI, large language model, Software Q&A, Stack Overflow
Discipline
Artificial Intelligence and Robotics | Software Engineering
Research Areas
Software and Cyber-Physical Systems
Publication
2023 38th IEEE/ACM International Conference on Automated Software Engineering: Luxembourg, September 11-15: Proceedings
First Page
1713
Last Page
1717
ISBN
9798350329964
Identifier
10.1109/ASE56229.2023.00023
Publisher
IEEE
City or Country
Piscataway, NJ
Citation
XU, Bowen; NGUYEN, Thanh-Dat; LE-CONG, Thanh; HOANG, Thong; LIU, Jiakun; KIM, Kisub; GONG, Chen; NIU, Changan; WANG, Chenyu; David LO; and LO, David.
Are we ready to embrace generative AI for software Q&A?. (2023). 2023 38th IEEE/ACM International Conference on Automated Software Engineering: Luxembourg, September 11-15: Proceedings. 1713-1717.
Available at: https://ink.library.smu.edu.sg/sis_research/8489
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1109/ASE56229.2023.00023