Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

11-2017

Abstract

The prevalence of questions and answers on domain-specific Q&A sites like Stack Overflow constitutes a core knowledge asset for the software engineering domain. Although search engines can return a list of questions relevant to a user's technical query, the abundance of relevant posts and the sheer amount of information in them make it difficult for developers to digest them and find the answers they most need. In this work, we aim to help developers who want to quickly capture the key points of several answer posts relevant to a technical question before they read the details of the posts. We formulate our task as a query-focused multi-answer-posts summarization task for a given technical question. Our proposed approach, AnswerBot, contains three main steps: 1) relevant question retrieval, 2) useful answer paragraph selection, and 3) diverse answer summary generation. To evaluate our approach, we build a repository of 228,817 Java questions and their corresponding answers from Stack Overflow. We conduct user studies with 100 randomly selected Java questions (not in the question repository) to evaluate the quality of the answer summaries generated by our approach and the effectiveness of its relevant question retrieval and answer paragraph selection components. Our evaluation shows that the answer summaries generated by our approach are relevant to developers’ technical questions, useful, and diverse, and that its components can effectively retrieve relevant questions and select salient answer paragraphs for summarization.

Keywords

summary generation, question retrieval

Discipline

Databases and Information Systems | Numerical Analysis and Scientific Computing | Software Engineering

Research Areas

Software and Cyber-Physical Systems

Publication

ASE '17: Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering: Urbana-Champaign, IL, October 30-November 3

First Page

706

Last Page

716

ISBN

9781538626849

Identifier

10.1109/ASE.2017.8115681

Publisher

IEEE

City or Country

Piscataway, NJ

Copyright Owner and License

Authors

Additional URL

https://doi.org/10.1109/ASE.2017.8115681
