Publication Type
Journal Article
Version
acceptedVersion
Publication Date
2-2016
Abstract
Community-based question answering (cQA) is a popular type of online knowledge-sharing web service where users ask questions and obtain answers contributed by others. To enhance knowledge sharing, cQA also provides users with a retrieval function to access the historical question-answer pairs (QAs). However, it is still ineffective in that the retrieval result is typically a ranked list of potentially relevant QAs, rather than a succinct and informative answer. To alleviate the problem, this paper proposes a three-level scheme, which aims to generate a query-focused summary-style answer in terms of two factors, i.e., novelty and redundancy. Specifically, we first retrieve a set of QAs for the given query, and then develop a smoothed Naive Bayes model to identify the topics of answers by exploiting their associated category information. Next, to compute the global ranking scores of answers, we first propose a parameterized graph-based method to model a Markov random walk on a graph that is parameterized by the heterogeneous features of answers, and then combine the ranking scores with the relevance scores of answers. Based on the computed global ranking scores, we utilize two different strategies to construct the top-K candidate answer set, and finally solve a constrained optimization problem on the sentence set of the top-K answers to generate a summary for a user's query. Experiments on real-world data demonstrate the effectiveness of our proposed approach as compared to the baselines. © 2015 Elsevier Inc. All rights reserved.
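The ranking step described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not say how the graph is parameterized by heterogeneous features or how the walk scores and relevance scores are combined, so this sketch assumes a plain damped random walk (PageRank-style) over a given weight matrix and a simple linear mix. The function names and the `damping` and `mix` parameters are hypothetical.

```python
def random_walk_scores(weights, damping=0.85, iters=100, tol=1e-10):
    """Stationary scores of a damped random walk on a weighted graph.

    weights[i][j] is a non-negative edge weight from answer i to answer j.
    How these weights would be derived from answer features is not
    specified in the abstract; here they are simply given as input.
    """
    n = len(weights)
    # Row-normalize into transition probabilities; a row with no outgoing
    # weight falls back to a uniform jump.
    trans = []
    for row in weights:
        total = sum(row)
        trans.append([w / total for w in row] if total > 0 else [1.0 / n] * n)
    scores = [1.0 / n] * n
    for _ in range(iters):
        new = [
            (1 - damping) / n
            + damping * sum(scores[i] * trans[i][j] for i in range(n))
            for j in range(n)
        ]
        delta = sum(abs(a - b) for a, b in zip(new, scores))
        scores = new
        if delta < tol:
            break
    return scores


def global_scores(weights, relevance, mix=0.5):
    """Linear mix of walk scores and normalized relevance (assumed form)."""
    walk = random_walk_scores(weights)
    rel_total = sum(relevance)
    n = len(relevance)
    # Normalize relevance so both terms are on the same scale.
    rel = [r / rel_total for r in relevance] if rel_total > 0 else [1.0 / n] * n
    return [mix * w + (1 - mix) * r for w, r in zip(walk, rel)]


def top_k(scores, k):
    """Indices of the k highest-scoring answers, best first."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]


# Three answers, fully connected with equal weights; answer 0 is the
# most relevant to the query.
example_weights = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
example_relevance = [3.0, 1.0, 1.0]
example_scores = global_scores(example_weights, example_relevance)
best = top_k(example_scores, 1)
```

With equal edge weights the walk alone cannot separate the answers, so the relevance term decides the ordering. In a real setting the edge weights would carry the feature-based signal.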
Keywords
Summarization, Community-based question answering, Graph-based ranking
Discipline
Computer Sciences | Databases and Information Systems
Research Areas
Data Science and Engineering
Publication
Information Sciences
Volume
330
First Page
403
Last Page
423
ISSN
0020-0255
Identifier
10.1016/j.ins.2015.10.024
Publisher
Elsevier
Citation
WEI, Wei; MING, ZhaoYan; NIE, Liqiang; LI, Guohui; LI, Jianjun; ZHU, Feida; SHANG, Tianfeng; and LUO, Changyin.
Exploring Heterogeneous Features for Query-focused Summarization of Categorized Community Answers. (2016). Information Sciences. 330, 403-423.
Available at: https://ink.library.smu.edu.sg/sis_research/3132
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1016/j.ins.2015.10.024