Exploring Heterogeneous Features for Query-focused Summarization of Categorized Community Answers
Community-based question answering (cQA) is a popular type of online knowledge-sharing web service where users ask questions and obtain answers contributed by others. To enhance knowledge sharing, cQA also provides users with a retrieval function to access the historical question-answer pairs (QAs). However, it is still ineffective in that the retrieval result is typically a ranking list of potentially relevant QAs, rather than a succinct and informative answer. To alleviate the problem, this paper proposes a three-level scheme, which aims to generate a query-focused summary-style answer in terms of two factors, i.e., novelty and redundancy. Specifically, we first retrieve a set of QAs to the given query, and then develop a smoothed Naive Bayes model to identify the topics of answers, by exploiting their associated category information. Next, to compute the global ranking scores of answers, we first propose a parameterized graph-based method to model a Markov random walk on a graph that is parameterized by the heterogeneous features of answers, and then combine the ranking scores with the relevance scores of answers. Based on the computed global ranking scores, we utilize two different strategies to construct top-K candidate answer set, and finally solve a constrained optimization problem on the sentence set of top-K answers to generate a summary towards a user's query. Experiments on real-world data demonstrate the effectiveness of our proposed approach as compared to the-baselines. (C) 2015 Elsevier Inc. All rights reserved.
Summarization, Community-based question answering, Graph-based ranking
Computer Sciences | Databases and Information Systems
Data Management and Analytics
WEI, Wei; MING, ZhaoYan; NIE, Liqiang; LI, Guohui; LI, Jianjun; ZHU, Feida; SHANG, Tianfeng; and LUO, Changyin.
Exploring Heterogeneous Features for Query-focused Summarization of Categorized Community Answers. (2016). Information Sciences. 330, 403-423. Research Collection School Of Information Systems.
Available at: http://ink.library.smu.edu.sg/sis_research/3132