Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
9-2015
Abstract
A microblog repost tree provides strong clues on how an event described therein develops. To help social media users capture the main clues of events on microblogging sites, we propose a novel repost tree summarization framework by effectively differentiating two kinds of messages on repost trees called leaders and followers, which are derived from contentlevel structure information, i.e., contents of messages and the reposting relations. To this end, Conditional Random Fields (CRF) model is used to detect leaders across repost tree paths. We then present a variant of random-walk-based summarization model to rank and select salient messages based on the result of leader detection. To reduce the error propagation cascaded from leader detection, we improve the framework by enhancing the random walk with adjustment steps for sampling from leader probabilities given all the reposting messages. For evaluation, we construct two annotated corpora, one for leader detection, and the other for repost tree summarization. Experimental results confirm the effectiveness of our method.
Discipline
Databases and Information Systems
Research Areas
Data Science and Engineering; Information Systems and Management
Publication
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal, September 17-21
First Page
2168
Last Page
2178
Identifier
10.18653/v1/D15-1259
Publisher
Association for Computational Linguistics
City or Country
New York
Citation
LI, Jing; GAO, Wei; WEI, Zhongyu; PENG, Baolin; and WONG, Kam-Fai.
Using content-level structures for summarizing microblog repost trees. (2015). Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal, September 17-21. 2168-2178.
Available at: https://ink.library.smu.edu.sg/sis_research/6795
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.18653/v1/D15-1259