Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
4-2018
Abstract
The efficient processing of document streams plays an important role in many information filtering systems. Emerging applications, such as news update filtering and social network notifications, demand presenting end-users with the most relevant content to their preferences. In this work, user preferences are indicated by a set of keywords. A central server monitors the document stream and continuously reports to each user the top-k documents that are most relevant to her keywords. The objective is to support large numbers of users and high stream rates, while refreshing the topk results almost instantaneously. Our solution abandons the traditional frequency-ordered indexing approach, and follows an identifier-ordering paradigm that suits better the nature of the problem. When complemented with a locally adaptive technique, our method offers (i) optimality w.r.t. the number of considered queries per stream event, and (ii) an order of magnitude shorter response time than the state-of-the-art.
Discipline
Databases and Information Systems | Data Storage Systems
Research Areas
Data Science and Engineering
Publication
Proceedings of the 34th IEEE International Conference on Data Engineering: ICDE 2018, Paris, France, April 16-19
First Page
1803
Last Page
1804
Identifier
10.1109/ICDE.2018.00259
City or Country
Paris, France
Citation
U, Leong Hou; ZHANG, Junjie; MOURATIDIS, Kyriakos; and LI, Ye.
Continuous top-K monitoring on document streams (Extended abstract). (2018). Proceedings of the 34th IEEE International Conference on Data Engineering: ICDE 2018, Paris, France, April 16-19. 1803-1804.
Available at: https://ink.library.smu.edu.sg/sis_research/4174
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1109/ICDE.2018.00259