Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
12-2008
Abstract
Since Jim Gray introduced the concept of the "data cube" in 1997, the data cube, together with online analytical processing (OLAP), has become a driving engine of the data warehouse industry. Because the Internet boom has given rise to an ever-increasing amount of text data associated with other multidimensional information, it is natural to propose a data cube model that integrates the power of traditional OLAP with information retrieval (IR) techniques for text. In this paper, we propose a Text-Cube model on multidimensional text databases and study effective OLAP over such data. The model distinguishes two kinds of hierarchies: the dimensional hierarchy and the term hierarchy. By incorporating these hierarchies, we conduct systematic studies on efficient text-cube implementation, OLAP execution, and query processing. Our performance study shows the high promise of our methods.
Keywords
Cube, Text, OLAP
Discipline
Databases and Information Systems
Publication
ICDM 2008: 8th IEEE International Conference on Data Mining: 15-19 December, Pisa, Italy: Proceedings
First Page
905
Last Page
910
ISBN
9780769535029
Identifier
10.1109/ICDM.2008.135
Publisher
IEEE Computer Society
City or Country
Los Alamitos, CA
Citation
LIN, Cindy Xinde; DING, Bolin; HAN, Jiawei; ZHU, Feida; and ZHAO, Bo.
Text Cube: Computing IR Measures for Multidimensional Text Database Analysis. (2008). ICDM 2008: 8th IEEE International Conference on Data Mining: 15-19 December, Pisa, Italy: Proceedings. 905-910.
Available at: https://ink.library.smu.edu.sg/sis_research/1008
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Additional URL
https://doi.ieeecomputersociety.org/10.1109/ICDM.2008.135