Publication Type

Conference Proceeding Article

Version

publishedVersion

Publication Date

12-2008

Abstract

Since Jim Gray introduced the concept of ”data cube” in 1997, data cube, associated with online analytical processing (OLAP), has become a driving engine in data warehouse industry. Because the boom of Internet has given rise to an ever increasing amount of text data associated with other multidimensional information, it is natural to propose a data cube model that integrates the power of traditional OLAP and IR techniques for text. In this paper, we propose a Text-Cube model on multidimensional text database and study effective OLAP over such data. Two kinds of hierarchies are distinguishable inside: dimensional hierarchy and term hierarchy. By incorporating these hierarchies, we conduct systematic studies on efficient text-cube implementation, OLAP execution and query processing. Our performance study shows the high promise of our methods.

Keywords

Cube, Text, OLAP

Discipline

Databases and Information Systems

Publication

ICDM 2008: 8th IEEE International Conference on Data Mining: 15-19 December, Pisa, Italy: Proceedings

First Page

905

Last Page

910

ISBN

9780769535029

Identifier

10.1109/ICDM.2008.135

Publisher

IEEE Computer Society

City or Country

Los Alamitos, CA

Additional URL

https://doi.ieeecomputersociety.org/10.1109/ICDM.2008.135

Share

COinS