Title

Text Cube: Computing IR Measures for Multidimensional Text Database Analysis

Publication Type

Conference Proceeding Article

Publication Date

12-2008

Abstract

Since Jim Gray introduced the concept of ”data cube” in 1997, data cube, associated with online analytical processing (OLAP), has become a driving engine in data warehouse industry. Because the boom of Internet has given rise to an ever increasing amount of text data associated with other multidimensional information, it is natural to propose a data cube model that integrates the power of traditional OLAP and IR techniques for text. In this paper, we propose a Text-Cube model on multidimensional text database and study effective OLAP over such data. Two kinds of hierarchies are distinguishable inside: dimensional hierarchy and term hierarchy. By incorporating these hierarchies, we conduct systematic studies on efficient text-cube implementation, OLAP execution and query processing. Our performance study shows the high promise of our methods.

Keywords

Cube, Text, OLAP

Discipline

Databases and Information Systems

Research Areas

Data Management and Analytics

Publication

Proceedings of the 8th International Conference on Data Mining (ICDM '08)

First Page

905

Last Page

910

ISBN

9780769535029

Identifier

10.1109/ICDM.2008.135

Publisher

IEEE

City or Country

Pisa, Italy

Additional URL

http://doi.ieeecomputersociety.org/10.1109/ICDM.2008.135