On warehousing historical web information
Publication Type
Conference Proceeding Article
Publication Date
10-2000
Abstract
We present a temporal web data model designed for warehousing historical data from World Wide Web that changes with time. As the Web is now populated with large volume of web information, it has become necessary to capture some useful web information in a data warehouse that supports further intelligent data analysis. Nevertheless, due to the unstructured and dynamic nature of Web, the traditional relational model and its temporal variants could not be used to build such a data warehouse. In this paper, we therefore propose a temporal web data model that captures the connectivities of web documents and their content in the form of temporal web tables. To support the analysis of web data that evolve with time, valid time intervals are associated with each web document. To manipulate temporal web tables, we define a variety of web operators and illustrate their usefulness using some realistic motivating examples.
Discipline
Databases and Information Systems | Numerical Analysis and Scientific Computing
Publication
Conceptual Modeling - ER 2000: 19th International Conference on Conceptual Modeling, Salt Lake City, Utah, October 9-12: Proceedings
Volume
1920
First Page
253
Last Page
266
ISBN
9783540453932
Identifier
10.1007/3-540-45393-8_19
Publisher
Springer Verlag
City or Country
Salt Lake City, UT
Citation
CAO, Yinyan; LIM, Ee Peng; and NG, Wee-Keong.
On warehousing historical web information. (2000). Conceptual Modeling - ER 2000: 19th International Conference on Conceptual Modeling, Salt Lake City, Utah, October 9-12: Proceedings. 1920, 253-266.
Available at: https://ink.library.smu.edu.sg/sis_research/917
Additional URL
http://doi.org/10.1007/3-540-45393-8_19