Title

Cost-Benefit Analysis of Bags in a Web Warehouse

Publication Type

Conference Proceeding Article

Publication Date

8-1999

Abstract

Sets and bags are closely related structures and have been studied in relational databases. A bag differs from a set in that it is sensitive to the number of times an element occurs, while a set is not. In this paper, we introduce the concept of a web bag in the context of a web warehouse called WHOWEDA (Warehouse Of Web Data), which we are currently building. Informally, a web bag is a web table that allows multiple occurrences of identical web tuples. A web bag helps to discover useful knowledge from a web table, such as visible documents (or web sites), luminous documents, and luminous paths. We provide a cost-benefit analysis of materializing web bags as compared to web tables with distinct web tuples.
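
The set/bag distinction described in the abstract can be illustrated with a minimal Python sketch. It is not the WHOWEDA data model: the representation of a web tuple as a (source, target) pair, the example URLs, and the names `web_tuples`, `as_set`, and `as_bag` are all assumptions made for illustration only.

```python
from collections import Counter

# Hypothetical web tuples, each modelled here as a (source, target) pair.
# This flat representation is an assumption, not the paper's definition.
web_tuples = [
    ("a.example/index", "b.example/page"),
    ("c.example/home", "b.example/page"),
    ("a.example/index", "b.example/page"),  # identical to the first tuple
]

# A set keeps only distinct tuples, so the repeated tuple collapses.
as_set = set(web_tuples)

# A bag (multiset) also records how many times each tuple occurs.
as_bag = Counter(web_tuples)

print(len(as_set))           # number of distinct tuples
print(sum(as_bag.values()))  # number of tuples counting repeats
print(as_bag[("a.example/index", "b.example/page")])  # multiplicity of one tuple
```

The bag costs extra storage for the repeated tuples or their counts, but it retains occurrence information that the distinct-tuple form discards, which is the trade-off the paper analyzes.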

Discipline

Databases and Information Systems

Research Areas

Data Management and Analytics

Publication

International Database Engineering and Applications Symposium (IDEAS '99)

First Page

34

Last Page

42

ISBN

9780769502656

Identifier

10.1109/IDEAS.1999.787249

Publisher

IEEE

City or Country

Montreal, Canada, Aug 2-4

Additional URL

http://dx.doi.org/10.1109/IDEAS.1999.787249