Title

ViDE: A Visual Data Extraction Environment for the Web

Publication Type

Conference Proceeding Article

Publication Date

9-2001

Abstract

With the rapid growth of information on the Web, a means to combat information overload is critical. In this paper, we present ViDE (Visual Data Extraction), an interactive web data extraction environment that supports efficient hierarchical data wrapping of multiple web pages. ViDE has two unique features that differentiate it from other extraction mechanisms. First, data extraction rules can be easily specified in a graphical user interface that is seamlessly integrated with a web browser. Second, ViDE introduces the concept of grouping which unites the extraction rules for a set of documents with the navigational patterns that exist among them. This paper describes our initial development of the system.

Discipline

Databases and Information Systems | Numerical Analysis and Scientific Computing

Research Areas

Data Management and Analytics

Publication

Database and Expert Systems Applications: 12th International Conference, DEXA 2001 Munich, Germany, September 3–5: Proceedings

Volume

2113

First Page

577

Last Page

586

ISBN

9783540447597

Identifier

10.1007/3-540-44759-8_57

Publisher

Springer Verlag

City or Country

Munich, Germany

Additional URL

http://dx.doi.org/0.1007/3-540-44759-8_57