Publication Type
Journal Article
Version
publishedVersion
Publication Date
10-2006
Abstract
Web pages from a Web site can often be associated with concepts in an ontology, and pairs of Web pages also can be associated with relationships between concepts. With such associations, the Web site can be searched, browsed, or even reorganized based on the concept and relationship labels of its Web pages. In this article, we study the link chain extraction problem that is critical to the extraction of Web pages that are related. A link chain is an ordered list of anchor elements linking two Web pages related by some semantic relationship. We propose a link chain extraction method that derives extraction rules for identifying the anchor elements forming the link chains. We applied the proposed method to two well-structured Web sites and found that its performance in terms of precision and recall is good, even with a small number of training examples.
Discipline
Databases and Information Systems | Numerical Analysis and Scientific Computing
Publication
Journal of the American Society for Information Science and Technology
Volume
57
Issue
12
First Page
1590
Last Page
1605
ISSN
1532-2882
Identifier
10.1002/asi.20469
Publisher
Wiley
Citation
NAING, Myo-Myo; LIM, Ee Peng; and CHIANG, Roger Hsiang-Li.
Extracting link chains of relationship instances from a website. (2006). Journal of the American Society for Information Science and Technology. 57, (12), 1590-1605.
Available at: https://ink.library.smu.edu.sg/sis_research/202
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
http://doi.org/10.1002/asi.20469
Included in
Databases and Information Systems Commons, Numerical Analysis and Scientific Computing Commons