Publication Type

Conference Proceeding Article

Version

publishedVersion

Publication Date

6-2005

Abstract

In this paper, we attempt to give spatial semantics to web pages by assigning them place names. The entire assignment task is divided into three sub-problems, namely place name extraction, place name disambiguation and place name assignment. We propose approaches to address each of these sub-problems. In particular, we have modified GATE, a well-known named entity extraction software, to perform place name extraction using a US Census gazetteer. We also propose a rule-based place name disambiguation method and a place name assignment method capable of assigning place names to web page segments. We have evaluated our proposed disambiguation and assignment methods on a web page collection referenced by the DLESE metadata collection. The results returned by our methods are compared with manually disambiguated place names and manual place name assignments. The comparison shows that our proposed place name disambiguation method works well for geo/geo ambiguities. Preliminary results for our place name assignment method are promising, given the existence of geo/non-geo ambiguities among place names.

Discipline

Databases and Information Systems | Geography

Publication

JCDL '05: Proceedings of the 5th ACM/IEEE Joint Conference on Digital Libraries: Denver, CO, June 7-11, 2005

First Page

354

Last Page

362

ISBN

9781581138764

Identifier

10.1145/1065385.1065464

Publisher

ACM

City or Country

New York

Additional URL

http://doi.org/10.1145/1065385.1065464
