Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
6-2005
Abstract
In this paper, we attempt to give spatial semantics to web pages by assigning them place names. We divide the assignment task into three sub-problems: place name extraction, place name disambiguation, and place name assignment, and propose an approach to each. In particular, we have modified GATE, a well-known named entity extraction tool, to perform place name extraction using a US Census gazetteer. We also propose a rule-based place name disambiguation method and a place name assignment method capable of assigning place names to web page segments. We evaluated the disambiguation and assignment methods on a web page collection referenced by the DLESE metadata collection, comparing their results with manually disambiguated place names and manual place name assignments. The results show that the proposed disambiguation method works well for geo/geo ambiguities. Preliminary results for the place name assignment method are promising, given the existence of geo/non-geo ambiguities among place names.
Discipline
Databases and Information Systems | Geography
Publication
JCDL 05: Proceedings of the 5th ACM/IEEE Joint Conference on Digital Libraries: Denver, CO, June 7-11, 2005
First Page
354
Last Page
362
ISBN
9781581138764
Identifier
10.1145/1065385.1065464
Publisher
ACM
City or Country
New York
Citation
ZONG, Wenbo; WU, Dan; SUN, Aixin; LIM, Ee Peng; and GOH, Dion Hoe-Lian.
On assigning place names to geography related web pages. (2005). JCDL 05: Proceedings of the 5th ACM/IEEE Joint Conference on Digital Libraries: Denver, CO, June 7-11, 2005. 354-362.
Available at: https://ink.library.smu.edu.sg/sis_research/936
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Additional URL
http://doi.org/10.1145/1065385.1065464