Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
6-2003
Abstract
Information extraction (IE) is of great importance in many applications including web intelligence, search engines, text understanding, etc. To extract information from text documents, most IE systems rely on a set of extraction patterns. Each extraction pattern is defined based on the syntactic and/or semantic constraints on the positions of desired entities within natural language sentences. The IE systems also provide a set of pattern templates that determines the kind of syntactic and semantic constraints to be considered. In this paper, we argue that such pattern templates restricts the kind of extraction patterns that can be learned by IE systems. To allow a wider range of context information to be considered in learning extraction patterns, we first propose to model the content and context information of a candidate entity to be extracted as a set of features. A classification model is then built for each category of entities using Support Vector Machines (SVM). We have conducted IE experiments to evaluate our proposed method on a text collection in the terrorism domain. From the preliminary experimental results, we conclude that our proposed method can deliver reasonable accuracies.
Discipline
Databases and Information Systems | Numerical Analysis and Scientific Computing
Publication
Intelligence and Security Informatics: First NSF/NIJ Symposium, ISI 2003, Tucson, AZ, USA, June 2-3, 2003: Proceedings
Volume
2665
First Page
1
Last Page
12
ISBN
9783540401896
Identifier
10.1007/3-540-44853-5_1
Publisher
Springer Verlag
City or Country
Berlin
Citation
SUN, Aixin; NAING, Myo-Myo; LIM, Ee Peng; and LAM, Wai.
Using Support Vector Machines for Terrorism Information Extraction. (2003). Intelligence and Security Informatics: First NSF/NIJ Symposium, ISI 2003, Tucson, AZ, USA, June 2-3, 2003: Proceedings. 2665, 1-12.
Available at: https://ink.library.smu.edu.sg/sis_research/960
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
http://doi.org/10.1007/3-540-44853-5_1
Included in
Databases and Information Systems Commons, Numerical Analysis and Scientific Computing Commons