Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
8-2008
Abstract
The effectiveness of knowledge transfer using classification algorithms depends on the difference between the distribution that generates the training examples and the one from which test examples are to be drawn. The task can be especially difficult when the training examples are from one or several domains different from the test domain. In this paper, we propose a locally weighted ensemble framework to combine multiple models for transfer learning, where the weights are dynamically assigned according to a model's predictive power on each test example. It can integrate the advantages of various learning algorithms and the labeled information from multiple training domains into one unified classification model, which can then be applied on a different domain. Importantly, different from many previously proposed methods, none of the base learning method is required to be specifically designed for transfer learning. We show the optimality of a locally weighted ensemble framework as a general approach to combine multiple models for domain transfer. We then propose an implementation of the local weight assignments by mapping the structures of a model onto the structures of the test domain, and then weighting each model locally according to its consistency with the neighborhood structure around the test example. Experimental results on text classification, spam filtering and intrusion detection data sets demonstrate significant improvements in classification accuracy gained by the framework. On a transfer learning task of newsgroup message categorization, the proposed locally weighted ensemble framework achieves 97% accuracy when the best single model predicts correctly only on 73% of the test examples. In summary, the improvement in accuracy is over 10% and up to 30% across different problems.
Keywords
Algorithms, Classification accuracies, Learning methods
Discipline
Databases and Information Systems | Numerical Analysis and Scientific Computing
Research Areas
Data Science and Engineering
Publication
KDD 2008: Proceedings of the 14th ACMKDD International Conference on Knowledge Discovery & Data Mining, Las Vegas, NV, August 24-27
First Page
283
Last Page
291
ISBN
9781605581934
Identifier
10.1145/1401890.1401928
Publisher
ACM
City or Country
New York
Citation
GAO, Jing; FAN, Wei; JIANG, Jing; and HAN, Jiawei.
Knowledge Transfer Via Multiple Model Local Structure Mapping. (2008). KDD 2008: Proceedings of the 14th ACMKDD International Conference on Knowledge Discovery & Data Mining, Las Vegas, NV, August 24-27. 283-291.
Available at: https://ink.library.smu.edu.sg/sis_research/307
Copyright Owner and License
Publisher
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1145/1401890.1401928
Included in
Databases and Information Systems Commons, Numerical Analysis and Scientific Computing Commons