Concept Drifting Detection on Noisy Streaming Data in Random Ensemble Decision Trees
Publication Type
Conference Proceeding Article
Publication Date
7-2009
Abstract
Although a large number of inductive learning algorithms have been developed for handling concept-drifting data streams, especially those based on ensemble classification models, few of them can detect the different types of concept drift in noisy streaming data with low overheads of time and space. Motivated by this, this paper proposes a new classification algorithm for Concept drifting Detection based on an ensemble model of Random Decision Trees (called CDRDT). Extensive studies with synthetic and real streaming data demonstrate that, in comparison to several representative classification algorithms for concept-drifting data streams, CDRDT not only detects potential concept changes in noisy data streams effectively and efficiently, but also performs much better in runtime and space consumption while improving predictive accuracy. Thus, the proposed algorithm provides a useful, lightweight reference for the classification of concept-drifting data streams with noise.
Keywords
Data Streams - Ensemble Decision Trees - Concept Drift - Noise
Discipline
Computer Sciences
Publication
MLDM'2009
ISBN
9783642030697
Identifier
10.1007/978-3-642-03070-3_18
Publisher
Springer Verlag
Citation
LI, Peipei; Hu, X.; LIANG, Qianhui (Althea); and GAO, Yunjun.
Concept Drifting Detection on Noisy Streaming Data in Random Ensemble Decision Trees. (2009). MLDM'2009.
Available at: https://ink.library.smu.edu.sg/sis_research/466
Additional URL
http://dx.doi.org/10.1007/978-3-642-03070-3_18