Concept Drifting Detection on Noisy Streaming Data in Random Ensemble Decision Trees
Conference Proceeding Article
Although a vast majority of inductive learning algorithms has been developed for handling of the concept drifting data streams, especially the ones in virtue of ensemble classification models, few of them could adapt to the detection on the different types of concept drifts from noisy streaming data in a light demand on overheads of time and space. Motivated by this, a new classification algorithm for Concept drifting Detection based on an ensembling model of Random Decision Trees (called CDRDT) is proposed in this paper. Extensive studies with synthetic and real streaming data demonstrate that in comparison to several representative classification algorithms for concept drifting data streams, CDRDT not only could effectively and efficiently detect the potential concept changes in the noisy data streams, but also performs much better on the abilities of runtime and space with an improvement in predictive accuracy. Thus, our proposed algorithm provides a significant reference to the classification for concept drifting data streams with noise in a light weight way.
Data Streams - Ensemble Decision Trees - Concept Drift - Noise
Data Management and Analytics
LI, Peipei; Hu, X.; LIANG, Qianhui (Althea); and GAO, Yunjun.
Concept Drifting Detection on Noisy Streaming Data in Random Ensemble Decision Trees. (2009). MLDM´2009. Research Collection School Of Information Systems.
Available at: http://ink.library.smu.edu.sg/sis_research/466