Publication Type
Journal Article
Version
acceptedVersion
Publication Date
1-2019
Abstract
Kernel-based regression represents an important family of learning techniques for solving challenging regression tasks with non-linear patterns. Despite being studied extensively, most of the existing work suffers from two major drawbacks as follows: (i) they are often designed for solving regression tasks in a batch learning setting, making them not only computationally inefficient and but also poorly scalable in real-world applications where data arrives sequentially; and (ii) they usually assume that a fixed kernel function is given prior to the learning task, which could result in poor performance if the chosen kernel is inappropriate. To overcome these drawbacks, this work presents a novel scheme of Online Multiple Kernel Regression (OMKR), which sequentially learns the kernel-based regressor in an online and scalable fashion, and dynamically explore a pool of multiple diverse kernels to avoid suffering from a single fixed poor kernel so as to remedy the drawback of manual/heuristic kernel selection. The OMKR problem is more challenging than regular kernelbased regression tasks since we have to on-the-fly determine both the optimal kernel-based regressor for each individual kernel and the best combination of the multiple kernel regressors. We propose a family of OMKR algorithms for regression and discuss their application to time series prediction tasks including application to AR, ARMA, and ARIMA time series.We develop novel approaches to make OMKR scalable for large datasets, to counter the problems arising from an unbounded number of support vectors.We also explore the effect of kernel combination at prediction level and at the representation level. Finally, we conduct extensive experiments to evaluate the empirical performance on both real-world regression and times series prediction tasks.
Keywords
large-scale kernel learning, multiple kernel regression, Online learning, time-series prediction
Discipline
Databases and Information Systems | Theory and Algorithms
Research Areas
Data Science and Engineering
Publication
ACM Transactions on Knowledge Discovery from Data
Volume
13
Issue
1
First Page
9:1
Last Page
33
ISSN
1556-4681
Identifier
10.1145/3299875
Publisher
Association for Computing Machinery (ACM)
Citation
SAHOO, Doyen; HOI, Steven C. H.; and LIN, Bin.
Large scale online multiple kernel regression with application to time-series prediction. (2019). ACM Transactions on Knowledge Discovery from Data. 13, (1), 9:1-33.
Available at: https://ink.library.smu.edu.sg/sis_research/4383
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1145/3299875