Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
5-2024
Abstract
Auto-scaling is crucial for achieving elasticity in cloud databases as well as other cloud systems. Predictive auto-scaling, which leverages forecasting techniques to adjust resources based on predicted workload, has been widely adopted. However, the inherent inaccuracy of forecasting presents a significant challenge, potentially causing resource under-provisioning. To address this challenge, we propose robust predictive auto-scaling that considers the uncertainty in forecasts. Unlike previous predictive approaches that rely on single-valued forecasts, we leverage probabilistic forecasting techniques to generate quan-tile forecasts, providing a more comprehensive understanding of the potential future workloads. By formulating the auto-scaling problem as a robust optimization problem, we enable the implementation of auto-scaling strategies with customizable levels of robustness, which can be determined by considering various quantile levels of forecasts. Moreover, we enhance the adaptability of our strategy by incorporating different quantile levels through-out the entire decision horizon, allowing for dynamic adjustments in the conservatism of our auto-scaling decisions. This enables us to strike a balance between resource efficiency and system robustness. Through extensive experiments, we demonstrate the effectiveness of our approach in achieving robust auto-scaling in cloud databases, while maintaining reasonable resource efficiency.
Keywords
Resource Scaling, Workload Forecasting, Cloud Databases
Discipline
Software Engineering
Research Areas
Software and Cyber-Physical Systems
Publication
2024 40th IEEE 40th International Conference on Data Engineering (ICDE): Utrecht, Netherlands, May 13-16: Proceedings
First Page
4016
Last Page
4029
ISBN
9798350317152
Identifier
10.1109/ICDE60146.2024.00308
Publisher
IEEE Computer Society
City or Country
Los Alamitos, CA
Citation
HANG, Haitian; TANG, Xiu; SUN, Jianling; BAO, Lingfeng; LO, David; and WANG, Haoye.
Robust auto-scaling with probabilistic workload forecasting for cloud databases. (2024). 2024 40th IEEE 40th International Conference on Data Engineering (ICDE): Utrecht, Netherlands, May 13-16: Proceedings. 4016-4029.
Available at: https://ink.library.smu.edu.sg/sis_research/9264
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1109/ICDE60146.2024.00308