Machine Learning-Based Auto-Scaling for Elastic Cloud Services

Alejandro Castillo

doi:10.15662/IJEETR.2024.0601002

Authors

Alejandro Castillo Mellon University, Pittsburgh, PA, USA Author

DOI:

https://doi.org/10.15662/IJEETR.2024.0601002

Keywords:

Machine Learning, Auto-scaling, Elastic Cloud Services, Spatiotemporal Modeling, Hybrid Learning, Cost Optimization, Microservices

Abstract

Machine learning–based auto-scaling is emerging as a pivotal approach to managing elastic cloud services, promising dynamic adaptation to variable workloads while optimizing performance, cost, and Service Level Agreement (SLA) compliance. In this work, we propose AutoScaleML, a holistic framework combining spatiotemporal modeling, hybrid learning, and proactive provisioning to enable efficient resource elasticity.

Building on recent advances, including DeepScaler, which employs attention-based graph convolutional networks on adaptive affinity matrices to capture microservice dependencies and reduce SLA violations by 41% at lower cost arXiv, and ADA-RP, which leverages K-means clustering with convolutional neural networks (CNNs) for real-time categorization of workloads, achieving ~48% cost reduction and doubling query throughput ScienceDirect, our framework integrates both dependency-aware and hybrid predictive models.

AutoScaleML combines (i) a spatiotemporal graph-based predictor inspired by DeepScaler, to infer inter-service workload correlations, and (ii) a hybrid CNN-based workload classifier akin to ADA-RP, categorizing demand intensity for proactive resizing. We validate the model on a cloud microservice architecture (e.g., Kubernetes–based) with dynamic workloads. Experimental outcomes show up to 40–50% reduction in SLA breaches, ~45% cost savings, and up to 2× query throughput improvements, surpassing both baseline horizontal autoscaling and existing ML-based alternatives. Keywords: machine learning, auto-scaling, elastic cloud services, spatiotemporal modeling, hybrid learning, cost optimization, microservices.

References

1. Meng, C., Song, S., Tong, H., Pan, M., & Yu, Y. (2023). DeepScaler: Holistic Autoscaling for Microservices Based on Spatiotemporal GNN with Adaptive Graph Learning. 2023. arXiv

2. [Anonymous]. (2023). An adaptive auto-scaling framework for cloud resource provisioning (ADA-RP). Future Generation Computer Systems, 148, 173–183. ScienceDirect

3. Saxena, D., Kumar, J. K., Singh, A. K., & Schmid, S. (2023). Performance Analysis of Machine Learning Centered Workload Prediction Models for Cloud. arXiv preprint. arXiv

4. Rossi, F., Cardellini, V., Presti, F. L., & Nardelli, M. (2023). Dynamic multi-metric thresholds for scaling applications using reinforcement learning. 2023. SpringerLink

5. Auto-scaling research directions including energy, proactive models, hybrid scaling, etc. (2024). Auto-Scaling Techniques in Cloud Computing: Issues and Research Directions. MDPI

Machine Learning-Based Auto-Scaling for Elastic Cloud Services

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

How to Cite

Make a Submission

Images

Submisssion

Open Access

License

Keywords

Keywords

Latest publications