Elastic Scaling Method for Multi-tenant Databases Based on Hybrid Workload Prediction Model
Author:
Affiliation:

Clc Number:

TP311

  • Article
  • | |
  • Metrics
  • |
  • Reference [18]
  • |
  • Related
  • | | |
  • Comments
    Abstract:

    One of the most important features of multi-tenant databases in cloud environments is scalability. However, most elastic scaling techniques struggle to make effective scaling decisions for dynamically changing loads. If load changes can be predicted in advance, resource supply can be accurately adjusted. Given this, this study proposes a load-prediction-based elastic scaling method for multi-tenant databases. It includes a combined load prediction model and an elastic scaling strategy. The load prediction model combines the advantages of convolutional neural networks, long short-term memory networks and gated recurrent units. It can accurately forecast memory requirements of database clusters. Based on the prediction results, the elastic scaling strategy adjusts the number of virtual machines to ensure that resource supply remains within a reasonable range. Compared to existing methods, the combined load prediction model can reduce prediction errors by 8.7% to 21.8% and improve prediction fitting degree by 4.6%. Furthermore, this study improves the Bayesian optimization algorithm for hyperparameter tuning of the combined prediction model. The improved hyperparameter tuning model reduces errors by above 20% and improves fitting degree by 1.04%, which proves that it can well address the poor performance of Bayesian optimization in combined domains of discrete and continuous solutions. Compared to the most widely used scaling strategy in Kubernetes, the proposed elastic scaling method reduces response time by 8.12% and latency by 9.56%. It can avoid the latency and the waste of resources to a large extent.

    Reference
    [1] Zhao HQ, Lim H, Hanif M, Lee C. Predictive container auto-scaling for cloud-native applications. In: Proc. of the 2019 Int’l Conf. on Information and Communication Technology Convergence. Jeju: IEEE, 2019. 1280–1282. [doi: 10.1109/ICTC46691.2019.8939932]
    [2] Nguyen TT, Yeom YJ, Kim T, Park DH, Kim S. Horizontal pod autoscaling in Kubernetes for elastic container orchestration. Sensors, 2020, 20(16): 4621.
    [3] Sellami W, Hadj Kacem H, Hadj Kacem A. Dynamic provisioning of service composition in a multi-tenant SaaS environment. Journal of Network and Systems Management, 2020, 28(2): 367–397.
    [4] Zafeiropoulos A, Fotopoulou E, Filinis N, Papavassiliou S. Reinforcement learning-assisted autoscaling mechanisms for serverless computing platforms. Simulation Modelling Practice and Theory, 2022, 116: 102461.
    [5] Nouri SMR, Li H, Venugopal S, Guo WX, He MY, Tian WH. Autonomic decentralized elasticity based on a reinforcement learning controller for cloud applications. Future Generation Computer Systems, 2019, 94: 765–780.
    [6] Ghobaei-Arani M, Jabbehdari S, Pourmina MA. An autonomic resource provisioning approach for service-based cloud applications: A hybrid approach. Future Generation Computer Systems, 2018, 78: 191–210.
    [7] Horovitz S, Arian Y. Efficient cloud auto-scaling with SLA objective using Q-learning. In: Proc. of the 6th IEEE Int’l Conf. on Future Internet of Things and Cloud. Barcelona: IEEE, 2018. 85–92. [doi: 10.1109/FiCloud.2018.00020]
    [8] Wei Y, Kudenko D, Liu SJ, Pan L, Wu L, Meng XX. A reinforcement learning based auto-scaling approach for SaaS providers in dynamic cloud environment. Mathematical Problems in Engineering, 2019, 2019(1): 5080647.
    [9] 吴晓军, 张成, 原盛, 任晓春, 王玮. 基于强化学习的云资源混合式弹性伸缩算法. 西安交通大学学报, 2022, 56(1): 142–150.
    Wu XJ, Zhang C, Yuan S, Ren XC, Wang W. Blended elastic scaling method for cloud resources following reinforcement learning. Journal of Xi’an Jiaotong University, 2022, 56(1): 142–150 (in Chinese with English abstract).
    [10] Song SH, Pan L, Liu SJ. A Q-learning based auto-scaling approach for provisioning big data analysis services in cloud environments. Future Generation Computer Systems, 2024, 154: 140–150.
    [11] Kardani-Moghaddam S, Buyya R, Ramamohanarao K. ADRL: A hybrid anomaly-aware deep reinforcement learning-based resource scaling in clouds. IEEE Trans. on Parallel and Distributed Systems, 2021, 32(3): 514–526.
    [12] Garí Y, Monge DA, Pacini E, Mateos C, García Garino C. Reinforcement learning-based application Autoscaling in the cloud: A survey. Engineering Applications of Artificial Intelligence, 2021, 102: 104288.
    [13] Yu Y, Si XS, Hu CH, Zhang JX. A review of recurrent neural networks: LSTM cells and network architectures. Neural Computation, 2019, 31(7): 1235–1270.
    [14] Hoseinzade E, Haratizadeh S. CNNpred: CNN-based stock market prediction using a diverse set of variables. Expert Systems with Applications, 2019, 129: 273–285.
    [15] Vidal A, Kristjanpoller W. Gold volatility prediction using a CNN-LSTM approach. Expert Systems with Applications, 2020, 157: 113481.
    [16] Ouhame S, Hadi Y, Ullah A. An efficient forecasting approach for resource utilization in cloud data center using CNN-LSTM model. Neural Computing and Applications, 2021, 33(16): 10043–10055.
    [17] Du L, Gao RB, Suganthan PN, Wang DZW. Bayesian optimization based dynamic ensemble for time series forecasting. Information Sciences, 2022, 591: 155–175.
    Related
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

徐海洋,刘海龙,陈先,王磊,金轲,侯舒峰,李战怀.基于组合负载预测模型的多租户数据库弹性伸缩方法.软件学报,2025,36(3):981-994

Copy
Share
Article Metrics
  • Abstract:1064
  • PDF: 1715
  • HTML: 196
  • Cited by: 0
History
  • Received:May 27,2024
  • Revised:July 16,2024
  • Online: September 13,2024
You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063