Revolutionizing Cloud Computing with Predictive Autoscaling using transformer model: Improving Resource Utilization

Sabiha, Fatema Tuz

dc.contributor.advisor	Shrestha, Raju
dc.contributor.author	Sabiha, Fatema Tuz
dc.date.accessioned	2023-11-07T14:44:28Z
dc.date.available	2023-11-07T14:44:28Z
dc.date.issued	2023
dc.identifier.uri	https://hdl.handle.net/11250/3101181
dc.description.abstract	The adoption of cloud computing by small as well as large organizations has been rapidly increasing now a days. While cloud computing can be cost-effective, it can also become very expensive if proper care is not taken. In order to ensure high availability, cloud providers often tend to overprovision resources, leading to resource wastage and financial losses. Therefore, there is a growing need for efficient resource management in cloud computing. Recognizing the growing interest among researchers in utilizing machine learning models for optimizing resource utilization in cloud computing, this study aims to enhance resource utilization by automating the scaling of a traffic controller in a cloud environment by using a transformer model, which have gained popularity recently. The proposed approach in this research involves training and utilizing a time series forecasting model to implement an autoscaling strategy that can dynamically allocate resources based on actual and predicted future demand in cloud computing. To implement the proposed model, a transformer model was trained using publicly available data offline and used to predict future traffic. The predicted value was then utilized to calculate the target utilization and fed to a Kubernetes-based Event-Driven Autoscaler (KEDA) component for autoscaling an ingress controller integrated with a microservice application running in the cloud. The model was tested in four different scenarios, including without autoscaling, with Horizontal Pod Autoscaling (HPA), with KEDA, and with the implemented transformer model. The experimental results show that the proposed model did not significantly outperform HPA in terms of the performance metrics considered. However, the proposed model exhibited a trend of changing utilization levels while maintaining a stable response time, suggesting a possibility of improving resource utilization with further investigation and fine-tuning.	en_US
dc.language.iso	nob	en_US
dc.publisher	Oslomet - storbyuniversitetet	en_US
dc.title	Revolutionizing Cloud Computing with Predictive Autoscaling using transformer model: Improving Resource Utilization	en_US
dc.type	Master thesis	en_US
dc.description.version	publishedVersion	en_US

Tilhørende fil(er)

Filnavn:: Sabiha_acit2023.pdf
Størrelse:: 1.477Mb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

TKD - Master i Anvendt data- og informasjonsteknologi (ACIT) [244]

Vis enkel innførsel