Robust Resource Scaling of Containerized Microservices with Probabilistic Machine learning

Peng Kang; Palden Lama; IEEE

doi:10.1109/UCC48980.2020.00031

Back

Conference proceeding

Robust Resource Scaling of Containerized Microservices with Probabilistic Machine learning

Peng Kang, Palden Lama and IEEE

2020 IEEE/ACM 13th International Conference on Utility and Cloud Computing (UCC), pp.122-131

12/2020

DOI: https://doi.org/10.1109/UCC48980.2020.00031

Abstract

Adaptation models

adaptiveness

changing resource demands

changing system dynamics

cloud computing

cloud data centers

cloud providers

complex interactions

computer centres

computing resources

container orchestration

container-level resource usage metrics

containerization

containerized microservices

Containers

end-to-end performance guarantee

large-scale web services

learning (artificial intelligence)

lightweight isolated execution environment

Machine learning

microservice workflows

Microservices

modular components

multitenant performance interference

NSF Cloud's Chameleon

open-source microservices benchmark

Performance modeling

performance SLO

popular machine learning techniques

Predictive models

probabilistic machine learning-based performance model

quality of service

resource allocation

Resource management

robust resource scaling system

RScale

Servers

service-level-objective

shared hardware resources

superior prediction accuracy

virtual machine-level hardware performance

virtual machines

virtualisation

web service performance

Web services

Large-scale web services are increasingly being built with many small modular components (microservices), which can be deployed, updated and scaled seamlessly. These microservices are packaged to run in a lightweight isolated execution environment (containers) and deployed on computing resources rented from cloud providers. However, the complex interactions and the contention of shared hardware resources in cloud data centers pose significant challenges in managing web service performance. In this paper, we present RScale, a robust resource scaling system that provides end-to-end performance guarantee for containerized microservices deployed in the cloud. RScale employs a probabilistic machine learning-based performance model, which can quickly adapt to changing system dynamics and directly provide confidence bounds in the predictions with minimal overhead. It leverages multi-layered data collected from container-level resource usage metrics and virtual machine-level hardware performance counter metrics to capture changing resource demands in the presence of multi-tenant performance interference. We implemented and evaluated RScale on NSF Cloud's Chameleon testbed using KVM for virtualization, Docker Engine for containerization and Kubernetes for container orchestration. Experimental results with an open-source microservices benchmark, Robot Shop, demonstrate the superior prediction accuracy and adaptiveness of our modeling approach compared to popular machine learning techniques. RScale meets the performance SLO (service-level-objective) targets for various microservice workflows even in the presence of multi-tenant performance interference and changing system dynamics.

Metrics

1 Record Views

Details

Title: Robust Resource Scaling of Containerized Microservices with Probabilistic Machine learning
Creators: Peng Kang - The University of Texas at San Antonio
Palden Lama - The University of Texas at San Antonio
IEEE
Academic Unit: California State University, Sacramento; Computer Science Department
Publisher: IEEE
Publication Details: 12/2020
Grant note: National Science Foundation (10.13039/100000001)
Identifiers: 99258167659701671; https://doi.org/10.1109/UCC48980.2020.00031
Language: English
Number of pages: 10

Robust Resource Scaling of Containerized Microservices with Probabilistic Machine learning

Abstract

Related links

Metrics

Details