Optimizing Resource Management for Shared Microservices: A Scalable System Design

Shutian Luo, Chenyu Lin, Kejiang Ye, Guoyao Xu, Liping Zhang, Guodong Yang, Huanle Xu, Chengzhong Xu

February 2024

Abstract

We present Erms, an efficient resource management system that guarantees SLAs with high probability in shared microservice environments. Erms profiles microservice latency as a piecewise linear function of the workload, resource usage, and interference. Based on this profiling, Erms builds resource scaling models to optimally determine latency targets for microservices with complex dependencies. Erms also designs new scheduling policies at shared microservices to further enhance resource efficiency. Experiments across microservice benchmarks as well as trace-driven simulations demonstrate that, compared to state-of-the-art approaches, Erms reduces SLA violation probability by 5× and, more importantly, reduces resource usage by 1.6×.
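As a rough illustration of what a piecewise linear latency profile can look like, the Python sketch below models latency with two linear segments joined at a knee, then inverts the model to size a deployment for a latency target. It is a minimal sketch under stated assumptions, not the Erms implementation: the function names, the two-segment shape, and all parameter values are hypothetical, and the effects of resource usage and interference are assumed to be folded into the slope and knee parameters.

```python
import math


def latency_ms(load_per_container, knee, slope_low, slope_high, base_ms):
    """Two-segment piecewise linear latency model (all parameters hypothetical).

    Below the knee, latency grows slowly with load; above it, it grows faster.
    """
    if load_per_container <= knee:
        return base_ms + slope_low * load_per_container
    # Continue from the value at the knee so the curve stays continuous.
    return base_ms + slope_low * knee + slope_high * (load_per_container - knee)


def containers_needed(total_load, target_ms, knee, slope_low, slope_high, base_ms):
    """Smallest container count whose per-container load meets target_ms.

    Assumes positive slopes and load split evenly across containers.
    """
    if target_ms <= base_ms:
        raise ValueError("latency target must exceed the base latency")
    knee_ms = base_ms + slope_low * knee
    if target_ms <= knee_ms:
        # Target falls on the low-slope segment.
        max_load = (target_ms - base_ms) / slope_low
    else:
        # Target falls on the high-slope segment.
        max_load = knee + (target_ms - knee_ms) / slope_high
    return math.ceil(total_load / max_load)


if __name__ == "__main__":
    # Hypothetical profile: knee at 100 req/s, base latency 5 ms.
    print(latency_ms(150, knee=100, slope_low=0.1, slope_high=0.5, base_ms=5))
    print(containers_needed(1000, target_ms=40, knee=100,
                            slope_low=0.1, slope_high=0.5, base_ms=5))
```

Inverting the profile this way shows why the choice of per-microservice latency target matters: a looser target lets each container absorb more load, so fewer containers are needed.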

Type

Journal article

Publication

ACM Transactions on Computer Systems (ToCS), 42(1-2)

Shutian Luo

2021-2023 PhD Student

Chenyu Lin

2022-2024 Master Student

Huanle Xu

2021 - Current

Chengzhong Xu

2019 - Current