Embracing Imbalance: Dynamic Load Shifting among Microservice Containers in Shared Clusters

Abstract

In this paper, we utilize an alternative approach by leveraging load imbalance. The central concept involves the dynamic load shifting across microservice containers with a focus on imbalance awareness. However, achieving seamless integration between load shifting and resource scaling, while accommodating the demands of partial connection between upstream and downstream containers, remains a challenge. To address this challenge, we introduce Imbres—a new microservice system that optimizes load shifting, connection management, and resource scaling in tandem. One significant advantage of Imbres lies in its rapid responsiveness, relying solely on online gradients of latency, eliminating the need for offline profiling. Evaluation using real microservice benchmarks reveals that Imbres reduces resource allocation by up to 62% and decreases SLA violation probability by up to 82%, compared to state-of-the-art systems.

Publication
In The ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) 2025
Shutian Luo
2021-2023 PhD Student
Jianxiong Liao
Jianxiong Liao
2024 - Current

My current research interests lie in cloud computing and machine learning systems.

Chenyu Lin
2022-2024 Master Student
Huanle Xu
Huanle Xu
2021.01 - Current

I am currently an assistant professor from the Department of Computer and Information Scicence, Univeristy of Macau.