New graph transformer and dual-stream models target microservice tail latency prediction

By PulseAugur Editorial · [3 sources] · 2026-04-28 04:00

Two new research papers propose methods for predicting tail latency in microservice systems. The first, STLGT, uses a trace-based linear graph transformer to model service dependencies and a temporal module to capture workload dynamics; its authors report better accuracy and speed than existing methods. The second, USRFNet, uses a decoupled dual-stream learning approach that models traffic and resource metrics separately, together with a gradient modulation strategy to address training imbalances, and reports significant reductions in prediction error.
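For readers unfamiliar with the prediction target: tail latency is a high percentile of request latency, and the USRFNet abstract frames the task as predicting window-level P95. The sketch below shows one way such a label could be computed from raw samples. It is an illustration only, not code from either paper; the function name `window_p95`, the `(timestamp_s, latency_ms)` sample format, the 60-second window, and the nearest-rank percentile rule are all assumptions made for the example.

```python
import math
from collections import defaultdict


def window_p95(samples, window_s=60.0):
    """Return {window_start_s: p95_latency_ms} for fixed-size time windows.

    `samples` is an iterable of (timestamp_s, latency_ms) pairs.
    The sample format and window size are illustrative assumptions.
    """
    buckets = defaultdict(list)
    for ts, latency in samples:
        # Align each sample to the start of its fixed-size window.
        start = math.floor(ts / window_s) * window_s
        buckets[start].append(latency)

    result = {}
    for start, values in sorted(buckets.items()):
        values.sort()
        # Nearest-rank P95: rank = ceil(0.95 * n), computed with integers
        # to avoid floating-point rounding surprises.
        rank = (95 * len(values) + 99) // 100
        result[start] = values[rank - 1]
    return result


if __name__ == "__main__":
    # Twenty requests in the first minute, one slow request in the second.
    demo = [(i, float(i + 1)) for i in range(20)] + [(61, 500.0)]
    print(window_p95(demo))  # {0.0: 19.0, 60.0: 500.0}
```

A forecasting model of the kind described above would take features from earlier windows as input and be trained to predict the next window's value. Production systems often estimate percentiles from histograms or sketches rather than by sorting raw samples.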

IMPACT These new models offer improved accuracy and efficiency for predicting microservice tail latency, aiding proactive SLO management and system reliability.

RANK_REASON Two academic papers published on arXiv present novel methods for tail latency prediction in microservices.

AI-generated summary · Google Gemini · from 3 sources.

COVERAGE [3]

arXiv cs.AI TIER_1 English(EN) · Yongliang Ding, Qigong Bi, Peng Pu · 2026-04-30 04:00

STLGT: A Scalable Trace-Based Linear Graph Transformer for Tail Latency Prediction in Microservices

arXiv:2604.26422v1 · Announce type: cross · Abstract: Accurate end-to-end tail-latency forecasting is critical for proactive SLO management in microservice systems. However, modeling long-range dependency propagation and non-stationary, bursty workloads while maintaining inference ef…

arXiv cs.AI TIER_1 English(EN) · Peng Pu · 2026-04-29 08:32

STLGT: A Scalable Trace-Based Linear Graph Transformer for Tail Latency Prediction in Microservices

Accurate end-to-end tail-latency forecasting is critical for proactive SLO management in microservice systems. However, modeling long-range dependency propagation and non-stationary, bursty workloads while maintaining inference efficiency at scale remains challenging. We present …

arXiv cs.LG TIER_1 English(EN) · Wenzhuo Qian, Hailiang Zhao, Jiayi Chen, Ziqi Wang, Tianlv Chen, Zhiwei Ling, Xinkui Zhao, Kingsum Chow, Albert Y. Zomaya, Shuiguang Deng · 2026-04-28 04:00

Reliable Microservice Tail Latency Prediction via Decoupled Dual-Stream Learning and Gradient Modulation

arXiv:2508.01635v2 · Announce type: replace · Abstract: Microservice architectures enable scalable cloud-native applications; however, the distributed nature of these systems complicates the maintenance of strict Service Level Objectives. Accurately predicting window-level P95 tail l…
