ReleaseBytes
gcp Google Cloud release notes ·

Cloud Load Balancing: Traffic Duration for Backends

networkinggcpgaengineer
feature

Application Load Balancers now support configuring traffic duration settings (SHORT or LONG) for backends. This allows for better control over how long backends can take to respond to HTTP requests, with an in-flight balancing mode for requests exceeding one second. This feature is now generally available, impacting engineers and architects managing application traffic.

Features (1)
  • Cloud Load Balancing

    Application Load Balancers now support the configuration of a traffic duration setting when you add backends to backend services. You can configure this setting as SHORT or LONG based on the response time needed by backends to complete HTTP requests. Application Load Balancers also support the use of the in-flight balancing mode that lets you configure the load balancer's traffic distribution to supported backends when requests take more than a second to complete. This feature is in General availability .

Read the original announcement →

https://docs.cloud.google.com/release-notes#May_22_2026