Overview
Describes resilient hashing and flow auto-recovery features that maintain stable traffic flows during ECMP path failures by rerouting only affected flows and restoring original distribution upon recovery.
Resilient hashing and flow auto-recovery is a network reliability feature that
-
maintains stable traffic flows during Equal-Cost Multipath (ECMP) path failures by only rerouting traffic affected by the failed path,
-
prevents unnecessary rebalancing of existing flows to new links, and
-
automatically restores original flow distribution when a failed path or server returns to service.
-
ECMP: A routing technique that balances traffic across multiple equal-cost paths to the same destination.
-
Bucket: A logical mapping of flows to paths in a hashing algorithm.
| Feature Name |
Release Name |
Description |
|---|---|---|
| Resilient hashing and flow auto-recovery |
Release 25.4.1 | Introduced in this release on: Fixed Systems (8010 [ASIC: A100])(select variants only*) *This feature is now supported on:
|
| Resilient hashing and flow auto-recovery |
Release 25.1.1 | Introduced in this release on: Fixed Systems (8010 [ASIC: A100])(select variants only*) *This feature is supported on Cisco 8011-4G24Y4H-I routers. |
| Resilient hashing and flow auto-recovery |
Release 24.4.1 |
Introduced in this release on: Fixed Systems (8200 [ASIC: P100], 8700 [ASIC: P100, K100])(select variants only); Modular Systems (8800 [LC ASIC: P100])(select variants only*) You can ensure no packet loss and optimal load distribution across available paths by automatically rerouting data flows during link failures. This feature enhances network reliability by maintaining continuous service and dynamically adjusting to network topology changes without manual intervention. It seamlessly integrates with existing configurations, offering high availability and reducing downtime, thus keeping network operations uninterrupted and efficient. *This feature is supported on:
*Previously this feature was supported on Q200 and Q100. |
Impact of ECMP path failures on traffic flows
Resilient hashing and flow auto-recovery let you selectively override the default equal cost multipath (ECMP) behavior during an ECMP path failure. This feature redirects only the flows on failed links and prevents all existing flows from being rehashed to a new link. It also allows a recovered link or server to be reused for sessions when it becomes available.
Prior to the implementation of resilient hashing and flow auto-recovery feature, ECMP load balances traffic across all available paths to a destination. If one path fails, ECMP rehashes the traffic and selects new next hops for each flow.
For example, if you have three links—link 1, link 2, and link 3—a traffic flow that originally uses link 1 may switch to link 3 after a failure, even if only link 2 fails.
This redistribution of traffic flows does not cause issues in traditional core networks because end-to-end connectivity is maintained and users are not affected. However, in data center environments, load balancing caused by this redistribution can create problems.
In data centers where multiple servers connect through ECMP, rehashing may cause active flows to move, which can reset TCP sessions and disrupt applications.
Benefits of resilient hashing and flow auto-recovery
-
Maintains uninterrupted network operations and high availability
-
Minimizes traffic disruption and session resets during link failures or recoveries, and
-
Supports dynamic adjustment to topology changes without manual intervention.