Cluster Sharding, slow handover during rolling restart

milanvdm · October 15, 2019, 2:29pm

We have the following situation:

We have a cluster of 3 nodes.
The cluster is receiving numerous GET requests, each fetching the information of an entity.
A rolling restart starts.
2 new nodes are spawned (5 nodes in the cluster).
When those 2 new nodes are healthy (based on AkkaManagementHttp), the 2 oldest nodes are shut down with coordinated-shutdown.
1 new node is spawned (4 nodes in the cluster).
The last old node is shut down.
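
To make the sequence easier to follow, here is a minimal sketch that replays the steps above and tracks the cluster size and the oldest member after each step. It is only a model of the membership changes, not Akka code: the function name `rolling_restart` and the node names `old-N` / `new-N` are made up for illustration, and it assumes nodes always leave strictly in join order.

```python
from collections import deque


def rolling_restart(old_count=3, first_batch=2):
    """Replay the restart steps; return (step, cluster size, oldest node) tuples."""
    # Nodes are kept in join order, so nodes[0] is always the oldest member.
    # The node names are hypothetical labels, not real cluster addresses.
    nodes = deque(f"old-{i}" for i in range(1, old_count + 1))
    timeline = []
    spawned = 0

    def record(step):
        timeline.append((step, len(nodes), nodes[0]))

    def spawn(count):
        nonlocal spawned
        for _ in range(count):
            spawned += 1
            nodes.append(f"new-{spawned}")

    def stop_oldest(count):
        for _ in range(count):
            nodes.popleft()

    remaining = old_count - first_batch

    record("initial cluster")
    spawn(first_batch)
    record(f"{first_batch} new nodes spawned")
    stop_oldest(first_batch)
    record(f"{first_batch} oldest nodes shut down")
    spawn(remaining)
    record(f"{remaining} new node(s) spawned")
    stop_oldest(remaining)
    record("last old node(s) shut down")
    return timeline


if __name__ == "__main__":
    for step, size, oldest in rolling_restart():
        print(f"{step:<28} size={size} oldest={oldest}")
```

With the defaults, the cluster size goes 3, 5, 3, 4, 3 and the oldest member changes twice (from `old-1` to `old-3`, then to `new-1`). Since the ShardCoordinator runs as a cluster singleton on the oldest node, each of those changes should correspond to a coordinator move during the restart.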

During this rolling restart, a number of GET requests show 4+ seconds of latency (compared to the usual 30ms).

We know that shutting down the oldest node is not ideal because of the Singleton handovers, but the shutdown order cannot be configured at the moment.
We are using Akka 2.5.19.

Is this delay caused by the ShardCoordinator handover? How can this latency increase be prevented?

shafqatevo · April 5, 2021, 6:22am

Hi @milanvdm, was your problem solved later on? If so, how?
