RoleLeaderChanged appoints 2 leaders during start-up of a node

pvoss · September 23, 2020, 2:22pm

Akka 2.6.8
Java 8

I have found another source of unstable behavior in our production cluster.

The situation is as follows:

I have 2 running nodes, A and B. Node A has been appointed as the leader by evaluating the RoleLeaderChanged event.
Node B (the one that is not the leader) gets restarted.
During start-up, node B gets appointed as the leader via a RoleLeaderChanged event. Node A remains leader during this time and does not get any notifications. Some actors now cause damage because they are running on both nodes.
After a short period of time, node B gets another RoleLeaderChanged event and now recognizes node A as the leader. From then on everything is fine, but the leader on node A cannot repair the damage that node B has caused, because it never learns that there was a second leader for some time.

Here are the relevant log lines from node B. The node is leader for about 5 seconds until it gets the Up status.

2020-09-22T22:08:34.077Z INFO  myown - Handle RoleLeaderChanged, selfAddress=akka://ClusterSystem@100.64.4.57:2551, leaderAddress=akka://ClusterSystem@100.64.4.57:2551, isLeader=true
2020-09-22T22:08:39.477Z INFO  akka.cluster.Cluster - Cluster Node [akka://ClusterSystem@100.64.4.57:2551] - Marking node as REACHABLE [Member(address = akka://ClusterSystem@100.64.0.45:2551, status = Up)].
2020-09-22T22:08:39.478Z INFO  akka.cluster.Cluster - Cluster Node [akka://ClusterSystem@100.64.4.57:2551] - is no longer leader
2020-09-22T22:08:39.479Z INFO  myown - Handle RoleLeaderChanged, selfAddress=akka://ClusterSystem@100.64.4.57:2551, leaderAddress=akka://ClusterSystem@100.64.0.45:2551, isLeader=false

Is this expected behavior? I can certainly evaluate Member.Up events as well, but this makes it much harder to rely on RoleLeaderChanged events. I would have expected that Akka would not send any RoleLeaderChanged events until a decision can be made or, if it does, that it would send them without a leader being set.

patriknw · September 23, 2020, 7:15pm

A common misconception is that there is some kind of leader election that guarantees that there is only one leader at a time. That is not what Akka’s leader is about. See docs for more details.

Leader events are rarely something applications should use. ClusterSingleton is often what should be used instead.
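
To illustrate the point that the leader is not elected, here is a minimal, self-contained sketch in plain Java. It is a simplified model and not Akka's actual implementation: the class `LocalLeaderView` and its methods are hypothetical, and the rule "first reachable member in sorted order of the local view" is an assumption chosen to mirror the behavior described in the Akka docs. It shows how two nodes can each derive a different leader from their own view, which is consistent with the log lines above, where node B stops being leader right after marking node A as reachable.

```java
import java.util.Map;
import java.util.Optional;
import java.util.TreeMap;

// Simplified model (NOT Akka's implementation): each node derives the leader
// from its own local view of the membership. There is no election and no lock.
final class LocalLeaderView {
    // address -> reachable flag, as seen by this node only.
    // TreeMap keeps the addresses in sorted order.
    private final TreeMap<String, Boolean> members = new TreeMap<>();

    // Record (or update) what this node currently believes about a member.
    void observe(String address, boolean reachable) {
        members.put(address, reachable);
    }

    // Assumed rule: the leader is the first reachable member in sorted order.
    Optional<String> leader() {
        return members.entrySet().stream()
                .filter(Map.Entry::getValue)
                .map(Map.Entry::getKey)
                .findFirst();
    }

    public static void main(String[] args) {
        String a = "100.64.0.45:2551"; // node A from the log
        String b = "100.64.4.57:2551"; // node B from the log

        // Node A sees both members as reachable.
        LocalLeaderView viewOfA = new LocalLeaderView();
        viewOfA.observe(a, true);
        viewOfA.observe(b, true);

        // Node B has just restarted and does not yet consider A reachable.
        LocalLeaderView viewOfB = new LocalLeaderView();
        viewOfB.observe(a, false);
        viewOfB.observe(b, true);

        System.out.println("A's view, leader: " + viewOfA.leader());
        System.out.println("B's view, leader: " + viewOfB.leader());

        // Once B marks A as reachable, both views agree again.
        viewOfB.observe(a, true);
        System.out.println("B's view after update, leader: " + viewOfB.leader());
    }
}
```

In this model nothing prevents both views from naming a different leader at the same moment, which is why work that must run exactly once should not be driven by leader events alone.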

pvoss · September 24, 2020, 3:11pm

Thank you, @patriknw, for clarifying. This wasn’t as obvious to me from the documentation, but I have got it now.
