Akka coordinated shutdown is timing out


I am trying to investigate why my akka members are failing to exit the cluster properly.
If I look at their logs, I can see the following:

Coordinated shutdown phase [cluster-exiting] timed out after 15000 milliseconds

Is there a way to investigate why could possibly go wrong at that stage?


That stage is waiting for the node to Exit the cluster. Do you have some DEBUG or event INFO logs of the node that timed out? It could be that there are unreachable nodes in the cluster so that the leader can’t move the node from Leaving to Exiting.

Yes that is definitely the case. It’s hard to know what is the cause of consequence though: we start to have a lot of unreachable nodes, but I had some hope it was because we were not exiting properly.