Reading this documentation, it’s not clear to me what is the difference between akka.remote.transport-failure-detector.heartbeat-interval and akka.cluster.failure-detector.heartbeat-interval.
I am trying to find the proper settings for a big (>500) akka cluster.
transport-failure-detector is used for finding stale connections and restart the. That should rarely be touched.
cluster.failure-detector is what you can adjust. Don’t increase the heartbeat interval (that shouldn’t generate much messages). Increasing acceptable-heartbeat-pause might be needed.