
Cluster Unavailable For a Long Period After a Rolling Restart Test (Doc ID 1670963.1)

Last updated on MAY 19, 2023

Applies to:

Oracle Coherence - Version 3.7.1 to 3.7.1 [Release AS10g]
Information in this document applies to any platform.

Goal

During a rolling restart test, the cluster became unresponsive when the senior node for the DistributedCache service was restarted, and a large number of UDP messages were observed in tcpdump captures. Both the TCP Ring and IpMonitor death detection mechanisms were enabled with their default timeouts. The cluster became available again after 5 minutes, which matches the default UDP packet delivery timeout.

What is the root cause and how can we avoid this situation?
 

Solution

To view full details, sign in with your My Oracle Support account.




