How to restart all compute nodes in a Exalogic Virtual Pool to recover operation of compute nodes where "D" state processes are running
(Doc ID 1950446.1)
Last updated on MARCH 12, 2021
Applies to:Oracle Exalogic Elastic Cloud Software - Version 126.96.36.199.0 and later
Exalogic Elastic Cloud X4-2 Hardware - Version X4 to X4 [Release X4]
Oracle Virtual Server (x86-64)
While running an Exalogic Virtual system you may, in exceptional circumstances, encounter a situation where the majority of the compute nodes in a server pool have become unhealthy due to processes from key services such as "o2cb", "ocsf2" or "ovs-agent" reaching a deep "Disk Sleep" state (processes list with a status of "D" in "ps -ef" output). When this occurs it can become necessary to shutdown and reboot all nodes in the affected pool to restore the system to full operation. This note illustrates the process that should be followed to shutdown and restart all nodes in a pool without exposing additional "HA" related complications that can impact the subsequent restart of vServers after the nodes have been rebooted.
To view full details, sign in with your My Oracle Support account.
Don't have a My Oracle Support account? Click to get started!
In this Document