[PCA/OVM] Compute Nodes / OVM Servers Self-fence (reboot via kernel panic) After Extended Failure of a Clustered Storage Device
(Doc ID 2423328.1)
Last updated on JULY 14, 2020
Applies to:
Private Cloud Appliance - Version 2.0.5 and later
Oracle VM - Version 3.2.1 and later
Oracle Cloud Infrastructure - Version N/A and later
Linux x86-64
Symptoms
Compute nodes / OVM Servers suffer a kernel panic after an extended loss of access to a non-heartbeat clustered storage device.
If all paths to the storage are lost, and multipath cannot access any of them, OCFS2 invokes a kernel panic:
The dm-24 device on which the error above was raised is a multipath device of a repository, not the global heartbeat device (the OVM "pool filesystem").
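To confirm which storage a kernel name such as dm-24 refers to, the device-mapper entries under sysfs can be inspected. The following is a minimal sketch, not part of the original note: the function name describe_dm and its sysfs_root parameter are hypothetical, and it assumes the standard Linux sysfs layout (/sys/block/dm-N/dm/name, /sys/block/dm-N/dm/uuid, /sys/block/dm-N/slaves) and that multipath maps carry a UUID beginning with "mpath-".

```python
from pathlib import Path


def describe_dm(kernel_name, sysfs_root="/sys/block"):
    """Summarise a device-mapper device such as 'dm-24' from sysfs.

    describe_dm and sysfs_root are illustrative names, not an Oracle tool.
    """
    base = Path(sysfs_root) / kernel_name
    # Device-mapper name, as also shown under /dev/mapper
    name = (base / "dm" / "name").read_text().strip()
    # Assumption: multipath maps have a UUID with the "mpath-" prefix
    uuid = (base / "dm" / "uuid").read_text().strip()
    # Underlying path devices (for multipath, typically sdX entries)
    paths = sorted(p.name for p in (base / "slaves").iterdir())
    return {
        "name": name,
        "uuid": uuid,
        "is_multipath": uuid.startswith("mpath-"),
        "paths": paths,
    }


if __name__ == "__main__":
    try:
        print(describe_dm("dm-24"))
    except OSError as exc:
        # The device does not exist on this host, or sysfs is unavailable
        print(f"cannot inspect dm-24: {exc}")
```

The reported name can then be compared with the devices backing the repositories and the pool filesystem to tell which of them the error concerns.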
Changes
In the example reported, testing was being performed to establish the consequences of a complete failure of a storage device. The issue had previously occurred in a production PCA system when an external storage device failed completely and could not even fail over to another path.
Cause