Oracle Linux: Set an Appropriate Value of OCFS2 Disk Heartbeat to Prevent Nodes Fencing During HA Storage System Controllers Failover (Doc ID 1966208.1)

Last updated on SEPTEMBER 15, 2016

Applies to:

Linux OS - Version Oracle Linux 5.5 and later
Oracle Exalytics Software - Version 1.0.0.5.0 to 1.0.0.5.0 [Release 1.0]
Linux x86
Linux x86-64

Goal

With Storage System Controllers in HA (High Availability) pair, connection to LUNs might be temporarily lost during a Controller failover event, which can take up to five minutes till fully recovery.

Note: The terminology "Storage System Controllers" might vary to different Storage System vendors in the same definition, e.g. heads, CPU modules.


An unforeseen value of OCFS2 Disk Heartbeat might incidentally cause nodes fencing.


To avoid o2hb fencing during a Controller failover event, it is recommended to enlarge the value of O2CB_HEARTBEAT_THRESHOLD in accordance to Storage System vendors (EMC, NetAPP, HP, ZFSSA, etc.) recommendation, though in our experiences five minutes has handled most of these events.

Solution

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms