Whole RHCS Cluster went Down After Network Switch Failure

(Doc ID 2087542.1)

Last updated on DECEMBER 17, 2015

Applies to:

Linux OS - Version Oracle Linux 5.1 to Oracle Linux 5.10 [Release OL5U1 to OL5U10]
Linux x86
Linux x86-64

Symptoms

9 node RHCS Cluster went down after Network Switch failure:

Each node has same setup for bond0

During the failure each node was having issue with eth0 in bond0, whenever eth0 port was visible with port status down  bond0 did failover to backup slave eth3, whenever link was restored bond0 again activated eth0 as a primary slave.

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms