Solaris Cluster IPMP Group Failure Impact when in.mpathd log "state transition from OK to DOWN" (Doc ID 1006916.1)

Last updated on NOVEMBER 02, 2016

Applies to:

Solaris Cluster - Version 3.1 to OSC 4.3 [Release 3.1 to 4.3]
Oracle Solaris on SPARC (32-bit)
Oracle Solaris on x86-64 (64-bit)
Oracle Solaris on x86 (32-bit)
Oracle Solaris on SPARC (64-bit)

Symptoms

Short network pause causes IPMP group to fail which triggers resource group failover in Solaris Cluster.

IPMP failure can be observed in /var/adm/messages as follows:

Aug 22 12:59:22 sun2 in.mpathd[1966]: [ID 594170 daemon.error] NIC failure detected on ce4 of group sun2-ipmp1
Aug 22 12:59:22 sun2 in.mpathd[1966]: [ID 832587 daemon.error] Successfully failed over from NIC ce4 to NIC ce0
Aug 22 13:00:29 sun2 in.mpathd[1966]: [ID 168056 daemon.error] All Interfaces in group sun2-ipmp1 have failed
Aug 22 13:00:29 sun2 Cluster.PNM: [ID 890413 daemon.notice] sun2-ipmp1: state transition from OK to DOWN.



But then it came back 38 seconds later:

Aug 22 13:01:00 sun2 Cluster.PNM: [ID 890413 daemon.notice] sun2-ipmp1: state transition from DOWN to OK.



But at the same time the resource group failover already triggered:
(Notice that it happened 9 seconds before IPMP group came back OK)

Aug 22 13:00:51 sun1 Cluster.RGM.rgmd: [ID 529407 daemon.notice] resource group sun-rg state on node sun2 change to RG_PENDING_OFFLINE

 

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms