A network resource becomes OFFLINE on Fujitsu GLS environment (Doc ID 2013699.1)

Last updated on JUNE 17, 2016

Applies to:

Oracle Database - Enterprise Edition - Version 11.2.0.4 and later
Information in this document applies to any platform.

Symptoms

A network resource unexpectedly becomes offline during GLS NIC failover.

When this issue occurs, following log output is observed from the orarootagent.log.

 

[crsd orarootagent_root.log]

19:17:09.795: [ora.net1.network][check] getActiveInterface enter loop timeout = 30{ <--(***) enter timeout count
19:17:10.796: [ora.net1.network][check] Checking if net0 Interface is fine
19:17:10.796: [ora.net1.network][check] Interface Name = net0
19:17:10.796: [ora.net1.network][check] Primary IP = XXX.XXX.XXX.XXX
19:17:10.817: [ default][155]ICMP Ping from XXX.XXX.XXX.XXX to UNKNOWN
:
19:17:10.823: [ora.net1.network][check] ICMP Ping Exception in sCheckLink
19:17:10.823: [ora.net1.network][check] Checking if net4 Interface is fine
19:17:10.823: [ora.net1.network][check] Exception in checkNetInterface
ignoring
19:17:10.823: [ora.net1.network][check] Interface Name = net0
19:17:10.823: [ora.net1.network][check] Primary IP = XXX.XXX.XXX.XXX
19:17:10.845: [ default][155]ICMP Ping from XXX.XXX.XXX.XXX to UNKNOWN

:
:
:
19:17:20.229: [ora.net1.network][check] ICMP Ping Exception in sCheckLink
19:17:20.229: [ora.net1.network][check] getActiveInterface::checkLink = 0
19:17:20.229: [ora.net1.network][check] getActiveInterface exit } return 0 <--(***) end the count after about 10 seconds

19:17:20.230: [ AGFW][9] ora.net1.network XXXXX 1 state changed
from: ONLINE to: OFFLINE

 

Changes

This issue occurs if all following 4 conditions are applicable.

1) On an environment using Fujitsu PRIMECLUSTER GLS
2) Patch:18143836 is applied
3) NIC managed by GLS is down and it has not reached to GLS failover timeout yet
4) NIC managed by GLS is down over 10 seconds

* messages output example

19:17:09 XXX mac: [ID 486395 kern.info] NOTICE: igb0 link down <-- (***) NIC link down

:

19:17:31 XXX hanet: [ID 532911 user.error] ERROR: 87000: polling status changed: Primary polling failed. (net0,target=XXX.XXX.XXX.XXX)
19:17:31 XXX hanet: [ID 910662 user.info] INFO: 89100: logical IP address is inactivated. (XXX.XXX.XXX.XXX)
19:17:31 XXX hanet: [ID 590586 user.info] INFO: 88900: interface is inactivated. (net0)
19:17:32 XXX hanet: [ID 790143 user.info] INFO: 88800: interface is activated. (net4) <-- (***) takes more than 10 secs from NIC link down for failover 

(*) Patch:18143836 is included in following patch sets
  - PSU 11.2.0.4.6 and later
  - PSR 12.1.0.2

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms