My Oracle Support Banner

Oracle ZFS Storage Appliance: FMA 'pcie-fatal' event seen after upgrade of Infiniband CX-2 firmware on ZS3-ES (Doc ID 2082722.1)

Last updated on JULY 24, 2018

Applies to:

Oracle ZFS Storage ZS3-4 - Version All Versions and later
Oracle ZFS Storage ZS3-BA - Version All Versions and later
Oracle ZFS Storage ZS3-2 - Version All Versions and later
Oracle ZFS Storage ZS4-4 - Version All Versions and later
7000 Appliance OS (Fishworks)

Symptoms

Upgrading the IB CX-2 firmware from 2.7.8130 to 2.11.2010 resulted in PCIE fatal errors causing /SYS, /SYS/MB, /SYS/MB/RISER1/PCIE1 and /SYS/MB/P1 to be faulted on a Oracle ZFS Storage ZS3-ES.

However, if we clear the faults, they will stay cleared until we do another upgrade.

 

Downgrade from 2.11.2010 back to 2.7.8130 or reflash (same version) do not incur the problem.

 

FMA event:

------------------- ------------------------------------ -------------- --------
Time                UUID                                 msgid          Severity
------------------- ------------------------------------ -------------- --------
2015-09-24/01:01:27 94130e54-45cb-6a19-d61e-fa52e9fc25c4 SPX86-8003-RR  Critical

Problem Status    : open
Diag Engine       : fdd 1.0
System
  Manufacturer   : Oracle Corporation
  Name           : Exalogic X4-2
  Part_Number    : Exalogic X4-2
  Serial_Number  : AK00260761

System Component
  Manufacturer   : Oracle Corporation
  Name           : SUN FIRE X4170 M3
  Part_Number    : 7078183
  Serial_Number  : 1441NML0G2

----------------------------------------
Suspect 1 of 3
  Fault class  : fault.io.intel.iio.pcie-fatal
  Certainty    : 33%
  Affects      : /SYS/MB/RISER1/PCIE1
  Status       : faulted

  FRU
     Status            : faulty
     Location          : /SYS/MB/RISER1/PCIE1
     Chassis
        Manufacturer   : Oracle Corporation
        Name           : SUN FIRE X4170 M3
        Part_Number    : 7078183
        Serial_Number  : 1441NML0G2
----------------------------------------
Suspect 2 of 3
  Fault class  : fault.io.intel.iio.pcie-fatal
  Certainty    : 33%
  Affects      : /SYS/MB/P1
  Status       : faulted

  FRU
     Status            : faulty
     Location          : /SYS/MB/P1
     Name              : Intel(R) Xeon(R) CPU E5-2658 0 @ 2.10GHz
     Part_Number       : 060D
     Chassis
        Manufacturer   : Oracle Corporation
        Name           : SUN FIRE X4170 M3
        Part_Number    : 7078183
        Serial_Number  : 1441NML0G2
----------------------------------------
Suspect 3 of 3
  Fault class  : fault.io.intel.iio.pcie-fatal
  Certainty    : 33%
  Affects      : /SYS/MB
  Status       : faulted

  FRU
     Status            : faulty
     Location          : /SYS/MB
     Manufacturer      : MiTAC International Corporation
     Name              : MOTHER BOARD ASSEMBL
     Part_Number       : 7048712
     Revision          : 06
     Serial_Number     : 489089M+1434U92M3W
     Chassis
        Manufacturer   : Oracle Corporation
        Name           : SUN FIRE X4170 M3
        Part_Number    : 7078183
        Serial_Number  : 1441NML0G2

Description : An Integrated I/O (II0) fatal error in downstream PCIE device
             has occurred.

Response    : The service-required LED on the chassis will be illuminated.

 

Cause

To view full details, sign in with your My Oracle Support account.

Don't have a My Oracle Support account? Click to get started!


In this Document
Symptoms
Cause
Solution
References

My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.