SPARC M8 and M7 server PCIE faults may identify an incorrect FRU

(Doc ID 2391817.1)

Last updated on APRIL 26, 2018

Applies to:

SPARC M7-8 - Version All Versions to All Versions [Release All Releases]
SPARC M7-16 - Version All Versions to All Versions [Release All Releases]
Oracle SuperCluster M7 Hardware - Version All Versions to All Versions [Release All Releases]
SPARC M8-8 - Version All Versions to All Versions [Release All Releases]
Oracle SuperCluster M8 Hardware - Version All Versions to All Versions [Release All Releases]
Oracle Solaris on SPARC (64-bit)

Symptoms

PCIE fault reports on SPARC M8 and M7 servers with Solaris 11.3 SRU24 and higher may indicate an incorrect FRU in 'fmadm faulty' output.  Solaris FMA will list the IOS root port number rather than identifying the PCIE card slot. This may occur for all PCIE-8000- diagnosis codes.

Example,

# fmadm faulty

--------------- ------------------------------------  -------------- ---------
TIME            EVENT-ID                              MSG-ID         SEVERITY
--------------- ------------------------------------  -------------- ---------
Feb 10 12:27:28 f8426610-1571-4fa9-9785-990644296c2c  PCIEX-8000-0A  Critical

Problem Status    : open
Diag Engine       : eft / 1.16
System
    Manufacturer  : unknown
    Name          : unknown
    Part_Number   : unknown
    Serial_Number : unknown
    Host_ID       : 84fa7000

----------------------------------------
Suspect 1 of 1 :
   Problem class : fault.io.pciex.device-interr
   Certainty   : 100%
   Affects     : dev:////pci@303/pci@1/SUNW,emlxs@0,f
   Status      : faulted but still in service

   FRU
     Status           : faulty
     Location         : "/SYS/CMIOU0/IOH/IOS3/RP0"             <=======root port improperly indicated
     Manufacturer     : unknown
     Name             : unknown
     Part_Number      : unknown
     Revision         : unknown
     Serial_Number    : unknown
     Chassis
        Manufacturer  : Oracle Corporation
        Name          : SPARC M7-8
        Part_Number   : 33972225+1+1
        Serial_Number : AK00361296

Description : A problem was detected for a PCIEX device.

Response    : One or more device instances may be disabled

Impact      : Loss of services provided by the device instances associated with
              this fault

Action      : Use 'fmadm faulty' to provide a more detailed view of this event.
              Please refer to the associated reference document at
              http://support.oracle.com/msg/PCIEX-8000-0A for the latest
              service procedures and policies regarding this diagnosis.

 

 

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms