
Oracle ZFS Storage Appliance: A Thermtrip Signal Has Occurred on CPU, SPX86-8003-K5 (Doc ID 2745736.1)

Last updated on MARCH 30, 2021

Applies to:

Oracle ZFS Storage Appliance Racked System ZS4-4 - Version All Versions and later
Oracle ZFS Storage ZS4-4 - Version All Versions and later
Oracle ZFS Storage ZS5-2 - Version All Versions and later
Oracle ZFS Storage Appliance Racked System ZS5-2 - Version All Versions and later
Oracle ZFS Storage ZS5-4 - Version All Versions and later
Information in this document applies to any platform.

Symptoms

A thermtrip signal has occurred on a server component (CPU).

The Complex Programmable Logic Device (CPLD) on the motherboard can detect a
thermal shutdown from either the processor or the Platform Controller Hub (PCH), also known as the Patsburg chip.

If the thermal shutdown signal came from a processor:

   The service-required LED for the affected processor will be illuminated.
   The Service Processor will display the status of the processor LED indicator as: "/SYS/MB/Px = On"

   The temperature fault LED on the server will be illuminated.
   The Service Processor will display the status of the temperature LED indicator as: "/SYS/TEMP_FAULT = On"

   The chassis-wide service-required LED on the server will be illuminated.
   The Service Processor will display the status of the chassis-wide LED indicator as: "/SYS/SERVICE = On"

If the thermal shutdown signal came from the Patsburg PCH:

   The temperature fault LED on the server will be illuminated.
   The Service Processor will display the status of the temperature LED indicator as: "/SYS/TEMP_FAULT = On"

   The chassis-wide service-required LED on the server will be illuminated.
   The Service Processor will display the status of the chassis-wide LED indicator as: "/SYS/SERVICE = On"
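
The two indicator patterns above can be told apart mechanically. The following is a minimal sketch, not an Oracle tool: it assumes the indicator values have already been collected from the Service Processor into a Python dictionary keyed by indicator path. The function name classify_thermtrip, the dictionary layout, and the "none" result are hypothetical and used only for illustration.

```python
import re


def classify_thermtrip(indicators):
    """Classify the likely thermtrip source from LED indicator states.

    `indicators` is a hypothetical mapping such as
    {"/SYS/MB/P0": "On", "/SYS/TEMP_FAULT": "On", "/SYS/SERVICE": "On"}.
    Returns a tuple (source, processor_paths) where source is
    "processor", "pch", or "none".
    """
    def is_on(path):
        return indicators.get(path, "Off").strip().lower() == "on"

    # Both patterns described in this article require TEMP_FAULT and SERVICE.
    if not (is_on("/SYS/TEMP_FAULT") and is_on("/SYS/SERVICE")):
        return ("none", [])

    # A lit per-processor LED (/SYS/MB/Px) points at that processor.
    cpus = sorted(
        path for path in indicators
        if re.fullmatch(r"/SYS/MB/P\d+", path) and is_on(path)
    )
    if cpus:
        return ("processor", cpus)

    # No processor LED lit: consistent with the PCH pattern.
    return ("pch", [])
```

A result of "pch" only means that the indicators match the PCH pattern described here; it should still be confirmed against the fault report before any part is replaced.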

Automated Response

The service-required LEDs for the affected component, TEMP_FAULT, and chassis will be illuminated.

Impact

The server will be powered down immediately.

Suggested Action for System Administrator

Refer to the following document for the latest procedures for displaying event content in preparation
for submitting a service request and applying any post-repair actions that may be required.

PSH Procedural Article for ILOM-Based Diagnosis (Doc ID 1155200.1)

The storage appliance will also report:

   The CPU in a faulted state under Hardware.

   A faulty status under Maintenance Problems.

Example Maintenance Problems entry:

Problem Status            : open [proxied]
Diag Engine               : fdd / 1.0
System
    Manufacturer          : unknown
    Name                  : -
    Part_Number           : unknown
    Serial_Number         : unknown

System Component
    Manufacturer          : Oracle-Corporation
    Name                  : SUN FIRE X4170 M3
    Part_Number           : unknown
    Serial_Number         : unknown
    Firmware_Manufacturer : unknown
    Firmware_Version      : unknown
    Firmware_Release      : unknown
    Host_ID               : (null)
    Server_Name           : -
----------------------------------------
Suspect 1 of 1 :
   Problem class : "fault.cpu.intel.thermtrip"
   Certainty   : 100%
   Affects     : "hc://:product-id=SUN FIRE X4170 M3:product-sn=:chassis-id=:server-id=tseis01nasx01-ilom/chassis=0/motherboard=0/chip=0"
   Status      : "faulted but still in service"

   FRU
     Status           : "faulty"
     Location         : "/SYS/MB/P0"
     Manufacturer     : unknown
     Name             : unknown
     Part_Number      : 060D
     Revision         : unknown
     Serial_Number    : unknown
     Chassis
        Manufacturer  : unknown
        Name          : SUN FIRE X4170 M3
        Part_Number   : unknown
        Serial_Number :

Description : "A thermtrip signal has occurred on a server component."

Response    : The service-required LEDs for the affected component, TEMP_FAULT,
              and chassis will be illuminated.
Impact      : The server will be powered down immediately.

Action      : Please refer to the associated reference document at
              http://support.oracle.com/msg/SPX86-8003-K5 for the latest
              service procedures and policies regarding this diagnosis.
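
When preparing a service request, it can help to pull the key fields out of a report like the one above. The following is a minimal sketch that assumes the report text has been saved as a string in the same layout as the example. The function name parse_suspects and the SAMPLE excerpt are illustrative only and are not part of any Oracle tooling.

```python
import re

# Excerpt in the same layout as the example report (illustrative only).
SAMPLE = '''
Suspect 1 of 1 :
   Problem class : "fault.cpu.intel.thermtrip"
   Certainty   : 100%
   Status      : "faulted but still in service"

   FRU
     Status           : "faulty"
     Location         : "/SYS/MB/P0"
'''

FIELD = re.compile(r'\s*(Problem class|Certainty|Location)\s*:\s*"?([^"]*)"?\s*$')


def parse_suspects(report):
    """Return one dict per 'Suspect N of M' block with selected fields."""
    suspects = []
    current = None
    for line in report.splitlines():
        if re.match(r"\s*Suspect \d+ of \d+", line):
            current = {}
            suspects.append(current)
            continue
        if current is None:
            continue  # ignore header lines before the first suspect
        match = FIELD.match(line)
        # Keep the first occurrence of each field within a suspect block.
        if match and match.group(1) not in current:
            current[match.group(1)] = match.group(2).strip()
    return suspects


if __name__ == "__main__":
    for suspect in parse_suspects(SAMPLE):
        if suspect.get("Problem class") == "fault.cpu.intel.thermtrip":
            print("Thermtrip suspect at", suspect.get("Location"))
```

Only the problem class, certainty, and FRU location are extracted, since these identify the fault and the affected part; other fields can be added to the pattern in the same way.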

Changes

 "CPU failed."

Cause
