Oracle ZFS Storage Appliance A Thermtrip Signal Has Occurred On CPU SPX86-8003-K5
(Doc ID 2745736.1)
Last updated on SEPTEMBER 30, 2022
Applies to:
Oracle ZFS Storage Appliance Racked System ZS4-4 - Version All Versions and laterOracle ZFS Storage ZS4-4 - Version All Versions and later
Oracle ZFS Storage ZS5-2 - Version All Versions and later
Oracle ZFS Storage Appliance Racked System ZS5-2 - Version All Versions and later
Oracle ZFS Storage ZS5-4 - Version All Versions and later
Information in this document applies to any platform.
Symptoms
A thermtrip signal has occurred on a server component. CPU
- The Complex Programmable Logic Device (CPLD) on motherboard can detect a
thermal shutdown from either the processor or Platform Controller Hub (PCH), aka Platsburg chip.
If a thermal shutdown signal came from a processor;
The service-required LED for the affected processor will be illuminated.
Service Processor will display status of processor LED indicator as: "/SYS/MB/Px= On"
The temperature fault LED on the server will be illuminated.
Service Processor will display status of temperature LED indicator as: "/SYS/TEMP_FAULT= On"
The chassis-wide service required LED on the server will be illuminated.
Service Processor will display status of chassis-wide LED indicator as: " /SYS/SERVICE = On"
If a thermal shutdown signal came from the Platsburg PCH chip;
The temperature fault LED on the server will be illuminated.
Service Processor will display status of temperature LED indicator as: "/SYS/TEMP_FAULT= On"
The chassis-wide service required LED on the server will be illuminated.
Service Processor will display status of chassis-wide LED indicator as: " /SYS/SERVICE = On"
Automated Response
- The service-required LED's for the affected component, TEMP_FAULT, and chassis will be illuminated.
Impact
- The server will be powered down immediately.
Suggested Action for System Administrator
Refer to the following document for the latest procedures for displaying event content in preparation
for submitting a service request and applying any post-repair actions that may be required.
PSH Procedural Article for ILOM-Based Diagnosis (Doc ID 1155200.1)
Storage Appliance will also report
Hardware CPU as faulted state
Maintenance problems faulty status.
Example Maintenance Problems Below
Problem Status : open [proxied]
Diag Engine : fdd / 1.0
System
Manufacturer : unknown
Name : -
Part_Number : unknown
Serial_Number : unknown
System Component
Manufacturer : Oracle-Corporation
Name : SUN FIRE X4170 M3
Part_Number : unknown
Serial_Number : unknown
Firmware_Manufacturer : unknown
Firmware_Version : unknown
Firmware_Release : unknown
Host_ID : (null)
Server_Name : -
----------------------------------------
Suspect 1 of 1 :
Problem class : "fault.cpu.intel.thermtrip"
Certainty : 100%
Affects : "hc://:product-id=SUN FIRE X4170 M3:product-sn=:chassis-id=:server-id=tseis01nasx01-ilom/chassis=0/motherboard=0/chip=0"
Status : "faulted but still in service"
FRU
Status : "faulty"
Location : "/SYS/MB/P0"
Manufacturer : unknown
Name : unknown
Part_Number : 060D
Revision : unknown
Serial_Number : unknown
Chassis
Manufacturer : unknown
Name : SUN FIRE X4170 M3
Part_Number : unknown
Serial_Number :
Description : "A thermtrip signal has occurred on a server component."
Response : The service-required LEDs for the affected component, TEMP_FAULT,
and chassis will be illuminated.
Impact : The server will be powered down immediately.
Action : Please refer to the associated reference document at
http://support.oracle.com/msg/SPX86-8003-K5 for the latest
service procedures and policies regarding this diagnosis.
Changes
"CPU failed."
Cause
To view full details, sign in with your My Oracle Support account. |
|
Don't have a My Oracle Support account? Click to get started! |
In this Document
Symptoms |
Changes |
Cause |
Solution |
References |