Sun SPARC[TM] Enterprise M4000/M5000 - MBU_B and MEMB Being Faulted With SCF-8004-8X, SCF-8000-1D, and SCF-8005-MJ Errors.
(Doc ID 1296435.1)
Last updated on SEPTEMBER 04, 2023
Applies to:
Sun SPARC Enterprise M4000 Server - Version All Versions to All Versions [Release All Releases]
Sun SPARC Enterprise M5000 Server - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.
Symptoms
The showlogs monitor output contains the following errors:
Jan 20 14:59:35 <HOSTNAME> Warning: /UNSPECIFIED:SCF:spurious unit interrupt
Jan 20 15:00:00 <HOSTNAME> last message repeated 11 times
Jan 20 15:01:53 <HOSTNAME> Alarm: /MBU_B/MEMB#5:ANALYZE:MAC detected clock fatal failure
Jan 20 15:02:41 <HOSTNAME> Warning: /MBU_B:SCF:SC test error
Jan 20 15:02:57 <HOSTNAME> Warning: /MBU_B:SCF:SC test error
Jan 20 15:03:16 <HOSTNAME> Warning: /MBU_B:SCF:SC test error
Jan 20 15:03:17 <HOSTNAME> Warning: /MBU_B:SCF:SC test error
FMA will record these same events as:
Jan 20 20:59:48.7163 <UUID> SCF-8004-8X
Jan 20 21:01:46.3456 <UUID> SCF-8000-1D
Jan 20 21:02:26.2880 <UUID> SCF-8005-MJ
or
Jan 20 20:59:48.7163 ereport.chassis.SPARC-Enterprise.asic.cpu.power.intr-fail
Jan 20 21:01:46.3456 ereport.chassis.SPARC-Enterprise.if.fe-asic-clk
Jan 20 21:02:26.2880 ereport.chassis.SPARC-Enterprise.asic.sc.test
The key error is SCF-8000-1D, which indicates a clock distribution error.
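When the FMA output has been saved to a text file, the first signature can be checked mechanically. The following is a minimal Python sketch and not part of any XSCF or Oracle tooling: the names CLOCK_SIGNATURE, find_msg_ids, and matches_clock_signature are hypothetical, and the sketch assumes MSG-IDs appear in the saved text in the SCF-XXXX-XX form shown above.

```python
import re

# MSG-IDs taken from the first symptom listing in this document.
CLOCK_SIGNATURE = {"SCF-8004-8X", "SCF-8000-1D", "SCF-8005-MJ"}

# Assumed MSG-ID shape: "SCF-", four characters, "-", two characters.
MSG_ID_RE = re.compile(r"\bSCF-[0-9A-Z]{4}-[0-9A-Z]{2}\b")


def find_msg_ids(text):
    """Return the set of SCF MSG-IDs that appear anywhere in text."""
    return set(MSG_ID_RE.findall(text))


def matches_clock_signature(text):
    """Return True only if all three MSG-IDs of the signature are present."""
    return CLOCK_SIGNATURE <= find_msg_ids(text)
```

A match means only that the saved output contains all three MSG-IDs. It does not confirm the root cause by itself.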
Additional failure scenario:
On rare occasions, the 'clock fatal failure' message is not logged and the failure signature below is seen instead. The same solution still applies.
Showlogs monitor:
Nov 5 20:56:01 <HOSTNAME> Alarm: /MBU_B/MEMB#0,/MBU_B:SCF:Critical low voltage error(detector=187)
Nov 5 20:56:11 <HOSTNAME> Alarm: /MBU_B/MEMB#1,/MBU_B:SCF:Critical low voltage error(detector=187)
Nov 5 20:56:32 <HOSTNAME> Alarm: /MBU_B/MEMB#2,/MBU_B:SCF:Critical low voltage error(detector=187)
Nov 5 20:56:45 <HOSTNAME> Alarm: /MBU_B/MEMB#3,/MBU_B:SCF:Critical low voltage error(detector=187)
Nov 5 20:56:50 <HOSTNAME> Warning: /MBU_B:SCF:Abnormal reaction of LSI (compare)
Nov 5 20:57:08 <HOSTNAME> Alarm: /MBU_B/MEMB#4,/MBU_B:SCF:Critical low voltage error(detector=187)
Nov 5 20:57:13 <HOSTNAME> Warning: /MBU_B:SCF:Abnormal reaction of LSI (compare)
Nov 5 20:57:16 <HOSTNAME> Warning: /MBU_B:SCF:Abnormal reaction of LSI (compare)
The following FMA MSG-IDs are seen:
UUID: <UUID> MSG-ID: SCF-8004-3Y
UUID: <UUID> MSG-ID: SCF-8002-K2
showstatus:
* MBU_B Status:Faulted;
* MEMB#0 Status:Faulted;
* MEMB#1 Status:Faulted;
* MEMB#2 Status:Faulted;
* MEMB#3 Status:Faulted;
* MEMB#4 Status:Faulted;
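To cross-check the faulted MEMB units in showstatus against the monitor log, the low voltage alarms can be extracted from a saved copy of the showlogs monitor output. This is a rough Python sketch under stated assumptions: the function name low_voltage_membs is hypothetical, and the pattern assumes the alarm lines have exactly the format shown in the excerpt above.

```python
import re

# Pattern modeled on the alarm lines quoted in this document, for example:
# "Alarm: /MBU_B/MEMB#0,/MBU_B:SCF:Critical low voltage error(detector=187)"
LOW_VOLTAGE_RE = re.compile(
    r"Alarm: /MBU_B/MEMB#(\d+),/MBU_B:SCF:Critical low voltage error"
)


def low_voltage_membs(log_text):
    """Return the sorted MEMB numbers that logged a critical low voltage alarm."""
    return sorted({int(m.group(1)) for m in LOW_VOLTAGE_RE.finditer(log_text)})
```

The returned numbers can then be compared by hand with the MEMB#n entries that showstatus reports as Faulted.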
Changes
Cause