Sun SPARC[TM] Enterprise M4000/M5000 - MBU_B and MEMB Being Faulted With SCF-8004-8X, SCF-8000-1D, and SCF-8005-MJ Errors. (Doc ID 1296435.1)

Last updated on SEPTEMBER 04, 2023

Applies to:

Sun SPARC Enterprise M4000 Server - Version All Versions to All Versions [Release All Releases]
Sun SPARC Enterprise M5000 Server - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.

Symptoms

The output of showlogs monitor shows the following errors:

Jan 20 14:59:35 <HOSTNAME> Warning: /UNSPECIFIED:SCF:spurious unit interrupt
Jan 20 15:00:00 <HOSTNAME> last message repeated 11 times
Jan 20 15:01:53 <HOSTNAME> Alarm: /MBU_B/MEMB#5:ANALYZE:MAC detected clock fatal failure
Jan 20 15:02:41 <HOSTNAME> Warning: /MBU_B:SCF:SC test error
Jan 20 15:02:57 <HOSTNAME> Warning: /MBU_B:SCF:SC test error
Jan 20 15:03:16 <HOSTNAME> Warning: /MBU_B:SCF:SC test error
Jan 20 15:03:17 <HOSTNAME> Warning: /MBU_B:SCF:SC test error

FMA will record these same events as:

Jan 20 20:59:48.7163 <UUID> SCF-8004-8X
Jan 20 21:01:46.3456 <UUID> SCF-8000-1D
Jan 20 21:02:26.2880 <UUID> SCF-8005-MJ

or
Jan 20 20:59:48.7163 ereport.chassis.SPARC-Enterprise.asic.cpu.power.intr-fail
Jan 20 21:01:46.3456 ereport.chassis.SPARC-Enterprise.if.fe-asic-clk
Jan 20 21:02:26.2880 ereport.chassis.SPARC-Enterprise.asic.sc.test

The error to note is SCF-8000-1D, which is a clock distribution error.
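
When a large fmdump or showlogs capture has to be triaged, the check for these MSG-IDs can be scripted. The following is a minimal sketch, not an Oracle tool: the function name classify_fma_output, the labels "clock" and "low-voltage", and the regular expression for the MSG-ID shape are assumptions inferred from the IDs quoted in this document. A match is only a hint that this document may apply.

```python
import re

# MSG-IDs quoted in this document. Only SCF-8000-1D is singled out as the
# error to note for the clock signature; the other two are context.
CLOCK_KEY_ID = "SCF-8000-1D"
LOW_VOLTAGE_IDS = {"SCF-8004-3Y", "SCF-8002-K2"}

# Assumed shape of an SCF MSG-ID (inferred from the IDs above, not from a spec).
MSG_ID_RE = re.compile(r"\bSCF-\d{4}-[0-9A-Z]{2}\b")


def classify_fma_output(text):
    """Return (set of MSG-IDs found, list of matching signature labels)."""
    ids = set(MSG_ID_RE.findall(text))
    matches = []
    if CLOCK_KEY_ID in ids:
        matches.append("clock")
    # The additional failure scenario lists both IDs, so require both here.
    if LOW_VOLTAGE_IDS <= ids:
        matches.append("low-voltage")
    return ids, matches
```

Calling classify_fma_output on the text of a saved log returns every SCF MSG-ID found plus the labels of the signatures that matched, so unrelated IDs remain visible for review.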

Additional failure scenario:

On rare occasions, the 'clock fatal failure' message is not seen and the following failure signature appears instead. The same solution still applies.

Showlogs monitor:

Nov  5 20:56:01 <HOSTNAME> Alarm: /MBU_B/MEMB#0,/MBU_B:SCF:Critical low voltage error(detector=187)
Nov  5 20:56:11 <HOSTNAME> Alarm: /MBU_B/MEMB#1,/MBU_B:SCF:Critical low voltage error(detector=187)
Nov  5 20:56:32 <HOSTNAME> Alarm: /MBU_B/MEMB#2,/MBU_B:SCF:Critical low voltage error(detector=187)
Nov  5 20:56:45 <HOSTNAME> Alarm: /MBU_B/MEMB#3,/MBU_B:SCF:Critical low voltage error(detector=187)
Nov  5 20:56:50 <HOSTNAME> Warning: /MBU_B:SCF:Abnormal reaction of LSI (compare)
Nov  5 20:57:08 <HOSTNAME> Alarm: /MBU_B/MEMB#4,/MBU_B:SCF:Critical low voltage error(detector=187)
Nov  5 20:57:13 <HOSTNAME> Warning: /MBU_B:SCF:Abnormal reaction of LSI (compare)
Nov  5 20:57:16 <HOSTNAME> Warning: /MBU_B:SCF:Abnormal reaction of LSI (compare)
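
To see at a glance which units reported the low voltage alarm, the monitor log lines can be split into fields. This is a rough sketch that assumes every entry follows the layout of the lines shown above (timestamp, host, level, unit list, source, message); the names parse_monitor_line and low_voltage_units are hypothetical, and lines with a different layout are simply skipped.

```python
import re

# Layout assumed from the sample lines in this document:
#   <Mon DD HH:MM:SS> <host> <Alarm|Warning>: <unit[,unit...]>:<source>:<message>
LINE_RE = re.compile(
    r"^(?P<when>\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2})\s+(?P<host>\S+)\s+"
    r"(?P<level>Alarm|Warning):\s+(?P<units>[^:]+):(?P<source>[^:]+):(?P<message>.*)$"
)


def parse_monitor_line(line):
    """Split one monitor log line into fields, or return None if it does not fit."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None  # e.g. "last message repeated 11 times"
    entry = m.groupdict()
    entry["units"] = entry["units"].split(",")
    return entry


def low_voltage_units(lines):
    """Collect, in first-seen order, the units named in low voltage alarms."""
    units = []
    for line in lines:
        entry = parse_monitor_line(line)
        if entry and entry["message"].startswith("Critical low voltage error"):
            for unit in entry["units"]:
                if unit not in units:
                    units.append(unit)
    return units
```

Applied to the log excerpt above, low_voltage_units would list /MBU_B once together with each MEMB that raised the alarm, while the "Abnormal reaction of LSI" warnings are ignored.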

The following FMA MSG-IDs are seen:
   UUID: <UUID> MSG-ID: SCF-8004-3Y
   UUID: <UUID> MSG-ID: SCF-8002-K2


showstatus:
*   MBU_B Status:Faulted;
*       MEMB#0 Status:Faulted;
*       MEMB#1 Status:Faulted;
*       MEMB#2 Status:Faulted;
*       MEMB#3 Status:Faulted;
*       MEMB#4 Status:Faulted;
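
The faulted units can also be pulled out of saved showstatus output. The sketch below assumes each component line has the form shown above (an optional leading asterisk, the unit name, then "Status:<value>;"); faulted_units is a hypothetical helper name, and the pattern has only been checked against the lines quoted in this document.

```python
import re

# Assumed line shape, taken from the showstatus excerpt in this document:
#   "*   MBU_B Status:Faulted;"
STATUS_RE = re.compile(r"^\*?\s*(?P<unit>\S+)\s+Status:(?P<status>[^;]+);")


def faulted_units(showstatus_text):
    """Return the unit names whose status is exactly 'Faulted', in order."""
    result = []
    for line in showstatus_text.splitlines():
        m = STATUS_RE.match(line)
        if m and m.group("status").strip() == "Faulted":
            result.append(m.group("unit"))
    return result
```

For the excerpt above this would return MBU_B followed by MEMB#0 through MEMB#4; lines that do not carry a Status field are skipped.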

Losing power from IOU#0 may cause other errors that are not listed here; these should be considered victims of the power loss.

Changes

 

Cause
