My Oracle Support Banner

Troubleshooting Sun Fire[TM] Uncorrectable CPU and Memory Error(s) on Solaris[TM] 8 and 9 (Doc ID 1006517.1)

Last updated on JULY 30, 2020

Applies to:

Sun Fire E4900 Server
Sun Fire E6900 Server
Sun Fire E25K Server
Sun Fire 12K Server
Sun Fire E20K Server
All Platforms

Purpose

This document addresses uncorrectable CPU/Memory errors reported on systems running Solaris[TM] 8 and Solaris[TM] 9.

 

Your system may have one or more of the following symptoms:

 

A. Uncorrectable ECC error on from a read from system memory

 

Main memory uncorrectable ECC error detected by CPU3 from the bank of DIMMs in Slot A: J8100 J8101 J8201 J8200

Error Messaging Notes
  • When browsing messages files and observing console output note that [AFT1] is included in these messages, a 1 represents the "Asynchronous Fault Trap" for uncorrectable and unrecoverable errors. AFT0 is used for correctable errors, AFT2 and AFT3 can be ignored in almost all cases.
  • The above error messaging may change slightly depending on your kernel update patch version. 
  • It is important to understand that uncorrectable ECC errors can be reported by multiple components.  At no point will the corrupted data actually be used.

This document does not apply to Solaris[TM] 10 as FMA automates the diagnosis of these type of faults.  See <Document:1018939.1> Solaris[TM] 10 Operating System: Displaying the list of Fault Management Architecture (FMA) resources currently believed to be faulted If Solaris has not paniced, crashed, or rebooted and you are just seeing correctable errors please see <Document:1006513.1> Troubleshooting Sun Fire[TM] Correctable CPU and Memory Error(s) on Solaris[TM] 8 and 9

Troubleshooting Steps

To view full details, sign in with your My Oracle Support account.

Don't have a My Oracle Support account? Click to get started!


My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.