Sun Fire[TM] 12K/15K/E20K/E25K : Voltage Error on CPU Leads to Blacklisting the PROCPAIR (Doc ID 1010757.1)

Last updated on OCTOBER 04, 2016

Applies to:

Sun Fire 15K Server - Version Not Applicable to Not Applicable [Release N/A]
Sun Fire E20K Server - Version Not Applicable to Not Applicable [Release N/A]
Sun Fire 12K Server - Version Not Applicable to Not Applicable [Release N/A]
Sun Fire E25K Server - Version Not Applicable to Not Applicable [Release N/A]
All Platforms

Goal

When a voltage problem is detected on a single CPU on Sun Fire[TM] 12K/15K/E20K/E25K platforms, ASR (Automatic System Recovery) blacklists its PROCPAIR, and the domain is reset.
Blacklisting the PROCPAIR means that two CPUs and their memory are disabled and removed from the domain configuration.
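
The effect on the CPU count can be illustrated with a minimal sketch. It is not taken from the platform software: the function names are hypothetical, and it assumes that CPUs are numbered per board and paired as (0, 1) and (2, 3).

```python
# Hypothetical model of PROCPAIR blacklisting; not the actual SMS logic.
# Assumption: CPUs on a board are paired by index as (0, 1) and (2, 3).
CPUS_PER_PAIR = 2


def procpair_of(cpu: int) -> int:
    """Return the index of the PROCPAIR that contains the given CPU."""
    return cpu // CPUS_PER_PAIR


def cpus_disabled_by_fault(faulty_cpu: int) -> list[int]:
    """Return every CPU removed when the faulty CPU's PROCPAIR is blacklisted."""
    first = procpair_of(faulty_cpu) * CPUS_PER_PAIR
    return list(range(first, first + CPUS_PER_PAIR))
```

Under these assumptions, a voltage fault on CPU 3 removes CPUs 2 and 3, so the healthy partner CPU is lost along with the faulty one.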

This document explains why esmd (the Environmental Status Monitoring Daemon) disables and removes two CPUs as the result of a voltage problem on a single CPU.
This behavior might seem incorrect, but the recovery action works as designed.

Solution
