UE MRC Warning WARN_VMSE_TRAINING_FAILED - VMSE Link JC failure (Doc ID 2824980.1)

Last updated on APRIL 24, 2024

Applies to:

Oracle Server X5-4 - Version All Versions to All Versions [Release All Releases]
Exalytics In-Memory Machine X5-4 - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.

Symptoms

-- P3/MR0 is reported as faulty:

-> show -d properties -level all -format nowrap /System

/System
Properties:
health = Service Required
health_details = P3/MR0 (CPU 3 Memory Riser 0) is faulty. Type 'show /System/Open_Problems' for details.
open_problems_count = 1
type = Rack Mount
model = EXALYTICS X5-4
system_fw_version = 3.2.6.20.a
primary_operating_system = Oracle Linux Server release 6.6
locator_indicator = Off
power_state = On
actual_power_consumption = 665 watts
action = (none)

--fma shows fault:
------------------- ------------------------------------ -------------- --------
Time UUID msgid Severity
------------------- ------------------------------------ -------------- --------
2021-08-14/06:00:26 a1b65a9f-1637-c744-aace-d9c5162c3838 SPX86A-8003-NK Critical

Problem Status : open
Diag Engine : fdd 1.0
System
Manufacturer : Oracle Corporation
Name : EXALYTICS X5-4
Part_Number : 7301336
----------------------------------------

Suspect 1 of 1
Fault class : fault.cpu.intel.mxb.training-failed
Certainty : 100%
Affects : /SYS/MB/P3/MR0
Status : faulted

FRU
Status : faulty
Location : /SYS/MB/P3/MR0
Manufacturer : MiTAC International Corporation
Name : ASSY,MEMORY RISER
Part_Number : 7090451
Revision : 02

Description : BIOS was unable to train the link to the memory expansion buffer.

Response : MR or CMOD LED is on.

Impact : All memory downstream of the memory buffer is unconfigured from the system.

Action : Please refer to the associated reference document at http://support.oracle.com/msg/SPX86A-8003-NK for the latest service procedures and policies regarding this diagnosis.

- The affected MR was replaced. The replacement was verified by its MR serial number.
- P3/MR0 is still reported as faulty even with a known-good MR installed.
- The new MR works correctly when installed in P3/MR1:
9 MiTAC International Corporation ASSY,MEMORY RISER 7090451 Rev 02

--ereport shows following errors:

2021-08-14/04:53:16 ereport.cpu.intel.mxb.training-failed@/SYS/MB/P3/MR0/MXB0
mrc_warn_major_errcode = 0x33
mrc_warn_minor_errcode = 0x1
dimm_undefined = 0x1

2021-08-14/05:23:29 ereport.cpu.intel.mxb.training-failed@/SYS/MB/P3/MR0/MXB0
mrc_warn_major_errcode = 0x33
mrc_warn_minor_errcode = 0x1
dimm_undefined = 0x1

2021-08-14/06:00:26 ereport.cpu.intel.mxb.training-failed@/SYS/MB/P3/MR0/MXB0
mrc_warn_major_errcode = 0x33
mrc_warn_minor_errcode = 0x1
dimm_undefined = 0x1
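
The ereport entries above follow a regular layout (a timestamp and class@component header, then name = value fields), so they can be tallied with a short script. The following is a minimal sketch, assuming the layout matches the excerpt exactly; `parse_ereports` and the dictionary keys are hypothetical names chosen for illustration and are not part of any Oracle tool.

```python
import re

# Header layout assumed from the excerpt:
# "<YYYY-MM-DD/HH:MM:SS> <ereport class>@<component path>"
HEADER = re.compile(
    r"^(\d{4}-\d{2}-\d{2}/\d{2}:\d{2}:\d{2})\s+(ereport\.[\w.\-]+)@(\S+)$"
)
# Field layout assumed from the excerpt: "<name> = <value>"
FIELD = re.compile(r"^(\w+)\s*=\s*(\S+)$")


def parse_ereports(text):
    """Return one dict per ereport entry found in the pasted text."""
    events = []
    for raw in text.splitlines():
        line = raw.strip()
        header = HEADER.match(line)
        if header:
            events.append({
                "time": header.group(1),
                "class": header.group(2),
                "target": header.group(3),
                "fields": {},
            })
            continue
        field = FIELD.match(line)
        if field and events:
            # int(..., 0) accepts the 0x-prefixed values used in the excerpt
            events[-1]["fields"][field.group(1)] = int(field.group(2), 0)
    return events


SAMPLE = """
2021-08-14/04:53:16 ereport.cpu.intel.mxb.training-failed@/SYS/MB/P3/MR0/MXB0
mrc_warn_major_errcode = 0x33
mrc_warn_minor_errcode = 0x1
dimm_undefined = 0x1

2021-08-14/05:23:29 ereport.cpu.intel.mxb.training-failed@/SYS/MB/P3/MR0/MXB0
mrc_warn_major_errcode = 0x33
mrc_warn_minor_errcode = 0x1
dimm_undefined = 0x1

2021-08-14/06:00:26 ereport.cpu.intel.mxb.training-failed@/SYS/MB/P3/MR0/MXB0
mrc_warn_major_errcode = 0x33
mrc_warn_minor_errcode = 0x1
dimm_undefined = 0x1
"""

events = parse_ereports(SAMPLE)
targets = {event["target"] for event in events}
major_codes = {event["fields"]["mrc_warn_major_errcode"] for event in events}
print(len(events), targets, [hex(code) for code in major_codes])
```

With the excerpt as input, every entry resolves to the same component and the same major error code, which is what indicates a repeating failure on a single memory buffer rather than scattered errors.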

--Debug log shows the errors at the same times:
Sat Aug 14 04:53:16 2021 ID 04a4 V UE MRC Warning WARN_VMSE_TRAINING_FAILED_173(0x33) Node 6 DIMM 3
Sat Aug 14 05:23:29 2021 ID 04c2 V UE MRC Warning WARN_VMSE_TRAINING_FAILED_173(0x33) Node 6 DIMM 3
Sat Aug 14 06:00:26 2021 ID 04e8 V UE MRC Warning WARN_VMSE_TRAINING_FAILED_173(0x33) Node 6 DIMM 3
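
The match between the debug log and the ereports can be checked mechanically by normalizing both timestamp formats and comparing them. This is a rough sketch, assuming the two formats shown in this document and an English (C) locale for day and month names; `debug_events` and the regular expression are hypothetical helpers, not an Oracle utility.

```python
import re
from datetime import datetime

# Debug line layout assumed from the excerpt:
# "<Day Mon DD HH:MM:SS YYYY> ID <id> ... <WARN_NAME>(<hex code>) ..."
DEBUG_LINE = re.compile(
    r"^(\w{3} \w{3} +\d{1,2} \d{2}:\d{2}:\d{2} \d{4}) ID (\w+) "
    r".*?(WARN_\w+)\((0x[0-9a-fA-F]+)\)"
)


def debug_events(lines):
    """Map each debug-log timestamp to (warning name, error code)."""
    found = {}
    for raw in lines:
        match = DEBUG_LINE.match(raw.strip())
        if match:
            when = datetime.strptime(match.group(1), "%a %b %d %H:%M:%S %Y")
            found[when] = (match.group(3), int(match.group(4), 16))
    return found


DEBUG_LOG = [
    "Sat Aug 14 04:53:16 2021 ID 04a4 V UE MRC Warning WARN_VMSE_TRAINING_FAILED_173(0x33) Node 6 DIMM 3",
    "Sat Aug 14 05:23:29 2021 ID 04c2 V UE MRC Warning WARN_VMSE_TRAINING_FAILED_173(0x33) Node 6 DIMM 3",
    "Sat Aug 14 06:00:26 2021 ID 04e8 V UE MRC Warning WARN_VMSE_TRAINING_FAILED_173(0x33) Node 6 DIMM 3",
]

# Timestamps copied from the ereport headers in this document
EREPORT_TIMES = [
    "2021-08-14/04:53:16",
    "2021-08-14/05:23:29",
    "2021-08-14/06:00:26",
]

debug = debug_events(DEBUG_LOG)
ereport = {datetime.strptime(t, "%Y-%m-%d/%H:%M:%S") for t in EREPORT_TIMES}

matched = sorted(set(debug) & ereport)      # present in both sources
unmatched = sorted(set(debug) ^ ereport)    # present in only one source
print(len(matched), len(unmatched))
```

An empty `unmatched` list means each MRC warning in the debug log lines up with exactly one ereport, and the 0x33 code in the warning agrees with `mrc_warn_major_errcode`.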

--HW diag shows issue with "VMSE Link JC 0" for P3/MR0:

CPU 3 Memory Controller 0
VMSE Link JC 0 Full Width : PASSED
/SYS/MB/P3/MR0/D3 0 MT/s :  <<<<<<<<<<<<< All DIMMs under VMSE Link JC 0 show "0 MT/s".
/SYS/MB/P3/MR0/D4 0 MT/s :
/SYS/MB/P3/MR0/D0 0 MT/s :
/SYS/MB/P3/MR0/D1 0 MT/s :

VMSE Link JC 1 Full Width : PASSED

/SYS/MB/P3/MR0/D9 1600 MT/s : PASSED
/SYS/MB/P3/MR0/D10 1600 MT/s : PASSED
/SYS/MB/P3/MR0/D6 1600 MT/s : PASSED
/SYS/MB/P3/MR0/D7 1600 MT/s : PASSED
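
The pattern in the HW diag output above (every DIMM behind one VMSE link reporting 0 MT/s while the other link runs at 1600 MT/s) can be picked out automatically. The sketch below assumes the line layout shown in this excerpt; `dimms_by_link` and `untrained_links` are hypothetical names used only for illustration.

```python
import re

# Layouts assumed from the excerpt above
LINK = re.compile(r"^VMSE Link JC (\d+)")
DIMM = re.compile(r"^(/SYS/\S+)\s+(\d+)\s+MT/s")


def dimms_by_link(text):
    """Group (DIMM path, speed in MT/s) pairs under their VMSE link number."""
    links = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        link = LINK.match(line)
        if link:
            current = int(link.group(1))
            links[current] = []
            continue
        dimm = DIMM.match(line)
        if dimm and current is not None:
            links[current].append((dimm.group(1), int(dimm.group(2))))
    return links


def untrained_links(links):
    """Return links whose DIMMs all report 0 MT/s."""
    return {
        link: [path for path, _ in dimms]
        for link, dimms in links.items()
        if dimms and all(speed == 0 for _, speed in dimms)
    }


DIAG = """
VMSE Link JC 0 Full Width : PASSED
/SYS/MB/P3/MR0/D3 0 MT/s :
/SYS/MB/P3/MR0/D4 0 MT/s :
/SYS/MB/P3/MR0/D0 0 MT/s :
/SYS/MB/P3/MR0/D1 0 MT/s :

VMSE Link JC 1 Full Width : PASSED

/SYS/MB/P3/MR0/D9 1600 MT/s : PASSED
/SYS/MB/P3/MR0/D10 1600 MT/s : PASSED
/SYS/MB/P3/MR0/D6 1600 MT/s : PASSED
/SYS/MB/P3/MR0/D7 1600 MT/s : PASSED
"""

suspect = untrained_links(dimms_by_link(DIAG))
print(suspect)
```

Grouping by link rather than by individual DIMM is deliberate: a whole link at 0 MT/s points at the link or buffer path, whereas a single DIMM at 0 MT/s would point at that DIMM.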

Changes

Memory Riser P3/MR0 was previously replaced because of a different error.

Cause

To view full details, sign in with your My Oracle Support account.
