UE MRC Warning WARN_VMSE_TRAINING_FAILED - VMSE Link JC failure
(Doc ID 2824980.1)
Last updated on DECEMBER 01, 2021
Applies to:
Oracle Server X5-4 - Version All Versions to All Versions [Release All Releases]Exalytics In-Memory Machine X5-4 - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.
Symptoms
-- P3/MR0 shows faulty:
-> show -d properties -level all -format nowrap /System
/System
Properties:
health = Service Required
health_details = P3/MR0 (CPU 3 Memory Riser 0) is faulty. Type 'show /System/Open_Problems' for details.
open_problems_count = 1
type = Rack Mount
model = EXALYTICS X5-4
system_fw_version = 3.2.6.20.a
primary_operating_system = Oracle Linux Server release 6.6
locator_indicator = Off
power_state = On
actual_power_consumption = 665 wat
action = (none)
--fma shows fault:
------------------- ------------------------------------ -------------- --------
Time UUID msgid Severity
------------------- ------------------------------------ -------------- --------
2021-08-14/06:00:26 a1b65a9f-1637-c744-aace-d9c5162c3838 SPX86A-8003-NK Critical
Problem Status : open
Diag Engine : fdd 1.0
System
Manufacturer : Oracle Corporation
Name : EXALYTICS X5-4
Part_Number : 7301336
----------------------------------------
Suspect 1 of 1
Fault class : fault.cpu.intel.mxb.training-failed
Certainty : 100%
Affects : /SYS/MB/P3/MR0
Status : faulted
FRU
Status : faulty
Location : /SYS/MB/P3/MR0
Manufacturer : MiTAC International Corporation
Name : ASSY,MEMORY RISER
Part_Number : 7090451
Revision : 02
Description : BIOS was unable to train the link to the memory expansion buffer.
Response : MR or CMOD LED is on.
Impact : All memory downstream of the memory buffer is unconfigured from the system.
Action : Please refer to the associated reference document at http://support.oracle.com/msg/SPX86A-8003-NK for the latest service procedures and policies regarding this diagnosis.
- Affected MR was replaced. It was verified with the MR serial number.
- P3/MR0 shows faulty even if known good MR was installed.
- New MR is installed in P3/MR1 and works OK.9 MiTAC International Corporation ASSY,MEMORY RISER 7090451 Rev 02
--ereport shows following errors:
2021-08-14/04:53:16 ereport.cpu.intel.mxb.training-failed@/SYS/MB/P3/MR0/MXB0
mrc_warn_major_errcode = 0x33
mrc_warn_minor_errcode = 0x1
dimm_undefined = 0x1
2021-08-14/05:23:29 ereport.cpu.intel.mxb.training-failed@/SYS/MB/P3/MR0/MXB0
mrc_warn_major_errcode = 0x33
mrc_warn_minor_errcode = 0x1
dimm_undefined = 0x1
2021-08-14/06:00:26 ereport.cpu.intel.mxb.training-failed@/SYS/MB/P3/MR0/MXB0
mrc_warn_major_errcode = 0x33
mrc_warn_minor_errcode = 0x1
dimm_undefined = 0x1
--Debug shows errors are same time:
Sat Aug 14 04:53:16 2021 ID 04a4 V UE MRC Warning WARN_VMSE_TRAINING_FAILED_173(0x33) Node 6 DIMM 3
Sat Aug 14 05:23:29 2021 ID 04c2 V UE MRC Warning WARN_VMSE_TRAINING_FAILED_173(0x33) Node 6 DIMM 3
Sat Aug 14 06:00:26 2021 ID 04e8 V UE MRC Warning WARN_VMSE_TRAINING_FAILED_173(0x33) Node 6 DIMM 3
--HW diag shows issue with "VMSE Link JC 0" for P3/MR0:
U 3 Memory Controller 0
VMSE Link JC 0 Full Width : PASSED
/SYS/MB/P3/MR0/D3 0 MT/s : <<<<<<<<<<<<< All DIMMs under VMSE Link JC 0 shows "0 MT/s".
/SYS/MB/P3/MR0/D4 0 MT/s :
/SYS/MB/P3/MR0/D0 0 MT/s :
/SYS/MB/P3/MR0/D1 0 MT/s :
VMSE Link JC 1 Full Width : PASSED
/SYS/MB/P3/MR0/D9 1600 MT/s : PASSED
/SYS/MB/P3/MR0/D10 1600 MT/s : PASSED
/SYS/MB/P3/MR0/D6 1600 MT/s : PASSED
/SYS/MB/P3/MR0/D7 1600 MT/s : PASSED
Changes
Memory Riser -P3/MR0 was replaced previously due to other error.
Cause
To view full details, sign in with your My Oracle Support account. |
|
Don't have a My Oracle Support account? Click to get started! |
In this Document
Symptoms |
Changes |
Cause |
Solution |