Node With Multiple VMs Hangs Due To CPU Stall
(Doc ID 2286384.1)
Last updated on NOVEMBER 08, 2018
Applies to:Oracle VM - Version 3.2.1 and later
Oracle Cloud Infrastructure - Version N/A and later
Information in this document applies to any platform.
A node and all running VMs suddenly become unresponsive forcing a power cycle to clear the problem. After booting, the system operates normally with no further issues. Investigation into the issue reveals the following:
- The the ILOM hostconsole.log file for the node shows the following mlx4_core errors followed by CPU stalls with a stack trace. These may repeat several times.
To view full details, sign in with your My Oracle Support account.
Don't have a My Oracle Support account? Click to get started!