bdacheckcluster/bdacheckhw on BDA 4.12 and Lower Raises "fault.io.intel.iio.pcie-fatal" WARNINGS on New X7-2 Servers
(Doc ID 2478259.1)
Last updated on JULY 20, 2024
Applies to:
Big Data Appliance Integrated Software - Version 4.5.0 and laterLinux x86-64
Symptoms
The bdacheckcluster utility on BDA V4.12 and lower raises "fault.io.intel.iio.pcie-fatal" WARNINGS on new X7-2 servers. The error is reported by bdacheckhw, but it can cause bdachecksw/bdacheckcluster to hang.
This may be seen after a BDA Mammoth cluster expansion when running bdacheckcluster manually or as part of the Mammoth cluster verification step. The symptom will be that bdacheckcluster raises "fault.io.intel.iio.pcie-fatal" WARNINGS on all new X7-2 servers.
After a BDA Mammoth expansion, if bdacheckcluster hangs when the "fault.io.intel.iio.pcie-fatal" faults are raised, make sure to review the first step in the "Post Expansion Steps" section of: Expanding an Oracle Big Data Appliance V4.5 and Higher Cluster (Doc ID 2478298.1). It is critical to verify that /opt/oracle/bda/install/state/config.json does not contain more that one occurrence of the same hostname or IP address in each of the file's "RACKS" sub-arrays. This can contribute to the faults being generated and bdacheckcluster hanging.
bdacheckcluster output looks like below:
See the next sections for a discussion on the relevant faults and how to clear them.
Cause
To view full details, sign in with your My Oracle Support account. |
|
Don't have a My Oracle Support account? Click to get started! |
In this Document
Symptoms |
Cause |
Solution |
Detailed Steps |
Appendix A |
References |