Common Problems Reported by Platinum Monitoring and Recommended Actions for Exalogic Systems
(Doc ID 1985576.1)
Last updated on APRIL 09, 2024
Applies to:
Exalogic Elastic Cloud X6-2 Hardware - Version X6 to X6 [Release X6]Exalogic Elastic Cloud X3-2 Eighth Rack - Version X3 to X3 [Release X3]
Exalogic Elastic Cloud X4-2 Quarter Rack - Version X4 to X4 [Release X4]
Exalogic Elastic Cloud X3-2 Hardware - Version X6 to X6 [Release X6]
Exalogic Elastic Cloud X4-2 Half Rack - Version X6 to X6 [Release X6]
Linux x86-64
Oracle Solaris on x86-64 (64-bit)
Oracle Virtual Server x86-64
Purpose
This Note provides list of commonly reported Platinum Fault and Alert messages by Platinum monitoring setup and provides recommended actions for addressing those Platinum Faults.
Scope
This note focuses on common Platinum alerts and solutions. For more information on Oracle Platinum Services including a full list of Fault Monitoring that is done, visit the following note:
Oracle Platinum Services – Quick Reference Guide (Doc ID 1993848.1)
Details
To view full details, sign in with your My Oracle Support account. |
|
Don't have a My Oracle Support account? Click to get started! |
In this Document
Purpose |
Scope |
Details |
1. kernel: Error: Driver 'pcspkr' is already registered, aborting... |
2. kernel: sdp_process_tx_wc:261 sdp_sock( 4551:14 58027:10280): Send completion with error. wr_id 0x400000002 Status 12 |
3. kernel: uce_agent.bin[23006]: segfault at f6eac05c ip 00000000082639a1 sp 00000000f6eac060 error 6 in uce_agent.bin[8048000+6e8000] |
4. kernel: xs_tcp_setup_socket: connect returned unhandled error -107 |
5. kernel: bonding: bond1: Error: Unable to enslave eth326_2 because it is already up |
6. yum-updatesd: error getting update info: Cannot retrieve repository metadata (repomd.xml) for repository: ol5_UEK_latest. Please verify its path and try again |
7. kernel: cgrep[562]: segfault at 0 ip 000000004e7f7a3c sp 00000000ffc0ac4c error 4 in libc-2.5.so[4e79f000+154000] |
8. kernel: ponu_ge_lstdat1[16101]: segfault at 28 ip 00007f5002b0a33d sp 00007fffe0fb6ee0 error 4 in libkitd.so[7f5002aea000+3f000] |
9. kernel: FNDLOAD[1296]: segfault at c ip 000000000805689a sp 00000000ff8340ec error 4 in FNDLOAD[8048000+111000] |
10. kernel: ERST: Error Record Serialization Table (ERST) support is initialized |
12. kernel: JSL[23982]: Segfault at 8 ip 000000000040e995 sp 00007ffd0047b880 error 4 in JSL[400000+20000] |
13. kernel: ipmitool[10165]: segfault at 421 ip 000000000044a066 sp 00007ffd6ac4e2c0 error 4 in ipmitool[400000+78000] |
14. kernel: Buffer I/O error on device dm-xx, logical block xx |
15. kernel: tmipcrm[19586]: segfault at 7fe3ce38504e ip 00007xxxxx80633 sp 000xxxxx35b1abc0 error 4 in libtux.so[7fe3d761b000+276000] |
PLATINUM FAULT: Ilom Sensor Alerts: SensorAlerts:PowerSupplyStatus Power supply sensor(s) at level - CRITICAL |
PLATINUM FAULT: ZFSProblem:ProblemSeverity AK-8003-Y6 : The device configuration for JBOD 1111FMD00X |
PLATINUM FAULT: ZFSProblem:ProblemSeverity AK-8002-9M : The cable between the Ethernet ports of each controller is down |
Details: |
PLATINUM FAULT: ZFSProblem:ProblemSeverity ZFS-8000-D3 : ZFS device id1 sd@SATA_____TOSHIBA_THN |
PLATINUM FAULT: ZFSProblem:ProblemSeverity DISK-8000-CY : There have been non-recovered ZFS checksum errors on this disk |
PLATINUM FAULT: ZFSAlert:ProblemType All communication with the cluster peer has been lost |
PLATINUM: An Integrated I/O (II0) fatal error in downstream PCIE device has occurred |
PLATINUM FAULT: ZFSProblem:ProblemSeverity SUNOS-8000-KL : The system has rebooted after a kernel panic. Severity: Major Message ID: xxxxxxxx-xxxx-xxxx-xxxxxxxxxxxx |
Exalogic Virtual: "cacao: Error: Fail to start cacao agent" Error Message Seen In /var/log/messages of EC Control vServer |
Exalogic Virtual: Troubleshooting ORA-240 Errors |
References |