My Oracle Support Banner

Common Problems Reported by Platinum Monitoring and Recommended Actions for Exalogic Systems (Doc ID 1985576.1)

Last updated on APRIL 09, 2024

Applies to:

Exalogic Elastic Cloud X6-2 Hardware - Version X6 to X6 [Release X6]
Exalogic Elastic Cloud X3-2 Eighth Rack - Version X3 to X3 [Release X3]
Exalogic Elastic Cloud X4-2 Quarter Rack - Version X4 to X4 [Release X4]
Exalogic Elastic Cloud X3-2 Hardware - Version X6 to X6 [Release X6]
Exalogic Elastic Cloud X4-2 Half Rack - Version X6 to X6 [Release X6]
Linux x86-64
Oracle Solaris on x86-64 (64-bit)
Oracle Virtual Server x86-64
































Purpose

This Note provides list of commonly reported Platinum Fault and Alert messages by Platinum monitoring setup and provides recommended actions for addressing those Platinum Faults.

Scope

This note focuses on common Platinum alerts and solutions. For more information on Oracle Platinum Services including a full list of Fault Monitoring that is done, visit the following note:

Oracle Platinum Services – Quick Reference Guide (Doc ID 1993848.1)

Details

To view full details, sign in with your My Oracle Support account.

Don't have a My Oracle Support account? Click to get started!


In this Document
Purpose
Scope
Details
 PLATINUM FAULT: LogFileMonitor:sys_occurrence_count Scanned /var/log/messages from line xxxxx to yyyyy. Found 1 occurence of the pattern [kernel:.* (error|crit|fatal)].. 1 crossed warning ( ) or critical (0) threshold.
 PLATINUM FAULT: LogFileMonitor:sys_occurrence_count Scanned /var/adm/messages from line xxxxx to yyyyy. Found 2 occurences of the pattern [svc.startd.*failed].. 2 crossed warning (0) or critical (1) threshold.
 1. kernel: Error: Driver 'pcspkr' is already registered, aborting...
 2. kernel: sdp_process_tx_wc:261 sdp_sock( 4551:14 58027:10280): Send completion with error. wr_id 0x400000002 Status 12
 3. kernel: uce_agent.bin[23006]: segfault at f6eac05c ip 00000000082639a1 sp 00000000f6eac060 error 6 in uce_agent.bin[8048000+6e8000]
 4. kernel: xs_tcp_setup_socket: connect returned unhandled error -107
 5. kernel: bonding: bond1: Error: Unable to enslave eth326_2 because it is already up
 6. yum-updatesd: error getting update info: Cannot retrieve repository metadata (repomd.xml) for repository: ol5_UEK_latest. Please verify its path and try again
 7. kernel: cgrep[562]: segfault at 0 ip 000000004e7f7a3c sp 00000000ffc0ac4c error 4 in libc-2.5.so[4e79f000+154000]
 8. kernel: ponu_ge_lstdat1[16101]: segfault at 28 ip 00007f5002b0a33d sp 00007fffe0fb6ee0 error 4 in libkitd.so[7f5002aea000+3f000]
 9. kernel: FNDLOAD[1296]: segfault at c ip 000000000805689a sp 00000000ff8340ec error 4 in FNDLOAD[8048000+111000]
 10. kernel: ERST: Error Record Serialization Table (ERST) support is initialized
 11. svc.startd[13]: [ID 652011 daemon.warning] svc:/application/pkg/system-repository:default: Method "/lib/svc/method/svc-pkg-sysrepo refresh" failed with exit status 95.
     svc.startd[13]: [ID 748625 daemon.error] application/pkg/system-repository:default failed fatally: transitioned to maintenance (see 'svcs -xv' for details)
 12. kernel: JSL[23982]: Segfault at 8 ip 000000000040e995 sp 00007ffd0047b880 error 4 in JSL[400000+20000]
 13. kernel: ipmitool[10165]: segfault at 421 ip 000000000044a066 sp 00007ffd6ac4e2c0 error 4 in ipmitool[400000+78000]
 14. kernel: Buffer I/O error on device dm-xx, logical block xx
 15. kernel: tmipcrm[19586]: segfault at 7fe3ce38504e ip 00007xxxxx80633 sp 000xxxxx35b1abc0 error 4 in libtux.so[7fe3d761b000+276000]
 PLATINUM FAULT: adrAlertLogIncidentError:accessViolationErrStack An access violation detected in /u01/app/oracle/diag/rdbms/elctrldb/elctrldb/alert/log.xml at time/line number: <Date Time>/<Line Number>
 PLATINUM FAULT: adrAlertLogIncidentError:genericIncidentErrStack Incident (ORA 445) detected in /u01/app/oracle/diag/rdbms/elctrldb/elctrldb/alert/log.xml at time/line number: <Date Time>/<Line Number>
 PLATINUM FAULT: adrAlertLogIncidentError:genericIncidentErrStack Incident (ORA 240) detected in /u01/app/oracle/diag/rdbms/elctrldb/elctrldb/alert/log.xml at time/line number: <Date Time> / <Line Number>
 PLATINUM FAULT: Ilom Sensor Alerts: SensorAlerts:PowerSupplyStatus Power supply sensor(s) at level - CRITICAL
 PLATINUM FAULT: ZFSProblem:ProblemSeverity AK-8003-Y6 : The device configuration for JBOD 1111FMD00X
 PLATINUM FAULT: ZFSProblem:ProblemSeverity AK-8002-9M : The cable between the Ethernet ports of each controller is down
 Details:
 PLATINUM FAULT: ZFSProblem:ProblemSeverity ZFS-8000-D3 : ZFS device id1 sd@SATA_____TOSHIBA_THN
 PLATINUM FAULT: ZFSProblem:ProblemSeverity USB-8000-GT : A hardware fault within the device or its interface was detected in the USB device. The driver has failed to initialize the device and the device is in an invalid state.
 PLATINUM FAULT: ZFSProblem:ProblemSeverity DISK-8000-CY : There have been non-recovered ZFS checksum errors on this disk
 PLATINUM FAULT: ZFSAlert:ProblemType All communication with the cluster peer has been lost
 PLATINUM: An Integrated I/O (II0) fatal error in downstream PCIE device has occurred
 PLATINUM FAULT: ZFSProblem:ProblemSeverity SUNOS-8000-KL : The system has rebooted after a kernel panic. Severity: Major Message ID: xxxxxxxx-xxxx-xxxx-xxxxxxxxxxxx
 Exalogic Virtual: "cacao: Error: Fail to start cacao agent" Error Message Seen In /var/log/messages of EC Control vServer
 Exalogic Virtual: Troubleshooting ORA-240 Errors
References

My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.