Reboot Does Not Happen After Killed ocssd.bin
(Doc ID 2238849.1)
Last updated on SEPTEMBER 16, 2021
Applies to:
Oracle Database Cloud Service - Version N/A and laterOracle Database - Enterprise Edition - Version 12.1.0.2 and later
Oracle Database Cloud Schema Service - Version N/A and later
Oracle Database Exadata Express Cloud Service - Version N/A and later
Oracle Database Exadata Cloud Machine - Version N/A and later
Information in this document applies to any platform.
Symptoms
Node doesn't get rebooted after killing ocssd.bin process on AIX/Veritas SFRAC Cluster:
ohasd_cssdagent_root.trc
--------------------------------------------------------------------
2016-11-16 17:24:42.556864 :CLSFRAME:2199: {0:13:1784} Worker thread is exiting in TM [MultiThread] to meet the desired count of 3. New count is 3
2016-11-16 17:25:12.557624 :CLSFRAME:1: TM [MultiThread] is changing desired thread # to 4. Current # is 3
2016-11-16 17:26:01.414089 : CSSCLNT:3600: clsssRecvMsg: got a disconnect
from the server while waiting for message type 27 <----------------------- killed ocssd process at this time.
2016-11-16 17:26:01.414158 :GIPCXCPT:3600: gipcInternalSend: connection not
valid for send operation endp 112660670 [000000000000025b] { gipcEndpoint :localAddr
'clsc://(ADDRESS=(PROTOCOL=ipc)(KEY=)(GIPCID=280d4217-000025ae-14812266))',remoteAddr
'clsc://(ADDRESS=(PROTOCOL=ipc)(KEY=<KEY>)(GIPCID=000025ae-280d4
217-19006326))', numPend 0, numReady 0, numDone 0, numDead 0, numTransfer 0,
objFlags 0x0, pidPeer 19006326, readyRef 0, ready 0, wobj 1126625b0, sendp 0
status 0flags 0x2003861e, flags-2 0x0, usrFlags 0x20010 }, ret gipcretConnectionLost (12)
2016-11-16 17:26:01.414198 :GIPCXCPT:3600: gipcSendSyncF [clsssServerRPC :
clsss.c : 6791]: EXCEPTION[ ret gipcretConnectionLost (12) ] failed to send
on endp 112660670 [000000000000025b] { gipcEndpoint : localAddr
'clsc://(ADDRESS=(PROTOCOL=ipc)(KEY=)(GIPCID=280d4217-000025ae-14812266))', remoteAddr
'clsc://(ADDRESS=(PROTOCOL=ipc)(KEY=<KEY>)(GIPCID=000025ae-280d4
217-19006326))', numPend 0, numReady 0, numDone 0, numDead 0, numTransfer 0,
objFlags 0x0, pidPeer 19006326, readyRef 0, ready 0, wobj 1126625b0, sendp 0
status 0flags 0x2003861e, flags-2 0x0, usrFlags 0x20010 }, addr
0000000000000000, buf 1125ba0f8, len 80, flags 0x8000000
2016-11-16 17:26:01.414202 : CSSCLNT:3600: clsssServerRPC: send failed with err 12, msg type 7
2016-11-16 17:26:01.414205 : CSSCLNT:3600: clsssCommonClientExit: RPC failure, rc 3
2016-11-16 17:26:01.415208 : USRTHRD:515: clsnpoll_BlockMsg: lost connection with CSS
2016-11-16 17:26:01.415264 : USRTHRD:515: clsncssd_logose: slos [-2], SLOS depend-msg [76], SLOS error-msg [Socket is]
2016-11-16 17:26:01.415285 : USRTHRD:515: clsncssd_logose: SLOS other info is [invalid value ].
2016-11-16 17:26:01.415303 : USRTHRD:515: clsnpollmsg_main: warning vendor clusterware - 1
2016-11-16 17:26:01.415318 : USRTHRD:515: clsnpollmsg_main: calling sync
2016-11-16 17:26:02.358184 : USRTHRD:515: clsnpollmsg_main: sync completed
2016-11-16 17:26:02.358208 : USRTHRD:515: clsnpoll_cleanup: to exit status = 4
2016-11-16 17:26:02.366227 : USRTHRD:515: clsncssd_reboot: fatal 8 clidead 0 mode 3 dev 0
2016-11-16 17:26:02.366271 : USRTHRD:515: clsncssd_reboot: Waiting for reboot
2016-11-16 17:26:03.366360 : USRTHRD:515: clsncssd_reboot: Waiting for reboot
2016-11-16 17:26:04.366435 : USRTHRD:515: clsncssd_reboot: Waiting for reboot
2016-11-16 17:26:05.366521 : USRTHRD:515: clsncssd_reboot: Waiting for reboot <<== Repeating without OS reboot
--------------------------------------------------------------------
2016-11-16 17:24:42.556864 :CLSFRAME:2199: {0:13:1784} Worker thread is exiting in TM [MultiThread] to meet the desired count of 3. New count is 3
2016-11-16 17:25:12.557624 :CLSFRAME:1: TM [MultiThread] is changing desired thread # to 4. Current # is 3
2016-11-16 17:26:01.414089 : CSSCLNT:3600: clsssRecvMsg: got a disconnect
from the server while waiting for message type 27 <----------------------- killed ocssd process at this time.
2016-11-16 17:26:01.414158 :GIPCXCPT:3600: gipcInternalSend: connection not
valid for send operation endp 112660670 [000000000000025b] { gipcEndpoint :localAddr
'clsc://(ADDRESS=(PROTOCOL=ipc)(KEY=)(GIPCID=280d4217-000025ae-14812266))',remoteAddr
'clsc://(ADDRESS=(PROTOCOL=ipc)(KEY=<KEY>)(GIPCID=000025ae-280d4
217-19006326))', numPend 0, numReady 0, numDone 0, numDead 0, numTransfer 0,
objFlags 0x0, pidPeer 19006326, readyRef 0, ready 0, wobj 1126625b0, sendp 0
status 0flags 0x2003861e, flags-2 0x0, usrFlags 0x20010 }, ret gipcretConnectionLost (12)
2016-11-16 17:26:01.414198 :GIPCXCPT:3600: gipcSendSyncF [clsssServerRPC :
clsss.c : 6791]: EXCEPTION[ ret gipcretConnectionLost (12) ] failed to send
on endp 112660670 [000000000000025b] { gipcEndpoint : localAddr
'clsc://(ADDRESS=(PROTOCOL=ipc)(KEY=)(GIPCID=280d4217-000025ae-14812266))', remoteAddr
'clsc://(ADDRESS=(PROTOCOL=ipc)(KEY=<KEY>)(GIPCID=000025ae-280d4
217-19006326))', numPend 0, numReady 0, numDone 0, numDead 0, numTransfer 0,
objFlags 0x0, pidPeer 19006326, readyRef 0, ready 0, wobj 1126625b0, sendp 0
status 0flags 0x2003861e, flags-2 0x0, usrFlags 0x20010 }, addr
0000000000000000, buf 1125ba0f8, len 80, flags 0x8000000
2016-11-16 17:26:01.414202 : CSSCLNT:3600: clsssServerRPC: send failed with err 12, msg type 7
2016-11-16 17:26:01.414205 : CSSCLNT:3600: clsssCommonClientExit: RPC failure, rc 3
2016-11-16 17:26:01.415208 : USRTHRD:515: clsnpoll_BlockMsg: lost connection with CSS
2016-11-16 17:26:01.415264 : USRTHRD:515: clsncssd_logose: slos [-2], SLOS depend-msg [76], SLOS error-msg [Socket is]
2016-11-16 17:26:01.415285 : USRTHRD:515: clsncssd_logose: SLOS other info is [invalid value ].
2016-11-16 17:26:01.415303 : USRTHRD:515: clsnpollmsg_main: warning vendor clusterware - 1
2016-11-16 17:26:01.415318 : USRTHRD:515: clsnpollmsg_main: calling sync
2016-11-16 17:26:02.358184 : USRTHRD:515: clsnpollmsg_main: sync completed
2016-11-16 17:26:02.358208 : USRTHRD:515: clsnpoll_cleanup: to exit status = 4
2016-11-16 17:26:02.366227 : USRTHRD:515: clsncssd_reboot: fatal 8 clidead 0 mode 3 dev 0
2016-11-16 17:26:02.366271 : USRTHRD:515: clsncssd_reboot: Waiting for reboot
2016-11-16 17:26:03.366360 : USRTHRD:515: clsncssd_reboot: Waiting for reboot
2016-11-16 17:26:04.366435 : USRTHRD:515: clsncssd_reboot: Waiting for reboot
2016-11-16 17:26:05.366521 : USRTHRD:515: clsncssd_reboot: Waiting for reboot <<== Repeating without OS reboot
Changes
Cause
To view full details, sign in with your My Oracle Support account. |
|
Don't have a My Oracle Support account? Click to get started! |
In this Document
Symptoms |
Changes |
Cause |
Solution |
References |