Siebel Servers Failing To Start With Error "SBL-SCM-00018: Could not open connection to Siebel Gateway configuration store" When Primary Gateway Node is Down with Gateway Native Clustering
(Doc ID 2905206.1)
Last updated on APRIL 19, 2024
Applies to:
Siebel CRM - Version 21.3 and later
Information in this document applies to any platform.
Symptoms
Siebel Servers fail to start when the primary Gateway node is down.
The environment is a 5-node Gateway cluster serving more than 20 Siebel Servers, with the Gateway nodes distributed across three data centers: DC1 (Gateway1 + Gateway2), DC2 (Gateway3 + Gateway4), and DC3 (Gateway5). During a DR exercise in which DC1 (Gateway1 + Gateway2) is shut down and DC2 (Gateway3 + Gateway4) and DC3 (Gateway5) are brought up, the Siebel Servers fail to start. They keep trying to connect to Gateway1 instead of retrying against an available node. In addition, all metafiles on the Gateway and Siebel Server machines still contain Gateway1 information, even though both the registry and the cgw on that node are down.
Siebsrvr.log shows the following errors:
NameServerLayerLog Error 1 0000000263006b9d:0 2022-08-20 23:27:31 Gateway Registry failed: Registry connection failed.
GtwySvcFrmwrkLog Error 1 0000000263006b9d:0 2022-08-20 23:27:31 Service Discovery initialization failed: Unable to initialize registry.
GtwySvcFrmwrkLog Error 1 0000000263006b9d:0 2022-08-20 23:27:31 Service Discovery initialization failed: Registry Generic Exception.
ServerLog ServerStartup 1 0000000263006b9d:0 2022-08-20 23:27:31 Error: Failed to discover Gateway
SisnTcpIp SisnSockError 1 0000000263006b9d:0 2022-08-20 23:27:31 1444415104: [SISNAPI TLS] failed to read PARAM_GENERIC_ENABLE_OPENSSL parameter
NameServerLayerLog Error 1 0000000263006b9d:0 2022-08-20 23:27:31 1444415104: [cgclientservice] TLS handshake to gateway failed gateway1:9111
GenericLog GenericError 1 0000000263006b9d:0 2022-08-20 23:27:31 (scmnsclnt.cpp (135) err=2555922 sys=0) SBL-SCM-00018: Could not open connection to Siebel Gateway configuration store (gateway1:9111).
The Siebel Server is expected to try another available Gateway node, but it does not. The metafile also still references Gateway1, which is causing this behavior.
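To confirm which Gateway endpoint a Siebel Server keeps targeting, the SBL-SCM-00018 lines can be pulled out of the log and counted per host and port. The following is a minimal sketch, not an Oracle-supplied tool: the function name `failed_gateway_endpoints` and the regular expression are illustrative assumptions, based only on the message format shown in the log excerpt above.

```python
import re
from collections import Counter

# Assumption: the endpoint appears as "(host:port)" after the SBL-SCM-00018
# message text, as in the log excerpt in this document.
SCM_00018 = re.compile(
    r"SBL-SCM-00018:.*\((?P<host>[^():\s]+):(?P<port>\d+)\)"
)


def failed_gateway_endpoints(log_lines):
    """Count SBL-SCM-00018 failures per (host, port) endpoint."""
    counts = Counter()
    for line in log_lines:
        match = SCM_00018.search(line)
        if match:
            counts[(match.group("host"), int(match.group("port")))] += 1
    return counts


if __name__ == "__main__":
    sample = [
        "ServerLog ServerStartup 1 0000000263006b9d:0 2022-08-20 23:27:31 "
        "Error: Failed to discover Gateway",
        "GenericLog GenericError 1 0000000263006b9d:0 2022-08-20 23:27:31 "
        "(scmnsclnt.cpp (135) err=2555922 sys=0) SBL-SCM-00018: Could not open "
        "connection to Siebel Gateway configuration store (gateway1:9111).",
    ]
    for (host, port), hits in failed_gateway_endpoints(sample).items():
        print(f"{host}:{port} failed {hits} time(s)")
```

If every hit points at the same stopped node while other Gateway nodes are up, that matches the symptom described here: the server is not failing over to another cluster member.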
STEPS
-----------------------
The issue can be reproduced at will with the following steps:
1) Deploy a Siebel Enterprise with a 5-node Gateway cluster, 20+ Siebel Servers, and 4 AI servers.
2) Stop the complete enterprise.
3) Bring up Gateway3, Gateway4, and Gateway5 (3 of the 5 nodes), which is enough to maintain the ZooKeeper quorum.
4) Start the Siebel Servers. They fail to start and keep trying to connect to Gateway1.
5) The metafile still contains Gateway1 information, even though this node is completely down.
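The quorum statement in step 3 follows from ZooKeeper's majority rule: an ensemble of n nodes needs floor(n/2) + 1 members up. The sketch below works through that arithmetic for the data center layout described in the Symptoms section. The helper names `quorum_size` and `survives_outage` are illustrative only and are not part of any Siebel or ZooKeeper API.

```python
# Gateway nodes per data center, as described in the Symptoms section.
LAYOUT = {
    "DC1": ["Gateway1", "Gateway2"],
    "DC2": ["Gateway3", "Gateway4"],
    "DC3": ["Gateway5"],
}


def quorum_size(ensemble_size):
    """Smallest strict majority of an ensemble with the given size."""
    return ensemble_size // 2 + 1


def survives_outage(layout, down_dcs):
    """Return True if the nodes outside down_dcs still form a majority."""
    total = sum(len(nodes) for nodes in layout.values())
    up = sum(len(nodes) for dc, nodes in layout.items() if dc not in down_dcs)
    return up >= quorum_size(total)


if __name__ == "__main__":
    print("quorum for 5 nodes:", quorum_size(5))
    for dc in LAYOUT:
        print(f"{dc} down -> quorum kept: {survives_outage(LAYOUT, {dc})}")
    print("DC1 and DC2 down -> quorum kept:",
          survives_outage(LAYOUT, {"DC1", "DC2"}))
```

With this layout, losing any single data center leaves at least 3 of 5 nodes, so the quorum itself is not the reason the Siebel Servers fail to start in this scenario.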
Changes
Cause