Exadata storage server CELLSRV restart due to RDMA cancel operation time out
(Doc ID 2649975.1)
Last updated on MARCH 20, 2020
Applies to:Oracle Exadata Storage Server Software - Version 22.214.171.124.0 to 126.96.36.199.0 [Release 12.2]
Oracle Exadata Storage Server Software - Version 188.8.131.52.0 to 184.108.40.206.0 [Release 12.2]
Oracle Exadata Storage Server Software - Version 220.127.116.11.0 to 18.104.22.168.0 [Release 12.2]
Information in this document applies to any platform.
In rare circumstances, an RDMA cancel operation on a storage server to clean up stale connections may time out, which can lead to cellsrv process crash and automatic cellsrv restart. If this issue occurs on more than one storage server around the same time, it may cause ASM disk group dismount and affect database availability.
This issue affects storage servers running one of the following Exadata versions:
- 19.3.1 through 19.3.4
Storage server $CELLTRACE/alert.log shows information similar to the following:
Typically, cellsrv restart occurs with negligible impact to database workload.
To view full details, sign in with your My Oracle Support account.
Don't have a My Oracle Support account? Click to get started!
In this Document