Exadata storage server CELLSRV restart due to RDMA cancel operation time out (Doc ID 2649975.1)

Last updated on MARCH 20, 2020

Applies to:

Oracle Exadata Storage Server Software - Version 19.3.1.0.0 to 19.3.4.0.0 [Release 12.2]
Oracle Exadata Storage Server Software - Version 19.2.10.0.0 to 19.2.10.0.0 [Release 12.2]
Oracle Exadata Storage Server Software - Version 18.1.24.0.0 to 18.1.24.0.0 [Release 12.2]
Information in this document applies to any platform.
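
As a rough illustration of the version ranges listed above, the following sketch checks whether a given storage server software version string falls inside one of them. The function names are hypothetical and not part of any Oracle tool. The sketch also assumes that only the first five dot-separated components matter, so a trailing build-date component, if present, is ignored.

```python
# Affected ranges copied from the "Applies to" list; bounds are inclusive.
AFFECTED_RANGES = [
    ("19.3.1.0.0", "19.3.4.0.0"),
    ("19.2.10.0.0", "19.2.10.0.0"),
    ("18.1.24.0.0", "18.1.24.0.0"),
]


def parse_version(text):
    """Convert '19.3.2.0.0' into (19, 3, 2, 0, 0) for numeric comparison.

    Assumption: only the first five components are significant, so any
    extra trailing component (such as a build date) is dropped.
    """
    parts = text.strip().split(".")
    return tuple(int(part) for part in parts[:5])


def is_affected(version):
    """Return True if the version lies within any listed range."""
    v = parse_version(version)
    return any(
        parse_version(low) <= v <= parse_version(high)
        for low, high in AFFECTED_RANGES
    )


if __name__ == "__main__":
    for candidate in ("19.3.2.0.0", "19.2.10.0.0", "19.3.5.0.0"):
        print(candidate, is_affected(candidate))
```

Comparing tuples of integers rather than raw strings avoids ordering mistakes such as treating "19.2.10" as lower than "19.2.9".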

Symptoms

In rare circumstances, an RDMA cancel operation on a storage server, issued to clean up stale connections, may time out. The timeout can cause the cellsrv process to crash and then restart automatically. If this issue occurs on more than one storage server at around the same time, it may cause an ASM disk group to dismount and affect database availability.

This issue affects storage servers running one of the following Exadata versions:

Storage server $CELLTRACE/alert.log shows information similar to the following:

Typically, cellsrv restart occurs with negligible impact to database workload.

Cause

In this Document
Symptoms
Cause
Solution
 Recommended Action
 Alternative Action