Coherence Cluster Got Restarted With NullPointerException on PartitionedCache$Storage.moveResourcesToPrimary (Doc ID 2219029.1)

Last updated on JANUARY 05, 2017

Applies to:

Oracle Coherence - Version 3.7.1.9 and later
Information in this document applies to any platform.

Symptoms

Coherence cluster in ENDANGERED state after 2 nodes out of 45 experienced OOM. Customer has 5 coherence servers, each running 9 coherence nodes. Customer has had out of memory issues with 2 out of 45 nodes at sometime and Customer had to restart the entire cluster since the cluster was in ENDANGERED state. Though few of the nodes left the cluster due to OOME other nodes should not abruptly terminated the partitioned cache service, but that got happened during the partition transferring/redistribution time. The NPE got happened while moving the partitions when discovered that orphaned and their backups were missing.

 

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms