On BDA A Resource Manager Goes Down with "Exception while executing a ZK operation."
(Doc ID 2542137.1)
Last updated on NOVEMBER 06, 2019
Applies to:Big Data Appliance Integrated Software - Version 4.9.0 and later
On BDA, a Resource Manager goes down with errors like below:
From the Resource Manager logs:
<TIMESTAMP> INFO org.apache.zookeeper.ClientCnxn: Session establishment complete on server <HOSTNAME>.<DOMAINNAME>/<PRIVATE_IP>:<PORT>, sessionid = <ID>, negotiated timeout = 60000
<TIMESTAMP> INFO org.apache.zookeeper.ClientCnxn: Unable to read additional data from server sessionid <ID>, likely server has closed socket, closing socket connection and attempting reconnect
<TIMESTAMP> INFO org.apache.hadoop.yarn.server.resourcemanager.recovery.ZKRMStateStore: Exception while executing a ZK operation. org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss
From the Zookeeper logs:
<TIMESTAMP> INFO org.apache.zookeeper.server.NIOServerCnxn: Closed socket connection for client /<PRIVATE_IP>:<PORT> which had sessionid <ID>
To view full details, sign in with your My Oracle Support account.
Don't have a My Oracle Support account? Click to get started!
In this Document