HBase Region Servers are Not Responsive Leading to No Active HBase Master with "TimeoutIOException: Failed to get sync result after 300000 ms for txid, WAL system stuck?" Failures
(Doc ID 2897681.1)
Last updated on JULY 20, 2024
Applies to:
Big Data Appliance Integrated Software - Version 5.2.0 and laterLinux x86-64
Symptoms
NOTE: In the examples that follow, user details, cluster names, hostnames, directory paths, filenames, etc. represent a fictitious sample (and are used to provide an illustrative example only). Any similarity to actual persons, or entities, living or dead, is purely coincidental and not intended in any manner.
On BDA 5.2/CDH 6.3.4 HBase Region Servers become unresponsive leading to no active HBase Master.
1. The HBase Region Server logs show that the WAL system is stuck hence the regions fail to open. Errors look like:
Cause
To view full details, sign in with your My Oracle Support account. |
|
Don't have a My Oracle Support account? Click to get started! |
In this Document
Symptoms |
Cause |
Solution |