My Oracle Support Banner

HBase Region Servers are Not Responsive Leading to No Active HBase Master with "TimeoutIOException: Failed to get sync result after 300000 ms for txid, WAL system stuck?" Failures (Doc ID 2897681.1)

Last updated on JULY 20, 2024

Applies to:

Big Data Appliance Integrated Software - Version 5.2.0 and later
Linux x86-64

Symptoms

NOTE: In the examples that follow, user details, cluster names, hostnames, directory paths, filenames, etc. represent a fictitious sample (and are used to provide an illustrative example only). Any similarity to actual persons, or entities, living or dead, is purely coincidental and not intended in any manner.

On BDA 5.2/CDH 6.3.4 HBase Region Servers become unresponsive leading to no active HBase Master.

1. The HBase Region Server logs show that the WAL system is stuck hence the regions fail to open. Errors look like:

 

Cause

To view full details, sign in with your My Oracle Support account.

Don't have a My Oracle Support account? Click to get started!


In this Document
Symptoms
Cause
Solution


My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.