How to Recover Deleted Files From HDFS on a BDA Cluster up to the Time Set By the Filesystem Trash Interval in the Cloudera Manager Trash Feature
Last updated on NOVEMBER 04, 2016
Applies to: Big Data Appliance Integrated Software - Version 4.2.0 and later
On the BDA (in all but the earliest versions) the Hadoop Trash feature is enabled. To confirm, navigate in Cloudera Manager (CM) to:
hdfs > Configuration > Search: Trash
"Use Trash" should be enabled, and "Filesystem Trash Interval" (fs.trash.interval) should default to 1 day.
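The same setting can also be checked from the command line. The following is a minimal sketch, assuming it is run on a cluster node where the HDFS client configuration has been deployed. Hadoop expresses fs.trash.interval in minutes, so the 1 day default appears as 1440, and a value of 0 means the trash is disabled. The helper name describe_trash_interval is hypothetical and used only for illustration.

```shell
# Hypothetical helper: turn an fs.trash.interval value (in minutes) into a readable summary.
describe_trash_interval() {
    # $1 = interval in minutes, as reported by the HDFS configuration
    if [ "$1" -eq 0 ]; then
        echo "trash disabled"
    else
        echo "$1 minutes ($(( $1 / 60 )) hours)"
    fi
}

# On a node with the HDFS client configuration (assumption), read the effective value:
#   describe_trash_interval "$(hdfs getconf -confKey fs.trash.interval)"
```

If the value reported on the command line differs from the one shown in CM, the client configuration on that node may be stale and may need to be redeployed.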
When enabled, the Hadoop Trash system turns the "hadoop fs -rm" command into a move operation: instead of being deleted, files are moved into a folder named /user/$username/.Trash. Files are retained in this "Trash" folder for the interval specified above. Until that interval expires, you can rescue files by manually moving them back to their original location.
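The manual move can be sketched as below. This is an illustration under stated assumptions, not the procedure from this note: it assumes the file is still under the trash's Current subdirectory, where Hadoop preserves the original absolute path. Files deleted earlier may instead sit under a timestamped checkpoint directory inside .Trash, so list the trash first to find the actual location. The function names, user "alice", and path are hypothetical.

```shell
# Hypothetical helper: build the expected trash location of a recently deleted path.
trash_path() {
    # $1 = HDFS user who ran the delete, $2 = original absolute HDFS path
    printf '/user/%s/.Trash/Current%s\n' "$1" "$2"
}

# Hypothetical helper: move a file from the trash back to its original location.
# Requires an HDFS client; the destination parent directory must still exist.
restore_from_trash() {
    hdfs dfs -mv "$(trash_path "$1" "$2")" "$2"
}

# Example with a hypothetical user and path:
#   hdfs dfs -ls -R /user/alice/.Trash          # locate the deleted file first
#   restore_from_trash alice /data/reports/sales.csv
```

Run the commands as the user who deleted the file (or as an HDFS superuser), since each user's trash directory is private to that user by default.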
The goal of this note is to detail how to recover deleted files from the "Trash" directory up to the time set in the CM "Filesystem Trash Interval".