My Oracle Support Banner

Details about Hadoop Archives (HAR files) Creation and Usage with Oracle Big Data Appliance V2.2.* (Doc ID 1590846.1)

Last updated on OCTOBER 20, 2019

Applies to:

Big Data Appliance Integrated Software - Version 2.0.1 and later
Linux x86-64

Purpose

This document provides details about Hadoop Archives (HAR files), Creation and Usage of HAR files for Oracle Big Data Appliance (BDA) V2.2.* .

Scope

 HDFS Administrator

Details

To view full details, sign in with your My Oracle Support account.

Don't have a My Oracle Support account? Click to get started!


In this Document
Purpose
Scope
Details
 What are HAR files?
 Is creating a HAR file a resolution for 'data node has too many data blocks' Health Check?
 Does creating HAR files a resolution for MapReduce Performance problem with Small files?
 How to create HAR files?
 Does Archiving delete input files?
 Can one extract original files from HAR similar to 'tar -xvf'? 
 Is there a checksum command to check if the contents of the created archive are accurate and NOT corrupted?
 How to locate files in HAR files?
 How to access HAR files from Pig?
References

My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.