How to Modify ODI 12c Hive KMs to Create Parquet Tables Instead of Default Serialization (Text) (Doc ID 2149684.1)

Last updated on JUNE 21, 2016

Applies to:

Oracle Data Integrator - Version 12.1.3.0.0 to 12.2.1.1.0 [Release 12c]
Information in this document applies to any platform.

Goal

How to modify Oracle Data Integrator (ODI) Hive KMs to create Parquet tables.

Full support of Hadoop technology and improved Hive KMs were included starting with 12.1.3.0.1 and continuing into 12.2.1. In those versions, the Hive KMs create and store tables using the default serialization (text).

This works fine for narrow datasets, but with very wide datasets, where hundreds of columns are not unusual, it presents a performance issue.

Changing to the Parquet format is desirable because Parquet, being a columnar storage format, is a better fit for such wide datasets.
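
As a rough illustration of the difference at the Hive level, the sketch below contrasts a table created with no storage clause against one declared as Parquet. It is not taken from the ODI Hive KMs. The table and column names are hypothetical, and the actual DDL generated by a KM will differ. The only assumption about Hive itself is that the STORED AS PARQUET clause is available (Hive 0.13 and later).

```sql
-- Hypothetical table and column names, for illustration only.

-- No STORED AS clause: Hive falls back to its default file format,
-- which is plain text unless hive.default.fileformat has been changed.
CREATE TABLE wide_events_text (
  event_id   BIGINT,
  event_name STRING,
  amount     DOUBLE
);

-- Same columns, stored in the columnar Parquet format.
CREATE TABLE wide_events_parquet (
  event_id   BIGINT,
  event_name STRING,
  amount     DOUBLE
)
STORED AS PARQUET;
```

With a columnar layout, a query that selects only a few columns from a very wide table does not need to read every column of every row.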

Solution
