After Upgrade to Oracle Big Data Appliance 4.2 Release, Hive Queries with Join and Groupby Clause are Not Working Correctly (Doc ID 2052624.1)

Last updated on OCTOBER 11, 2016

Applies to:

Big Data Appliance Integrated Software - Version 4.2.0 and later
Linux x86-64

Symptoms

An Oracle Big Data Appliance (BDA) CDH cluster was upgraded from v4.1 (CDH 5.3.1) to v4.2 (CDH 5.4.0). After the upgrade, Hive queries with multiple joins or a 'group by' clause do not function correctly.

The following symptoms are observed:
a) The Hive query appears to hang during the Map phase.
b) Queries return the wrong number of rows.
c) Queries fail with the error message below.

Error: java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while processing row [Error getting row data with exception java.lang.ArrayIndexOutOfBoundsException: 82
at org.apache.hadoop.hive.serde2.lazybinary.LazyBinaryUtils.readVInt(LazyBinaryUtils.java:304)
at org.apache.hadoop.hive.serde2.lazybinary.LazyBinaryUtils.checkObjectByteInfo(LazyBinaryUtils.java:183)
at org.apache.hadoop.hive.serde2.lazybinary.LazyBinaryStruct.parse(LazyBinaryStruct.java:142)

 

Cause
