Querying a Hive Table via the Hive CLI or in the Hue Web UI Using a RegexSerde as the Row Format Fails with Execution Error on Oracle Big Data Appliance

(Doc ID 1619024.1)

Last updated on FEBRUARY 04, 2014

Applies to:

Big Data Appliance Integrated Software - Version 2.2.1 and later
Linux x86-64


Querying a Hive table via the Hive CLI or in the Hue Web UI using a RegexSerde as the row format fails with:

1. From the Hive CLI:

Diagnostic Messages for this Task:
java.lang.RuntimeException: Error in configuring object
        at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
        at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
        at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:413)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:332)
        at org.apache.hadoop.mapred.Child$4.run(Child.java:268)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:415)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1408)
        at org.apache.hadoop.mapred.Child.main(Child.java:262)
Caused by: java.lang.reflect.InvocationTargetException
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.ja

FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask
MapReduce Jobs Launched:
Job 0: Map: 1  Reduce: 1   HDFS Read: 0 HDFS Write: 0 FAIL
Total MapReduce CPU Time Spent: 0 msec

2. From the Hue UI:

Note that a query like "select * from <table_name>" will work, since no MapReduce job is triggered. A query that does trigger a MapReduce job, for example "select count(*)", fails with the errors shown above.

Steps to reproduce the error:

1. Take, for example, a web log with a snippet like the one below:

29-JAN-2014 12:14:10 * oracle "GET /twiki/x/y/z/test?=Test.test HTTP/1.1" 200 9
29-JAN-2014 12:14:11 * oracle "GET /twiki/x/y/z/test1?test1&test1 HTTP/1.1" 200 99
29-JAN-2014 12:14:12 * oracle "GET /file1.gif HTTP/1.0" 200 9999
29-JAN-2014 12:14:13 * oracle "GET /file2.gif HTTP/1.0" 200 999

located at: /user/oracle/app_log/app_log
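
As an illustration of how a regular expression can split these log lines into columns, here is a minimal Python sketch. The pattern and the column names are assumptions made for this example only; they are not taken from the table definition used in this note. Each capture group corresponds to one column, which is the general idea behind a regex-based row format.

```python
import re

# Hypothetical pattern (an assumption, not the regex used in this note):
# date, time, a literal "*", user, quoted request, status code, size.
LOG_PATTERN = re.compile(r'^(\S+) (\S+) \* (\S+) "([^"]*)" (\d+) (\d+)$')

# Hypothetical column names, one per capture group.
COLUMNS = ("log_date", "log_time", "username", "request", "status", "size")

# The sample lines from the web log snippet above.
SAMPLE_LINES = [
    '29-JAN-2014 12:14:10 * oracle "GET /twiki/x/y/z/test?=Test.test HTTP/1.1" 200 9',
    '29-JAN-2014 12:14:11 * oracle "GET /twiki/x/y/z/test1?test1&test1 HTTP/1.1" 200 99',
    '29-JAN-2014 12:14:12 * oracle "GET /file1.gif HTTP/1.0" 200 9999',
    '29-JAN-2014 12:14:13 * oracle "GET /file2.gif HTTP/1.0" 200 999',
]


def parse_line(line):
    """Return a dict mapping column name to string value, or None if the line does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    return dict(zip(COLUMNS, match.groups()))


# Parse every sample line and print the resulting rows.
rows = [parse_line(line) for line in SAMPLE_LINES]
for row in rows:
    print(row)
```

Checking a candidate pattern against a few sample lines in this way, before using it in a table definition, can help separate regex problems from the execution error described in this note.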

2. Create a Hive table using a RegexSerde as the row format:

b) From Hue "select * from regexserde_table", results in the error as below:



