BDD : Project Got Partial Sampling Data Set Even The Record Limit Has Not Been Reached (Doc ID 2185774.1)

Last updated on MAY 31, 2017

Applies to:

Oracle Big Data Discovery - Version 1.3.2.0.0 and later
Information in this document applies to any platform.

Symptoms

After running a -refresh EDP_CLI command on a project, you found  the number of records seen in Explore/Transform/Discover did not equal the number of records in the source dataset. Explore shows a 6.4M of 7.8M records.

But the configuration of properties both in Studio as well as in edp.properties file show 10,000,000 as the dataset limit.

And no errors in Hadoop or BDD log files.

Record Counts are not tying out:
7,761,177 in source Hive table
BDD Filtered 6,421,903 of 7,761,177 (Seen in Explore tab)

dgraph.out log message shows...

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms