Oracle Text cannot filter binary documents and ctxhx returns ORA_DRG-11222 (Doc ID 459233.1)

Last updated on JUNE 21, 2011

Applies to:

Oracle Text - Version: 10.2.0.1 to 10.2.0.4 - Release: 10.2 to 10.2
SUSE \ UnitedLinux x86-64
Checked for relevance on 14-Jun-2010

Symptoms

Oracle Text does not index any tokens when indexing MS Word or PDF files. The token table is empty, and the view ctx_user_index_errors contains no errors.

When the ctxhx binary is called directly from the operating system on any binary document (e.g. PDF, DOC):

$ ctxhx doc.pdf doc.htm

The resulting file doc.htm contains only ORA_DRG-11222 errors.
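
The check on the output file can be scripted when several documents need to be tested. The following is a minimal sketch, not part of Oracle Text: the function name has_drg_11222 is hypothetical, and it assumes doc.htm was already produced by the ctxhx call shown above.

```shell
#!/bin/sh
# Hypothetical helper (not an Oracle utility): succeeds when the given
# ctxhx output file contains a DRG-11222 message.
has_drg_11222() {
    # grep -q exits 0 on a match and 1 when nothing matches
    grep -q 'DRG-11222' "$1"
}

# Example use, assuming doc.htm was written by: ctxhx doc.pdf doc.htm
if [ -f doc.htm ]; then
    if has_drg_11222 doc.htm; then
        echo "doc.htm contains DRG-11222: the document was not filtered"
    else
        echo "doc.htm contains no DRG-11222 message"
    fi
fi
```

The pattern matches the bare code DRG-11222, so it also catches the ORA_DRG-11222 form seen in this case.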

Cause
