Lucene Search Engine PDF Indexing Issue with "commaspace" and "space" delimited fields in WebSphere Application Server (WAS) (Doc ID 1466818.1)

Last updated on JANUARY 04, 2016

Applies to:

Oracle WebCenter Sites - Version 7.6.1 and later
Information in this document applies to any platform.

Symptoms

Indexing an asset with a PDF BLOB attribute (such as a FirstSite II Document_C asset) results in a Lucene indexed document DefaultSearchField with search tokens delimited with either "commaspace" or "space" instead of a " " whitespace character. Searching for an occurence of a token normally found in the PDF document via the Dash,  Contributor UIs or "http://host:port/cs/ContentServer?pagename=OpenMarket/Xcelerate/Search/Search" yields no search results.

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms