My Oracle Support Banner

Lucene Search Engine PDF Indexing Issue with "commaspace" and "space" delimited fields in WebSphere Application Server (WAS) (Doc ID 1466818.1)

Last updated on APRIL 24, 2023

Applies to:

Oracle WebCenter Sites - Version 7.6.1 and later
Information in this document applies to any platform.

Symptoms

Indexing an asset with a PDF BLOB attribute (such as a FirstSite II Document_C asset) results in a Lucene indexed document DefaultSearchField with search tokens delimited with either "commaspace" or "space" instead of a " " whitespace character. Searching for an occurence of a token normally found in the PDF document via the Dash,  Contributor UIs or "http://host:port/cs/ContentServer?pagename=OpenMarket/Xcelerate/Search/Search" yields no search results.

Cause

To view full details, sign in with your My Oracle Support account.

Don't have a My Oracle Support account? Click to get started!


In this Document
Symptoms
Cause
Solution


My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.