Lucene Search Engine PDF Indexing Issue with "commaspace" and "space" delimited fields in WebSphere Application Server (WAS)
(Doc ID 1466818.1)
Last updated on OCTOBER 07, 2021
Applies to: Oracle WebCenter Sites - Version 7.6.1 and later
Information in this document applies to any platform.
Indexing an asset with a PDF BLOB attribute (such as a FirstSite II Document_C asset) produces a Lucene indexed document whose DefaultSearchField contains search tokens delimited by either "commaspace" or "space" instead of a " " whitespace character. As a result, searching for an occurrence of a token normally found in the PDF document, whether through the Dash UI, the Contributor UI, or "http://host:port/cs/ContentServer?pagename=OpenMarket/Xcelerate/Search/Search", yields no search results.
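The following is a minimal sketch, not the WebCenter Sites or Lucene code path, of why such a field value would return no hits. It assumes that the affected text arrives as one run of characters in which the words "space" and "commaspace" stand in for the real separators, and that search terms are matched against tokens split on whitespace and commas. The class name DelimiterSketch, its methods, and the sample strings are hypothetical and chosen only for illustration.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

public class DelimiterSketch {

    // Hypothetical stand-in for the analysis step: lowercases the text and
    // splits it on runs of whitespace and commas. This is NOT the Lucene API.
    static List<String> tokenize(String text) {
        return Arrays.asList(text.trim().toLowerCase(Locale.ROOT).split("[\\s,]+"));
    }

    // A term matches only if it equals one of the produced tokens exactly.
    static boolean matches(String fieldValue, String term) {
        return tokenize(fieldValue).contains(term.toLowerCase(Locale.ROOT));
    }

    public static void main(String[] args) {
        // Assumed sample values; the real field content will differ.
        String expected = "Annual report, 2011";
        String affected = "Annualspacereportcommaspace2011";

        System.out.println(tokenize(expected));   // three separate tokens
        System.out.println(tokenize(affected));   // one long token
        System.out.println(matches(expected, "report"));
        System.out.println(matches(affected, "report"));
    }
}
```

Under these assumptions the affected value collapses into a single token, so a search for a word that appears in the PDF has nothing to match against.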