Lucene Search Engine PDF Indexing Issue with "commaspace" and "space" delimited fields in WebSphere Application Server (WAS)
Last updated on JANUARY 04, 2016
Applies to:Oracle WebCenter Sites - Version 7.6.1 and later
Information in this document applies to any platform.
Indexing an asset with a PDF BLOB attribute (such as a FirstSite II Document_C asset) results in a Lucene indexed document DefaultSearchField with search tokens delimited with either "commaspace" or "space" instead of a " " whitespace character. Searching for an occurence of a token normally found in the PDF document via the Dash, Contributor UIs or "http://host:port/cs/ContentServer?pagename=OpenMarket/Xcelerate/Search/Search" yields no search results.
Sign In with your My Oracle Support account
Don't have a My Oracle Support account? Click to get started
My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms