My Oracle Support Banner

Oracle Text and UCM - Lexer Configurations (Doc ID 871212.1)

Last updated on MAY 22, 2020

Applies to:

Oracle WebCenter Content - Version 10.0 to 11.1.1.5.0 [Release 10gR3 to 11g]
Information in this document applies to any platform.
***Checked for relevance on 03-April-2011***



Goal

UCM uses Oracle Text to index the extracted text of content items. As the extracted text of content items is inserted into the indexes, a process of lexical analysis is performed, which tokenizes the indexable words of the text. The lexer settings in the database can affect and alter the manner in which text is stored and accessed in the database indexes, as well as how content items are searched upon through the UCM interface.

Appendix D of the Oracle Text Reference guide discusses lexer types in greater detail. This note will demonstrate use of several lexers that can be used with UCM.  The examples contained here assume the database version is at least Oracle 11.1.0.7.  Another assumption made is that in the file <UCM>/config/config.cfg the config setting SearchIndexerEngineName should be set like
SearchIndexerEngineName=OracleTextSearch.

The examples below have not been fully tested with a config setting of SearchIndexerEngineName=DATABASE.FULLTEXT. 

SearchIndexerEngineName=OracleTextSearch

 

For more information on lexers and indexes in Oracle Text, see the Oracle Text Reference guide. The 11g reference library has links to Oracle Text documentation at this link.

http://www.oracle.com/pls/db111/portal.portal_db?selected=7&frame=

 

Solution

To view full details, sign in with your My Oracle Support account.

Don't have a My Oracle Support account? Click to get started!


In this Document
Goal
Solution
 UCM Default Lexer
 Which lexer to use?
 Using the BASIC_LEXER or AUTO_LEXER
 Using a Multi-Lexer
References

My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.