Oracle Text and UCM - Lexer Configurations (Doc ID 871212.1)

Last updated on JANUARY 29, 2022

Applies to:

Oracle WebCenter Content - Version 10.0 to [Release 10gR3 to 11g]
Information in this document applies to any platform.


UCM uses Oracle Text to index the extracted text of content items. As the extracted text is inserted into the indexes, it undergoes lexical analysis, which breaks the text into indexable tokens. The lexer settings in the database therefore affect how text is stored in and retrieved from the database indexes, and in turn how content items can be searched through the UCM interface.
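To illustrate why lexer settings matter for search, the following is a toy Python sketch of tokenization. It is not Oracle Text's algorithm. The function `tokenize` and its parameters `keep_case` and `join_chars` are hypothetical names invented for this illustration. It only shows that the same extracted text can yield different index tokens depending on how the lexer is configured.

```python
import re


def tokenize(text, keep_case=False, join_chars=""):
    """Toy lexer: split text into tokens on non-alphanumeric characters.

    keep_case  -- hypothetical switch; when False, tokens are folded to uppercase.
    join_chars -- hypothetical set of extra characters kept inside a token.
    """
    # Characters allowed inside a token: letters, digits, plus any join characters.
    token_chars = "A-Za-z0-9" + re.escape(join_chars)
    tokens = re.findall(f"[{token_chars}]+", text)
    if not keep_case:
        tokens = [t.upper() for t in tokens]
    return tokens


sample = "Part no. AB-1234 shipped"

# With no join characters, the hyphen splits the part number in two.
default_tokens = tokenize(sample)

# Treating the hyphen as part of a token keeps the part number whole.
joined_tokens = tokenize(sample, join_chars="-")

print(default_tokens)
print(joined_tokens)
```

In this sketch, a search for the exact token `AB-1234` can only match the second token list, which is the kind of difference in search behavior that lexer configuration produces.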

Appendix D of the Oracle Text Reference guide discusses lexer types in greater detail. This note demonstrates several lexers that can be used with UCM. The examples contained here assume the database version is at least Oracle  Another assumption made is that in the file <UCM>/config/config.cfg the config setting SearchIndexerEngineName should be set like

The examples below have not been fully tested with a config setting of SearchIndexerEngineName=DATABASE.FULLTEXT. 
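Before trying the lexer examples, it can help to check which search engine setting an instance is using. The sketch below is a minimal Python helper that reads `key=value` text and flags the `DATABASE.FULLTEXT` case mentioned above. The helper names, the sample config contents, and the assumption that `#` starts a comment line are illustrative only and are not taken from this note.

```python
def read_config_value(config_text, key):
    """Return the last value assigned to key in key=value text, or None."""
    value = None
    for line in config_text.splitlines():
        line = line.strip()
        # Skip blanks, lines without '=', and (assumed) '#' comment lines.
        if not line or line.startswith("#") or "=" not in line:
            continue
        name, _, raw = line.partition("=")
        if name.strip() == key:
            value = raw.strip()
    return value


def uses_untested_engine(config_text):
    """True when SearchIndexerEngineName is DATABASE.FULLTEXT."""
    engine = read_config_value(config_text, "SearchIndexerEngineName")
    return engine == "DATABASE.FULLTEXT"


# Made-up sample contents standing in for <UCM>/config/config.cfg.
sample_cfg = "ExampleSetting=1\nSearchIndexerEngineName=DATABASE.FULLTEXT\n"

if uses_untested_engine(sample_cfg):
    print("Warning: examples not fully tested with DATABASE.FULLTEXT")
```

To check a real instance, the same function could be given the text read from the instance's config.cfg file.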



In this Document
 UCM Default Lexer
 Which lexer to use?
 Using a Multi-Lexer