My Oracle Support Banner

How to make - or _ or other punctuation chars part of a token and prevent them from being tokenized (Doc ID 1040641.1)

Last updated on JANUARY 09, 2018

Applies to:

Oracle Knowledge - Version 8.0.x to 8.5 [Release 8.0 to 8.5]
Information in this document applies to any platform.
Information in this document applies to any platform.

Symptoms

How to make - or _ or other punctuation chars part of a token and prevent them from being tokenized?

There are two very specific types of customizations that can be made to the tokenizer for handeling specific cases of tokenization around non alpha characters.

Some examples of their uses are social security numbers, words with - or _, filenames or error codes that contain underscores, phone numbers etc.  By tokenizing these the search will use them as a single token rather than searching on individual pieces of the word, filename, social security number etc.

Cause

To view full details, sign in with your My Oracle Support account.

Don't have a My Oracle Support account? Click to get started!


My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.