How to make - or _ or other punctuation chars part of a token and prevent them from being tokenized (Doc ID 1040641.1)

Last updated on MARCH 30, 2016

Applies to:

Oracle Knowledge - Version 8.0.x to 8.5 [Release 8.0 to 8.5]
Information in this document applies to any platform.
Information in this document applies to any platform.

Symptoms

How to make - or _ or other punctuation chars part of a token and prevent them from being tokenized?

There are two very specific types of customizations that can be made to the tokenizer for handeling specific cases of tokenization around non alpha characters.

Some examples of their uses are social security numbers, words with - or _, filenames or error codes that contain underscores, phone numbers etc.  By tokenizing these the search will use them as a single token rather than searching on individual pieces of the word, filename, social security number etc.

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms