My Oracle Support Banner

Does ISS Use Ngram Tokens? (Doc ID 2310306.1)

Last updated on SEPTEMBER 10, 2019

Applies to:

Oracle Communications Indexing and Search Service - Version and later
Information in this document applies to any platform.


ISS 1u5 p29

Does ISS use ngram tokens like SOLR?

N-Gram Tokenizer
Reads the field text and generates n-gram tokens of sizes in the given range.
Factory class: solr.NGramTokenizerFactory
minGramSize: (integer, default 1) The minimum n-gram size, must be > 0.
maxGramSize: (integer, default 2) The maximum n-gram size, must be >= minGramSize.
Default behavior. Note that this tokenizer operates over the whole field. It does not break the field at whitespace. As a result, the space character is included in the encoding.
xml#666666solid ]]>
In: "hey man"
Out: "h", "e", "y", " ", "m", "a", "n", "he", "ey", "y ", " m", "ma", "an"
With an n-gram size range of 4 to 5:
xml#666666solid ]]>
In: "bicycle"
Out: "bicy", "bicyc", "icyc", "icycl", "cycl", "cycle", "ycle"



To view full details, sign in with your My Oracle Support account.

Don't have a My Oracle Support account? Click to get started!

In this Document

My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.