BASE_LETTER Not Working For The Latvian Letter Ģ (Doc ID 2152328.1)

Last updated on NOVEMBER 23, 2016

Applies to:

Oracle Text - Version 11.2.0.3 and later
Information in this document applies to any platform.

Symptoms

Using BASIC_LEXER:
begin
  ctx_ddl.create_preference('"FT_IDCTEXT1_LEX"', 'BASIC_LEXER');
  ctx_ddl.set_attribute('"FT_IDCTEXT1_LEX"', 'BASE_LETTER', 'YES');
end;
/

 

The BASE_LETTER functionality works correctly for all Baltic languages, with one exception: the lower case letter ģ.

The test string used for Latvian is:
ĀāČčĒēĢģĪīĶķĻļŅņŠšŪūŽž.

It is indexed as 'AACCEEGĢIIKKLLNNSSUUZZ' instead of 'AACCEEGGIIKKLLNNSSUUZZ'.


The upper case letter Ģ is handled correctly but the lower case letter is not.

The token should be stored as 'AACCEEGGIIKKLLNNSSUUZZ', with no special characters.
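
The symptom can be checked by indexing the Latvian test string and reading the stored token back from the index token table. The following is only a sketch of such a check, not the exact test case behind this note. The table name LATVIAN_TEST, the column DOC_TEXT and the index name LATVIAN_TEST_IDX are hypothetical. It also assumes a database character set that can store these letters (for example AL32UTF8) and that the FT_IDCTEXT1_LEX preference has already been created as shown earlier.

```sql
-- Hypothetical table; names are illustrative only.
create table latvian_test (
  id       number primary key,
  doc_text varchar2(200)
);

-- Latvian test string (assumes the database character set can store it).
insert into latvian_test values (1, 'ĀāČčĒēĢģĪīĶķĻļŅņŠšŪūŽž');
commit;

-- Build a CONTEXT index that uses the BASIC_LEXER preference with BASE_LETTER = YES.
create index latvian_test_idx on latvian_test (doc_text)
  indextype is ctxsys.context
  parameters ('LEXER FT_IDCTEXT1_LEX');

-- Indexed tokens are stored in the DR$<index name>$I table.
select token_text from dr$latvian_test_idx$i;
```

On an affected system, the token returned by the last query would be expected to contain Ģ in place of the second G, as described above.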

Cause
