UTF-8 Encoding of Diacritical Characters not Properly Maintained When Inserting Data from JSON Files Into Oracle Database In ODI 12c (Doc ID 2200952.1)

Last updated on NOVEMBER 11, 2016

Applies to:

Oracle Data Integrator - Version 12.1.3.0.0 and later
Information in this document applies to any platform.

Symptoms

When processing a JSON file encoded in UTF-8, and inserting the data into an Oracle database with AL32UTF8 character set, the Oracle Data Integrator (ODI) 12c application does not maintain the proper encoding.

For example, the diacritical É character with rawtotext value of C38, is inserted into the database as two characters with the code EFBFBD, repeated twice: EFBFBDEFBFBD.

The following sequence of chars in the JSON file:

CHÉ

shows in the database as:

CH��

Note that correct accented / UTF-8 characters are shown:

When viewing the contents of JSON file from a Datastore on Complex File technology: ODI Studio > Designer > Models > myComplexFileModel > myComplexFileDatastore > right-click "View Data", ODI does not show the accented / diacritical character correctly.

Changes

 

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms