My Oracle Support Banner

UTF-8 Encoding of Diacritical Characters Not Properly Maintained when Inserting Data from JSON Files Into Oracle Database in ODI 12c (Doc ID 2200952.1)

Last updated on AUGUST 07, 2024

Applies to:

Oracle Data Integrator - Version 12.1.3.0.0 and later
Oracle Data Integrator on Marketplace - Version 1.0.2 and later
Oracle Data Integrator Cloud Service - Version 17.1.3 and later
Information in this document applies to any platform.

Symptoms

When processing a JSON file encoded in UTF-8, and inserting the data into an Oracle database with AL32UTF8 character set, the Oracle Data Integrator (ODI) 12c application does not maintain the proper encoding.

For example, the diacritical É character with rawtotext value of C38, is inserted into the database as two characters with the code EFBFBD, repeated twice: EFBFBDEFBFBD.

The following sequence of chars in the JSON file:

Note that correct accented / UTF-8 characters are shown:

When viewing the contents of JSON file from a Datastore on Complex File technology: ODI Studio > Designer > Models > myComplexFileModel > myComplexFileDatastore > right-click "View Data", ODI does not show the accented / diacritical character correctly.

Cause

To view full details, sign in with your My Oracle Support account.

Don't have a My Oracle Support account? Click to get started!


My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.