UTF-8 Encoding of Diacritical Characters not Properly Maintained When Inserting Data from JSON Files Into Oracle Database In ODI 12c
Last updated on NOVEMBER 11, 2016
Applies to:Oracle Data Integrator - Version 184.108.40.206.0 and later
Information in this document applies to any platform.
When processing a JSON file encoded in UTF-8, and inserting the data into an Oracle database with AL32UTF8 character set, the Oracle Data Integrator (ODI) 12c application does not maintain the proper encoding.
For example, the diacritical É character with rawtotext value of C38, is inserted into the database as two characters with the code EFBFBD, repeated twice: EFBFBDEFBFBD.
The following sequence of chars in the JSON file:
shows in the database as:
Note that correct accented / UTF-8 characters are shown:
- When editing the JSON file with a text editor such as Notepad...
- When viewing the contents of JSON file from a Datastore on File technology: ODI Studio > Designer > Models > myFileModel > myFileDatastore > right-click "View Data"
When viewing the contents of JSON file from a Datastore on Complex File technology: ODI Studio > Designer > Models > myComplexFileModel > myComplexFileDatastore > right-click "View Data", ODI does not show the accented / diacritical character correctly.
Sign In with your My Oracle Support account
Don't have a My Oracle Support account? Click to get started
My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms