INTELLIGENT WORK FORUMS
FOR COMPUTER PROFESSIONALS

Log In

Come Join Us!

Are you a
Computer / IT professional?
Join Tek-Tips Forums!
  • Talk With Other Members
  • Be Notified Of Responses
    To Your Posts
  • Keyword Search
  • One-Click Access To Your
    Favorite Forums
  • Automated Signatures
    On Your Posts
  • Best Of All, It's Free!

*Tek-Tips's functionality depends on members receiving e-mail. By joining you are opting in to receive e-mail.

Posting Guidelines

Promoting, selling, recruiting, coursework and thesis posting is forbidden.

Jobs

Loading and Retrieving UTF8 Data from Databases

Loading and Retrieving UTF8 Data from Databases

(OP)
Hi,

Here is my situation...

Data has been extracted from an Oracle database in UTF-8 format.

Data was loaded into a Teradata database into fields defined as LATIN (not defined as Unicode).  

For sake of simplicity I will use first_name as an example field.

This column in the Teradata database is defined as follows:
first_name VARCHAR(50) CHARACTER SET LATIN NOT CASESPECIFIC NOT NULL

The first_name data is loaded into Teradata using a load utility called Multiload without a session character set specified.  When Multiload is run it has an option to specify the character set for the Multiload "load session".  If one is not specified Multiload defaults to ASCII.  (Note: for any Teradata gurus: DBC.Hosts does not have a character set specification set for Multiload to pick up).  

So in a situation in which the name "Günther" is loaded.
The following is the hex code of the data after loading:
47C3BC6E74686572
The correct hex representation for this string is the following which is in Oracle:
47FC6E74686572

So when I select the row of data containing this name from Teradata I get back the following
Günther
Note: To do this select I used a basic query tool with a UTF8 ODBC connection to Teradata.  I can connect via an ASCII ODBC connection and I get the same output (no surprise).

Now here is the thing.  Our users of data in this Teradata database use Business Objects (BO) to get at the data.  BO connects to Teradata via a UTF8 ODBC connection.  But BO also uses a special font called "BO Japan" to display the data.  This font seems to be doing something to display in this example first_name correctly.

Now here is the problem...  

I have to use the first_name data in a ColdFusion email application.  My email application will pull data directly from views on the Teradata database and I can set the connection type to whatever I want (have total control of that).  I will be displaying first_name in an HTML table in the email that is sent out (note I can set the font to whatever I want but of course not everyone has a BO Japan font making that a non-optimal solution and I'm not sure that it would work when not on BO servers).  

So does anyone know if it's possible to get at this data from my email application and get it in the correct format so I can display it propertly in an email?

Notes:
1) I will have to deal with Japanese data also.
2) I know that having the Teradata database load the data properly is the ultimate solution - just want to know if there are others.

Thanks!
 

Red Flag This Post

Please let us know here why this post is inappropriate. Reasons such as off-topic, duplicates, flames, illegal, vulgar, or students posting their homework.

Red Flag Submitted

Thank you for helping keep Tek-Tips Forums free from inappropriate posts.
The Tek-Tips staff will check this out and take appropriate action.

Reply To This Thread

Posting in the Tek-Tips forums is a member-only feature.

Click Here to join Tek-Tips and talk with other members!

Resources

Close Box

Join Tek-Tips® Today!

Join your peers on the Internet's largest technical computer professional community.
It's easy to join and it's free.

Here's Why Members Love Tek-Tips Forums:

Register now while it's still free!

Already a member? Close this window and log in.

Join Us             Close