Jump to content United States-English
HP.com Home Products and Services Support and Drivers Solutions How to Buy
» Contact HP
More options
HP.com home
HP-UX 11i Version 3 Release Notes: HP 9000 and HP Integrity Servers > Chapter 11 Internationalization

Unicode 5.0 Support

» 

Technical documentation

Complete book in PDF
» Feedback
Content starts here

 » Table of Contents

HP-UX 11i v3 includes Unicode 5.0 support, which is an extension to the previously supported Unicode 3.0 character set standard. Unicode 5.0 is aligned with the revised ISO 10646-2:2003 standard including Amendments 1 and 2, defining 99,089 total characters. It includes an additional 48,830 new characters from the Unicode 3.0 version. Most notable of these additions are new CJK ideographic characters that align with the most recent versions of GB-18030, JIS X 0213 and HKSCS-2004, new Latin and Cyrillic characters and a number of historic script characters.

Summary of Change

What’s New for Customers Migrating from HP-UX 11i v1 September 2005?

All 52 previously supported system supplied utf8 locales have been updated to support the character repertoire specified by the Unicode 5.0 standard. In addition, all new 11i v3 utf8 locales (refer to “New locales - Baltic/Russia/Ukraine/Latin America” section) align with the Unicode 5.0 standard.

Note that as of 11i v3, the locale binaries provided are version 3. Refer to the “New Locale Versioning” section for further details regarding levels of support for PA-RISC-based archived applications.

Changes have been made in the Streams ldterm modules, libc and associated libc utf8 method libraries, the localedef and the eucset commands, iconv converters, Xlib, fonts, CDE, and Asian print drivers to support Unicode 5.0.

Users will also notice specific changes in display/printing aspects between Asian utf8 and non-Asian utf8 locales. Support has been added within HP-UX to properly support the Unicode Consortium’s specified “Alternate Width Properties” for Asian locales.

(Refer to “Alternative Width Properties for Unicode Character Sets” for more details.)

To effect this change within the tty/ldterm settings, the eucset command has been extended to include a new -c codeset option “ASIAN_UTF8”. Refer to the eucset(1) manpage for more details. In addition to eucset command, the CDE components have been enhanced to handle ‘ASIAN_UTF8’ option for Asian locales.

Significant additions have been made to iconv converters to support new Unicode 5.0 characters, surrogate characters, byte-order marks and all forms of Unicode-specified transformations, including UTF-8, UTF-16, UTF-32, big and little-endian forms. Refer to the system.config.iconv file under /usr/lib/nls/iconv for the complete listing of all iconv converters supported as part of the base operating system.

What’s New for Customers Migrating from HP-UX 11i v2 June 2006?

See “What’s New for Customers Migrating from HP-UX 11i v1 September 2005?”

Impact

No additional memory requirements are needed when running in any of these locales.

Compatibility

Support for Unicode 5.0 is only provided for applications running on Itanium®-based 32/64-bit platforms or those compiled in shared mode on PA-RISC 32/64-bit platforms. Unicode 5.0 support is not provided for those applications that were compiled in archived mode on previous HP-UX releases. Those archived applications will continue to use the Unicode 3.0 repertoire as supported in previous HP-UX releases.

Performance

For specific operations, involving collation and/or string handling, some performance improvements from past releases will be observed within the system.

Documentation

See the manpages for localedef(1) and eucset(1).

Obsolescence

Not applicable.

Printable version
Privacy statement Using this site means you accept its terms Feedback to webmaster
© 2006-2007 Hewlett-Packard Development Company, L.P.