Oracle8i National Language Support Guide Release 2 (8.1.6) Part Number A76966-01 |
|
This appendix lists the languages, territories, character sets, and other locale data supported by the Oracle server. It includes these topics:
You can also obtain information about supported character sets, languages, territories, and sorting orders by querying the dynamic data view V$NLS_VALID_VALUES. For more information on the data which can be returned by this view, see Oracle8i Reference.
Table A-1 lists the languages supported by the Oracle server.
Oracle error messages have been translated into the languages which are listed in Table A-2.
Table A-3 lists the territories supported by the Oracle server.
Oracle-supported character sets are listed below, for easy reference, according to three broad language groups:
Note that some character sets may be listed under multiple language groups because they provide multilingual support. For instance, Unicode spans the Asian, European, and Middle Eastern language groups because it supports most of the major scripts of the world.
The comment section indicates the type of encoding used:
As mentioned in Chapter 3, "Choosing a Character Set", the type of encoding will affect performance, so you should use the most efficient encoding that meets your language needs. Also, some encoding types can only be used with certain data types. For instance, fixed-width multibyte encoded character sets can only be used as an NCHAR character set, and not as a database character set.
Also documented in the comment section are other unique features of the character set that may be important to users or your database administrator. For instance, whether the character set supports the new Euro currency symbol, whether user-defined characters are supported for character set customization, and whether the character set is a strict superset of ASCII (which will allow you to make use of the ALTER DATABASE [NATIONAL] CHARACTER SET statement in case of migration.)
EURO = Euro symbol supported
UDC = User-defined Characters supported
ASCII = Strict Superset of ASCII
Oracle does not document individual code page layouts. For specific details about a particular character set, its character repertoire, and code point values, you should refer to the actual national, international, or vendor-specific standards.
Table A-4 lists the Oracle character sets that can support Asian languages.
Name | Description | Comments |
---|---|---|
BN8BSCII |
Bangladesh National Code 8-bit BSCII |
SB, ASCII |
ZHT16BIG5 |
BIG5 16-bit Traditional Chinese |
MB, ASCII |
ZHS16CGB231280 |
CGB2312-80 16-bit Simplified Chinese |
MB, ASCII |
JA16EUC |
EUC 24-bit Japanese |
MB, ASCII |
JA16EUCYEN |
EUC 24-bit Japanese with '\' mapped to the Japanese yen character |
MB |
JA16EUCFIXED |
EUC 16-bit Japanese. A fixed-width subset of JA16EUC (contains only the 2-byte characters of JA16EUC). Contains no 7- or 8-bit ASCII characters |
FIXED |
ZHT32EUC |
EUC 32-bit Traditional Chinese |
MB, ASCII |
ZHT32EUCFIXED |
EUC 32-bit Traditional Chinese (32-bit fixed-width, no single byte) |
FIXED |
ZHS16GBK |
GBK 16-bit Simplified Chinese |
MB, ASCII, UDC |
ZHS16GBKFIXED |
GBK 16-bit Simplified Chinese (16-bit fixed-width, no single byte) |
FIXED, UDC |
ZHT16CCDC |
HP CCDC 16-bit Traditional Chinese |
MB, ASCII |
JA16DBCS |
IBM EBCDIC 16-bit Japanese |
MB, UDC |
JA16EBCDIC930 |
IBM DBCS Code Page 290 16-bit Japanese |
MB, UDC |
JA16DBCSFIXED |
IBM EBCDIC 16-bit Japanese (16-bit fixed width, no single byte) |
FIXED, UDC |
KO16DBCS |
IBM EBCDIC 16-bit Korean |
MB, UDC |
KO16DBCSFIXED |
IBM EBCDIC 16-bit Korean (16-bit fixed-width, no single byte) |
FIXED, UDC |
ZHS16DBCS |
IBM EBCDIC 16-bit Simplified Chinese |
MB, UDC |
ZHS16CGB231280 |
CGB2312-80 16-bit Simplified Chinese (16-bit fixed-width, no single byte) |
FIXED |
ZHS16DBCSFIXED |
IBM EBCDIC 16-bit Simplified Chinese (16-bit fixed-width, no single byte) |
FIXED, UDC |
ZHT16DBCS |
IBM EBCDIC 16-bit Traditional Chinese |
MB, UDC |
ZHT16DBCSFIXED |
IBM EBCDIC 16-bit Traditional Chinese (16-bit fixed-width, no single byte) |
FIXED |
KO16KSC5601 |
KSC5601 16-bit Korean |
MB, ASCII |
KO16KSCCS |
KSCCS 16-bit Korean |
MB, ASCII |
KO16KSC5601FIXED |
KSC5601 (16-bit fixed-width, no single byte) |
FIXED |
JA16VMS |
JVMS 16-bit Japanese |
MB, ASCII |
ZHS16MACCGB231280 |
Mac client CGB2312-80 16-bit Simplified Chinese |
MB |
JA16MACSJIS |
Mac client Shift-JIS 16-bit Japanese |
MB |
TH8MACTHAI |
Mac Client 8-bit Latin/Thai |
SB |
TH8MACTHAIS |
Mac Server 8-bit Latin/Thai |
SB, ASCII |
TH8TISEBCDICS |
Thai Industrial Standard 620-2533-EBCDIC Server 8-bit |
SB |
ZHT16MSWIN950 |
MS Windows Code Page 950 Traditional Chinese |
MB, ASCII, UDC |
KO16MSWIN949 |
MS Windows Code Page 949 Korean |
MB, ASCII, UDC |
VN8MSWIN1258 |
MS Windows Code Page 1258 8-bit Vietnamese |
SB, ASCII, EURO |
IN8ISCII |
Multiple-Script Indian Standard 8-bit Latin/Indian |
SB, ASCII |
JA16SJIS |
Shift-JIS 16-bit Japanese |
MB, ASCII, UDC |
JA16SJISFIXED |
Shift-JIS 16-bit Japanese. A fixed-width subset of JA16SJIS (contains only the 2-byte characters of JA16JIS). Contains no 7- or 8-bit ASCII characters |
FIXED, UDC |
JA16SJISYEN |
Shift-JIS 16-bit Japanese with '\' mapped to the Japanese yen character |
MB, UDC |
ZHT32SOPS |
SOPS 32-bit Traditional Chinese |
MB, ASCII |
ZHT16DBT |
Taiwan Taxation 16-bit Traditional Chinese |
MB, ASCII |
ZHT16BIG5FIXED |
BIG5 16-bit Traditional Chinese (16-bit fixed-width, no single byte) |
FIXED |
TH8TISASCII |
Thai Industrial Standard 620-2533 - ASCII 8-bit |
SB, ASCII, EURO |
TH8TISEBCDIC |
Thai Industrial Standard 620-2533 - EBCDIC 8-bit |
SB |
ZHT32TRIS |
TRIS 32-bit Traditional Chinese |
MB, ASCII |
ZHT32TRISFIXED |
TRIS 32-bit Fixed-width Traditional Chinese |
FIXED |
AL24UTFFSS |
See "Universal Character Sets" for details |
|
UTF8 |
See "Universal Character Sets" for details |
|
UTFE |
See "Universal Character Sets" for details |
|
VN8VN3 |
VN3 8-bit Vietnamese |
SB, ASCII |
Table A-5 lists the Oracle character sets that can support European languages.
Name | Description | Comments |
---|---|---|
US7ASCII |
ASCII 7-bit American |
SB, ASCII |
SF7ASCII |
ASCII 7-bit Finnish |
SB |
YUG7ASCII |
ASCII 7-bit Yugoslavian |
SB |
RU8BESTA |
BESTA 8-bit Latin/Cyrillic |
SB, ASCII |
EL8GCOS7 |
Bull EBCDIC GCOS7 8-bit Greek |
SB |
WE8GCOS7 |
Bull EBCDIC GCOS7 8-bit West European |
SB |
EL8DEC |
DEC 8-bit Latin/Greek |
SB |
TR7DEC |
DEC VT100 7-bit Turkish |
SB |
TR8DEC |
DEC 8-bit Turkish |
SB, ASCII |
TR8EBCDIC1026 |
EBCDIC Code Page 1026 8-bit Turkish |
SB |
TR8EBCDIC1026S |
EBCDIC Code Page 1026 Server 8-bit Turkish |
SB |
TR8PC857 |
IBM-PC Code Page 857 8-bit Turkish |
SB, ASCII |
TR8MACTURKISH |
MAC Client 8-bit Turkish |
SB |
TR8MACTURKISHS |
MAC Server 8-bit Turkish |
SB, ASCII |
TR8MSWIN1254 |
MS Windows Code Page 1254 8-bit Turkish |
SB, ASCII, EURO |
WE8BS2000L5 |
Siemens EBCDIC.DF.L5 8-bit West European/Turkish |
SB |
WE8DEC |
DEC 8-bit West European |
SB, ASCII |
D7DEC |
DEC VT100 7-bit German |
SB |
F7DEC |
DEC VT100 7-bit French |
SB |
S7DEC |
DEC VT100 7-bit Swedish |
SB |
E7DEC |
DEC VT100 7-bit Spanish |
SB |
NDK7DEC |
DEC VT100 7-bit Norwegian/Danish |
SB |
I7DEC |
DEC VT100 7-bit Italian |
SB |
NL7DEC |
DEC VT100 7-bit Dutch |
SB |
CH7DEC |
DEC VT100 7-bit Swiss (German/French) |
SB |
SF7DEC |
DEC VT100 7-bit Finnish |
SB |
WE8DG |
DG 8-bit West European |
SB, ASCII |
WE8EBCDIC37C |
EBCDIC Code Page 37 8-bit Oracle/c |
SB |
WE8EBCDIC37 |
EBCDIC Code Page 37 8-bit West European |
SB |
D8EBCDIC273 |
EBCDIC Code Page 273/1 8-bit Austrian German |
SB |
DK8EBCDIC277 |
EBCDIC Code Page 277/1 8-bit Danish |
SB |
S8EBCDIC278 |
EBCDIC Code Page 278/1 8-bit Swedish |
SB |
I8EBCDIC280 |
EBCDIC Code Page 280/1 8-bit Italian |
SB |
WE8EBCDIC284 |
EBCDIC Code Page 284 8-bit Latin American/Spanish |
SB |
WE8EBCDIC285 |
EBCDIC Code Page 285 8-bit West European |
SB |
WE8EBCDIC1047 |
EBCDIC Code Page 1047 8-bit West European |
SB |
WE8EBCDIC1140 |
EBCDIC Code Page 1140 8-bit West European |
SB, EURO |
WE8EBCDIC1140C |
EBCDIC Code Page 1140 Client 8-bit West European |
SB, EURO |
WE8EBCDIC1145 |
EBCDIC Code Page 1145 8-bit West European |
SB, EURO |
WE8EBCDIC1146 |
EBCDIC Code Page 1146 8-bit West European |
SB, EURO |
WE8EBCDIC1148 |
EBCDIC Code Page 1148 8-bit West European |
SB, EURO |
WE8EBCDIC1148C |
EBCDIC Code Page 1148 Client 8-bit West European |
SB, EURO |
F8EBCDIC297 |
EBCDIC Code Page 297 8-bit French |
SB |
WE8EBCDIC500C |
EBCDIC Code Page 500 8-bit Oracle/c |
SB |
WE8EBCDIC500 |
EBCDIC Code Page 500 8-bit West European |
SB |
EE8EBCDIC870 |
EBCDIC Code Page 870 8-bit East European |
SB |
EE8EBCDIC870C |
EBCDIC Code Page 870 Client 8-bit East European |
SB |
EE8EBCDIC870S |
EBCDIC Code Page 870 Server 8-bit East European |
SB |
WE8EBCDIC871 |
EBCDIC Code Page 871 8-bit Icelandic |
SB |
EL8EBCDIC875 |
EBCDIC Code Page 875 8-bit Greek |
SB |
EL8EBCDIC875S |
EBCDIC Code Page 875 Server 8-bit Greek |
SB |
CL8EBCDIC1025 |
EBCDIC Code Page 1025 8-bit Cyrillic |
SB |
CL8EBCDIC1025C |
EBCDIC Code Page 1025 Client 8-bit Cyrillic |
SB |
CL8EBCDIC1025S |
EBCDIC Code Page 1025 Server 8-bit Cyrillic |
SB |
CL8EBCDIC1025X |
EBCDIC Code Page 1025 (Modified) 8-bit Cyrillic |
SB |
BLT8EBCDIC1112 |
EBCDIC Code Page 1112 8-bit Baltic Multilingual |
SB |
BLT8EBCDIC1112S |
EBCDIC Code Page 1112 8-bit Server Baltic Multilingual |
SB |
D8EBCDIC1141 |
EBCDIC Code Page 1141 8-bit Austrian German |
SB, EURO |
DK8EBCDIC1142 |
EBCDIC Code Page 1142 8-bit Danish |
SB, EURO |
S8EBCDIC1143 |
EBCDIC Code Page 1143 8-bit Swedish |
SB, EURO |
I8EBCDIC1144 |
EBCDIC Code Page 1144 8-bit Italian |
SB, EURO |
F8EBCDIC1147 |
EBCDIC Code Page 1147 8-bit French |
SB, EURO |
EEC8EUROASCI |
EEC Targon 35 ASCI West European/Greek |
SB |
EEC8EUROPA3 |
EEC EUROPA3 8-bit West European/Greek |
SB |
LA8PASSPORT |
German Government Printer 8-bit All-European Latin |
SB, ASCII |
WE8HP |
HP LaserJet 8-bit West European |
SB |
WE8ROMAN8 |
HP Roman8 8-bit West European |
SB, ASCII |
HU8CWI2 |
Hungarian 8-bit CWI-2 |
SB, ASCII |
HU8ABMOD |
Hungarian 8-bit Special AB Mod |
SB, ASCII |
LV8RST104090 |
IBM-PC Alternative Code Page 8-bit Latvian (Latin/Cyrillic) |
SB, ASCII |
US8PC437 |
IBM-PC Code Page 437 8-bit American |
SB, ASCII |
BG8PC437S |
IBM-PC Code Page 437 8-bit (Bulgarian Modification) |
SB, ASCII |
EL8PC437S |
IBM-PC Code Page 437 8-bit (Greek modification) |
SB, ASCII |
EL8PC737 |
IBM-PC Code Page 737 8-bit Greek/Latin |
SB |
LT8PC772 |
IBM-PC Code Page 772 8-bit Lithuanian (Latin/Cyrillic) |
SB, ASCII |
LT8PC774 |
IBM-PC Code Page 774 8-bit Lithuanian (Latin) |
SB, ASCII |
BLT8PC775 |
IBM-PC Code Page 775 8-bit Baltic |
SB, ASCII |
WE8PC850 |
IBM-PC Code Page 850 8-bit West European |
SB, ASCII |
EL8PC851 |
IBM-PC Code Page 851 8-bit Greek/Latin |
SB, ASCII |
EE8PC852 |
IBM-PC Code Page 852 8-bit East European |
SB, ASCII |
RU8PC855 |
IBM-PC Code Page 855 8-bit Latin/Cyrillic |
SB, ASCII |
WE8PC858 |
IBM-PC Code Page 858 8-bit West European |
SB, ASCII, EURO |
WE8PC860 |
IBM-PC Code Page 860 8-bit West European |
SB. ASCII |
IS8PC861 |
IBM-PC Code Page 861 8-bit Icelandic |
SB, ASCII |
CDN8PC863 |
IBM-PC Code Page 863 8-bit Canadian French |
SB, ASCII |
N8PC865 |
IBM-PC Code Page 865 8-bit Norwegian |
SB. ASCII |
RU8PC866 |
IBM-PC Code Page 866 8-bit Latin/Cyrillic |
SB, ASCII |
EL8PC869 |
IBM-PC Code Page 869 8-bit Greek/Latin |
SB, ASCII |
LV8PC1117 |
IBM-PC Code Page 1117 8-bit Latvian |
SB, ASCII |
US8ICL |
ICL EBCDIC 8-bit American |
SB |
WE8ICL |
ICL EBCDIC 8-bit West European |
SB |
WE8ISOICLUK |
ICL special version ISO8859-1 |
SB |
WE8ISO8859P1 |
ISO 8859-1 West European |
SB, ASCII |
EE8ISO8859P2 |
ISO 8859-2 East European |
SB, ASCII |
SE8ISO8859P3 |
ISO 8859-3 South European |
SB, ASCII |
NEE8ISO8859P4 |
ISO 8859-4 North and North-East European |
SB, ASCII |
CL8ISO8859P5 |
ISO 8859-5 Latin/Cyrillic |
SB, ASCII |
AR8ISO8859P6 |
ISO 8859-6 Latin/Arabic |
SB, ASCII |
EL8ISO8859P7 |
ISO 8859-7 Latin/Greek |
SB, ASCII, EURO |
IW8ISO8859P8 |
ISO 8859-8 Latin/Hebrew |
SB, ASCII |
NE8ISO8859P10 |
ISO 8859-10 North European |
SB, ASCII |
WE8ISO8859P15 |
ISO 8859-15 West European |
SB, ASCII, EURO |
LA8ISO6937 |
ISO 6937 8-bit Coded Character Set for Text Communication |
SB, ASCII |
IW7IS960 |
Israeli Standard 960 7-bit Latin/Hebrew |
SB |
AR8ARABICMAC |
Mac Client 8-bit Latin/Arabic |
SB |
EE8MACCE |
Mac Client 8-bit Central European |
SB |
EE8MACCROATIAN |
Mac Client 8-bit Croatian |
SB |
WE8MACROMAN8 |
Mac Client 8-bit Extended Roman8 West European |
SB |
EL8MACGREEK |
Mac Client 8-bit Greek |
SB |
IS8MACICELANDIC |
Mac Client 8-bit Icelandic |
SB |
CL8MACCYRILLIC |
Mac Client 8-bit Latin/Cyrillic |
SB |
AR8ARABICMACS |
Mac Server 8-bit Latin/Arabic |
SB, ASCII |
EE8MACCES |
Mac Server 8-bit Central European |
SB, ASCII |
EE8MACCROATIANS |
Mac Server 8-bit Croatian |
SB, ASCII |
WE8MACROMAN8S |
Mac Server 8-bit Extended Roman8 West European |
SB, ASCII |
CL8MACCYRILLICS |
Mac Server 8-bit Latin/Cyrillic |
SB, ASCII |
EL8MACGREEKS |
Mac Server 8-bit Greek |
SB, ASCII |
IS8MACICELANDICS |
Mac Server 8-bit Icelandic |
SB |
BG8MSWIN |
MS Windows 8-bit Bulgarian Cyrillic |
SB, ASCII |
LT8MSWIN921 |
MS Windows Code Page 921 8-bit Lithuanian |
SB, ASCII |
ET8MSWIN923 |
MS Windows Code Page 923 8-bit Estonian |
SB, ASCII |
EE8MSWIN1250 |
MS Windows Code Page 1250 8-bit East European |
SB, ASCII, EURO |
CL8MSWIN1251 |
MS Windows Code Page 1251 8-bit Latin/Cyrillic |
SB, ASCII, EURO |
WE8MSWIN1252 |
MS Windows Code Page 1252 8-bit West European |
SB, ASCII, EURO |
EL8MSWIN1253 |
MS Windows Code Page 1253 8-bit Latin/Greek |
SB, ASCII, EURO |
BLT8MSWIN1257 |
MS Windows Code Page 1257 8-bit Baltic |
SB, ASCII, EURO |
BLT8CP921 |
Latvian Standard LVS8-92(1) Windows/Unix 8-bit Baltic |
SB, ASCII |
LV8PC8LR |
Latvian Version IBM-PC Code Page 866 8-bit Latin/Cyrillic |
SB, ASCII |
WE8NCR4970 |
NCR 4970 8-bit West European |
SB, ASCII |
WE8NEXTSTEP |
NeXTSTEP PostScript 8-bit West European |
SB, ASCII |
CL8KOI8R |
RELCOM Internet Standard 8-bit Latin/Cyrillic |
SB, ASCII |
US8BS2000 |
Siemens 9750-62 EBCDIC 8-bit American |
SB |
DK8BS2000 |
Siemens 9750-62 EBCDIC 8-bit Danish |
SB |
F8BS2000 |
Siemens 9750-62 EBCDIC 8-bit French |
SB |
D8BS2000 |
Siemens 9750-62 EBCDIC 8-bit German |
SB |
E8BS2000 |
Siemens 9750-62 EBCDIC 8-bit Spanish |
SB |
S8BS2000 |
Siemens 9750-62 EBCDIC 8-bit Swedish |
SB |
DK7SIEMENS9780X |
Siemens 97801/97808 7-bit Danish |
SB |
F7SIEMENS9780X |
Siemens 97801/97808 7-bit French |
SB |
D7SIEMENS9780X |
Siemens 97801/97808 7-bit German |
SB |
I7SIEMENS9780X |
Siemens 97801/97808 7-bit Italian |
SB |
N7SIEMENS9780X |
Siemens 97801/97808 7-bit Norwegian |
SB |
E7SIEMENS9780X |
Siemens 97801/97808 7-bit Spanish |
SB |
S7SIEMENS9780X |
Siemens 97801/97808 7-bit Swedish |
SB |
WE8BS2000 |
Siemens EBCDIC.DF.04 8-bit West European |
SB |
CL8BS2000 |
Siemens EBCDIC.EHC.LC 8-bit Cyrillic |
SB |
AL24UTFFSS |
See "Universal Character Sets" for details |
|
UTF8 |
See "Universal Character Sets" for details |
|
UTFE |
See "Universal Character Sets" for details |
|
Table A-6 lists the Oracle character sets that can support Middle Eastern languages.
Name | Description | Comments |
---|---|---|
AR8APTEC715 |
APTEC 715 Server 8-bit Latin/Arabic |
SB, ASCII |
AR8APTEC715T |
APTEC 715 8-bit Latin/Arabic |
SB |
AR8ASMO708PLUS |
ASMO 708 Plus 8-bit Latin/Arabic |
SB, ASCII |
AR8ASMO8X |
ASMO Extended 708 8-bit Latin/Arabic |
SB, ASCII |
AR8ADOS710 |
Arabic MS-DOS 710 Server 8-bit Latin/Arabic |
SB, ASCII |
AR8ADOS710T |
Arabic MS-DOS 710 8-bit Latin/Arabic |
SB |
AR8ADOS720 |
Arabic MS-DOS 720 Server 8-bit Latin/Arabic |
SB, ASCII |
AR8ADOS720T |
Arabic MS-DOS 720 8-bit Latin/Arabic |
SB |
TR7DEC |
DEC VT100 7-bit Turkish |
SB |
TR8DEC |
DEC 8-bit Turkish |
SB |
WE8EBCDIC37C |
EBCDIC Code Page 37 8-bit Oracle/c |
SB |
IW8EBCDIC424 |
EBCDIC Code Page 424 8-bit Latin/Hebrew |
SB |
IW8EBCDIC424S |
EBCDIC Code Page 424 Server 8-bit Latin/Hebrew |
SB |
WE8EBCDIC500C |
EBCDIC Code Page 500 8-bit Oracle/c |
SB |
IW8EBCDIC1086 |
EBCDIC Code Page 1086 8-bit Hebrew |
SB |
AR8EBCDIC420S |
EBCDIC Code Page 420 Server 8-bit Latin/Arabic |
SB |
AR8EBCDICX |
EBCDIC XBASIC Server 8-bit Latin/Arabic |
SB |
TR8EBCDIC1026 |
EBCDIC Code Page 1026 8-bit Turkish |
SB |
TR8EBCDIC1026S |
EBCDIC Code Page 1026 Server 8-bit Turkish |
SB |
AR8HPARABIC8T |
HP 8-bit Latin/Arabic |
SB |
TR8PC857 |
IBM-PC Code Page 857 8-bit Turkish |
SB, ASCII |
IW8PC1507 |
IBM-PC Code Page 1507/862 8-bit Latin/Hebrew |
SB, ASCII |
AR8ISO8859P6 |
ISO 8859-6 Latin/Arabic |
SB, ASCII |
IW8ISO8859P8 |
ISO 8859-8 Latin/Hebrew |
SB, ASCII |
WE8ISO8859P9 |
ISO 8859-9 West European & Turkish |
SB, ASCII |
LA8ISO6937 |
ISO 6937 8-bit Coded Character Set for Text Communication |
SB, ASCII |
IW7IS960 |
Israeli Standard 960 7-bit Latin/Hebrew |
SB |
IW8MACHEBREW |
Mac Client 8-bit Hebrew |
SB |
AR8ARABICMAC |
Mac Client 8-bit Latin/Arabic |
SB |
AR8ARABICMACT |
Mac 8-bit Latin/Arabic |
SB |
TR8MACTURKISH |
Mac Client 8-bit Turkish |
SB |
IW8MACHEBREWS |
Mac Server 8-bit Hebrew |
SB, ASCII |
AR8ARABICMACS |
Mac Server 8-bit Latin/Arabic |
SB, ASCII |
TR8MACTURKISHS |
Mac Server 8-bit Turkish |
SB, ASCII |
TR8MSWIN1254 |
MS Windows Code Page 1254 8-bit Turkish |
SB, ASCII, EURO |
IW8MSWIN1255 |
MS Windows Code Page 1255 8-bit Latin/Hebrew |
SB, ASCII, EURO |
AR8MSWIN1256 |
MS Windows Code Page 1256 8-Bit Latin/Arabic |
SB. ASCII, EURO |
IN8ISCII |
Multiple-Script Indian Standard 8-bit Latin/Indian |
SB |
AR8MUSSAD768 |
Mussa'd Alarabi/2 768 Server 8-bit Latin/Arabic |
SB, ASCII |
AR8MUSSAD768T |
Mussa'd Alarabi/2 768 8-bit Latin/Arabic |
SB |
AR8NAFITHA711 |
Nafitha Enhanced 711 Server 8-bit Latin/Arabic |
SB, ASCII |
AR8NAFITHA711T |
Nafitha Enhanced 711 8-bit Latin/Arabic |
SB |
AR8NAFITHA721 |
Nafitha International 721 Server 8-bit Latin/Arabic |
SB, ASCII |
AR8NAFITHA721T |
Nafitha International 721 8-bit Latin/Arabic |
SB |
AR8SAKHR706 |
SAKHR 706 Server 8-bit Latin/Arabic |
SB, ASCII |
AR8SAKHR707 |
SAKHR 707 Server 8-bit Latin/Arabic |
SB, ASCII |
AR8SAKHR707T |
SAKHR 707 8-bit Latin/Arabic |
SB |
AR8XBASIC |
XBASIC 8-bit Latin/Arabic |
SB |
WE8BS2000L5 |
Siemens EBCDIC.DF.04.L5 8-bit West European/Turkish |
SB |
AL24UTFFSS |
See "Universal Character Sets" for details |
|
UTF8 |
See "Universal Character Sets" for details |
|
UTFE |
See "Universal Character Sets" for details |
|
Table A-7 lists the Oracle character sets that provide universal language support, that is, they attempt to support all languages of the world, including, but not limited to, Asian, European, and Middle Eastern languages.
Note: The Unicode 1.1 character set has been superseded by Unicode 2.1. One of the major differences between version 1.1 and 2.1 is the redefinition and addition of 11,172 Korean characters. Whenever possible, you should use the latest version of the Unicode standard. The primary scripts currently supported by Unicode 2.1 are:
Arabic |
Gujarati |
Latin |
Armenian |
Gurmukhi |
Lao |
Bengali |
Han |
Malayalam |
Bopomofo |
Hangul |
Oriya |
Cyrillic |
Hebrew |
Tamil |
Devanagari |
Hiragana |
Telugu |
Georgian |
Kannada |
Thai |
Greek |
Katakana |
Tibetan |
For details on the Unicode standard, see http://www.unicode.org or refer to the Unicode Standard, defined by the Unicode consortium.
Oracle's UTF8 character set currently supports the following characters.
These are 1-byte characters in UTF8, that have character codes 0x00 through 0x7f inclusive. These can represent only English ASCII characters. All English ASCII characters have exactly the same character codes (0x00 through 0x7f inclusive) in US7ASCII and UTF8 character sets.
These are 2-byte characters in UTF8, that have character codes 0xc0WW through 0xdfWW inclusive where WW can be 0x80 through 0xbf inclusive.
These can represent characters of most European (including Greek and Russian), Arabic, Hebrew and some other languages.
These are 3-byte characters in UTF8, that have character codes
0xe0WWTT through 0xecWWTT inclusive
0xed80TT through 0xed9fTT inclusive
0xeeWWTT through 0xefWWTT inclusive
where WW and TT are 0x80 through 0xbf inclusive.
These can represent characters of Chinese, Japanese, Korean, Thai, Indic, Dravidian and some other languages. Also, the "euro" currency sign is included in this group of characters.
Oracle's UTF8 character set currently does not support the following characters. If you use these characters in Oracle's current UTF8 character set, the result is not guaranteed, and the behavior changes in the future releases of Oracle.
These are called surrogates in Unicode 2.1 (UTF16). These are 4-byte characters in UTF8 (when implemented in the future). Since Unicode 2.1 didn't assign any character using surrogates yet, all assigned characters in Unicode 2.1 can be represented in Oracle's current UTF8 character set. Currently, the only advantage of UTF16 (which Oracle's current UTF8 character set doesn't have) is that surrogates can represent 131,072 extra User Defined Characters on top of 6,400 User-Defined Characters that are available in Oracle's current UTF8 character set.
Therefore, unless you need more than 6,400 User-Defined Characters, Oracle's current UTF8 character set can represent all characters of Unicode 2.1.
Linguistic definitions define linguistic cases for particular languages. Extended linguistic definitions include some special linguistic cases for the language. Typically, using the extended definition means that characters will be sorted differently from their ASCII values. For example, ch and ll are treated as only one character in XSPANISH. Table A-8 lists the linguistic definitions supported by the Oracle server.
By default, most territory definitions use the Gregorian calendar system. Table A-9 lists the other calendar systems supported by the Oracle server.
Figure A-1 shows how March 20, 1998 appears in ROC Official:
Figure A-2 shows how March 27, 1998 appears in Japanese Imperial:
Table A-10 lists the character sets that support the Euro symbol.
Table A-11 lists the default values for NLS parameters.
|
![]() Copyright © 1996-2000, Oracle Corporation. All Rights Reserved. |
|