*nix Documentation Project
·  Home
 +   man pages
·  Linux HOWTOs
·  FreeBSD Tips
·  *niX Forums

  man pages->Tru64 Unix man pages -> SJIS (5)              
Title
Content
Arch
Section
 

shiftjis(5)

Contents


NAME    [Toc]    [Back]

       shiftjis, SJIS - A character encoding system (codeset) for
       Japanese

DESCRIPTION    [Toc]    [Back]

       The Shift JIS (SJIS) codeset  consists  of  the  following
       character  sets:  JIS X0201 JIS X0208 User-Defined Characters
 (UDC)

   Shift JIS Encoding    [Toc]    [Back]
       Shift JIS character codes use a combination of single-byte
       data  and  2-byte  data to represent characters defined in
       the JIS X0201 and JIS X0208 standards and in the UDC area.

       All  JIS  X0201  characters are represented in the form of
       single-byte data. The  Roman  letters  in  JIS  X0201  are
       encoded  by setting the most significant bit (MSB) of each
       byte to off, while the Katakana characters are encoded  by
       setting the most significant bit (MSB) of each byte to on.
       For more information on JIS  X0201  characters,  refer  to
       deckanji(5).  In  the  Super  DEC  Kanji codeset, the code
       ranges for JIS X0201 characters are as follows: For  Roman
       letters, 00 to 7F For Katakana characters, A1 to DF

       JIS  X0208  characters  are encoded in 2-byte values.  The
       values for the first bytes are encoded so that  they  fall
       outside  the range of byte values for JIS X0201 characters
       (in other words, the JIS X0208 first byte ranges are  from
       81  to  9F  and from E0 to FC). In this manner, characters
       from the two different standards can be supported  by  the
       same codeset. The range for the second byte of a JIS X0208
       character is 40 to FC (except for 7F).  For more  information
 on JIS X0208 characters, refer to deckanji(5).

       The  Shift  JIS  codeset provides for 2444 UDC characters.
       These are encoded as 2-byte values  whose  code  range  is
       F040 to FCFC.

   Font Support for Super DEC Kanji    [Toc]    [Back]
       For  display  devices, the operating system supports Super
       DEC Kanji encoding by conversion to DEC Kanji encoding and
       then  using  fonts  available for DEC Kanji.  Refer to the
       iconv_intro(5) reference page for information  on  codeset
       conversion.

       For  printers, the operating system supports only printerresident
 fonts; therefore, Super DEC Kanji fonts cannot be
       dynamically  loaded to a printer.  For general information
       on printing non-English text, refer to i18n_printing(5).

SEE ALSO    [Toc]    [Back]

      
      
       Commands: locale(1)

       Others: ascii(5),  deckanji(5),  eucJP(5),  i18n_intro(5),
       i18n_printing(5),       iconv_intro(5),      iso2022jp(5),
       Japanese(5), jiskanji(5), l10n_intro(5), sdeckanji(5)



                                                      shiftjis(5)
[ Back ]
 Similar pages
Name OS Title
ISO8859-1 Tru64 A character encoding system (codeset)
iso8859-1 Tru64 A character encoding system (codeset)
iso8859-9 Tru64 A character encoding system (codeset) for Turkish
iso8859-8 Tru64 A character encoding system (codeset) for Hebrew
iso8859-5 Tru64 A character encoding system (codeset) for Russian
ISO8859-8 Tru64 A character encoding system (codeset) for Hebrew
deckorean Tru64 A character encoding system (codeset) for Korean
iso8859-7 Tru64 A character encoding system (codeset) for Greek
tactis Tru64 A character encoding system (codeset) for Thai.
ISO8859-9 Tru64 A character encoding system (codeset) for Turkish
Copyright © 2004-2005 DeniX Solutions SRL
newsletter delivery service