*nix Documentation Project
·  Home
 +   man pages
·  Linux HOWTOs
·  FreeBSD Tips
·  *niX Forums

  man pages->Tru64 Unix man pages -> JIS7 (5)              
Title
Content
Arch
Section
 

jiskanji(5)

Contents


NAME    [Toc]    [Back]

       jiskanji,  jiskanji7,  JIS7  - A character encoding system
       (codeset) for Japanese

DESCRIPTION    [Toc]    [Back]

       JIS Kanji is a codeset that  uses  the  JIS  X0202  symbol
       extension  method for encoding the JIS X0208 and JIS X0201
       character sets. There are two types of JIS Kanji encoding:
       7-bit JIS Kanji code and 8-bit JIS Kanji code.

   7-bit JIS Kanji Code
       In  7-bit  JIS  Kanji  encoding,  all character values are
       7-bit bytes. Characters are interpreted according to  preceding
  in and out sequences as follows: Kanji in sequence
       (ESC $ B)

              The code values following  the  Kanji  in  sequence
              (ESC  $  B)  are  treated  as characters in the JIS
              X0208 Kanji character set.  Kanji out sequence (ESC
              ( B)

              The  code  values  following the Kanji out sequence
              (ESC ( B) are treated as ASCII characters.  Supplementary
 Kanji in sequence (ESC $ ( D)

              The  code  values following the supplementary Kanji
              in sequence (ESC $ ( D) are treated  as  characters
              in the JIS X0212 supplementary Kanji character set.
              User-Defined Character (UDC) in sequence (ESC  $  (
              0)

              The  code values following the UDC in sequence (ESC
              $ ( 0) are treated as  characters  in  the  vendordefined
  or  user-defined  character  set.  Kana in
              (SO) and Kana out (SI) sequences

              The code values following  SO(0x0e)  and  preceding
              SI(0x0f) are treated as characters in the JIS X0201
              Katakana character set.  Katakana in sequence  (ESC
              ( I)

              Code values following the Katakana in sequence (ESC
              ( I) are treated as characters  in  the  JIS  X0201
              Katakana character set. In this case, the Kanji out
              sequence is used to switch back to ASCII code.

              The Katakana in and  Kanji  out  sequences  are  an
              alternative  to using the Kana in and out sequences
              (SO/SI).

   8-bit JIS Kanji Code
       In 8-bit JIS Kanji encoding, the JIS X0201 Katakana  characters
  are represented as 8-bit bytes. Using this form of
       encoding, in and out sequences have the following  effect:
       Kanji in sequence (ESC $ B)

              Code  values following the Kanji in sequence (ESC $
              B) are treated as characters in the JIS X0208 Kanji
              character  set.   Supplementary  Kanji  in sequence
              (ESC $ ( D)

              Code values following the  supplementary  Kanji  in
              sequence  (ESC  $ ( D) are treated as characters in
              the JIS X0212 supplementary  Kanji  character  set.
              User-Defined  Character  (UDC) in sequence (ESC $ (
              0)

              Code values following the UDC in sequence (ESC $  (
              0)  are  treated  as vendor-defined or user-defined
              characters.  Kanji out sequence (ESC ( B) Code values
  following the Kanji out sequence (ESC ( B) are
              treated as  ASCII  characters.   Kana  in  and  out
              sequences (SI/SO)

              These sequences are ignored.

   Codeset Conversion    [Toc]    [Back]
       The  following  codeset  converter pairs are available for
       converting Japanese characters between jiskanji7  or  JIS7
       and other encoding formats.  The RESTRICTIONS section discusses
 some conversion limitations  that  apply  to  these
       converters.

       Refer  to  iconv_intro(5)  for  an introduction to codeset
       conversion. For more information about the  other  codeset
       for  which  jiskanji7  or JIS7 is the input or output, see
       the reference page specified  in  the  list  item.   deckanji_jiskanji7
  or  deckanji_JIS7,  jiskanji7_deckanji  or
       JIS7_deckanji

              Converting from and to the DEC Kanji codeset: deckanji(5).      eucJP_jiskanji7     or    eucJP_JIS7,
              jiskanji7_eucJP or JIS7_eucJP

              Converting from and to Japanese Extended UNIX Code:
              eucJP(5).      eucTW_jiskanji7    or    eucTW_JIS7,
              jiskanji7_eucTW or JIS7_eucTW

              Converting from  and  to  Taiwanese  Extended  UNIX
              Code:   eucTW(5).   sdeckanji_jiskanji7  or  sdeckanji_JIS7,
 jiskanji7_sdeckanji or JIS7_sdeckanji

              Converting from and to the Super DEC Kanji codeset:
              sdeckanji(5).     SJIS_jiskanji7    or   SJIS_JIS7,
              jiskanji7_SJIS or JIS7_SJIS

              Converting from and to Shift JIS format: SJIS(5).

              Shift JIS encoding format is identical to  encoding
              in  Microsoft code-pages used on PC systems. Therefore,
 you  can  use  these  converters  to  convert
              Japanese  characters between JIS Kanji and PC codepage
 format. For general  information  on  how  the
              operating   system  supports  PC  code  pages,  see
              code_page(5).

RESTRICTIONS    [Toc]    [Back]

       The JIS Kanji codeset  is  not  supported  directly  by  a
       locale  but  through  code  conversion  (through the iconv
       utility, Japanese terminal (tty) code conversion,  and  so
       forth).

       In  the codeset naming conventions used by the iconv utility,
 the string JIS7 indicates 7-bit JIS Kanji  code  that
       follows  a  Katakana  in sequence and the string jiskanji7
       indicates 7-bit JIS Kanji code entered between Kana in and
       out  sequences.   The  following  sequences  are valid for
       input to the iconv utility but are not generated when code
       is  converted  to  jiskanji7:  Kanji in (ESC $ @) Kanji in
       (ESC & @ ESC $ B) Kanji in (ESC $ ( B) Kanji in (ESC  $  (
       @) Supplementary Kanji in (ESC $ D) Kana in (ESC ( J) Kana
       in (ESC ( H)

       In the code naming conventions of the  Japanese  terminal,
       the  string  jis7  indicates  7-bit JIS Kanji code and the
       string jis8 indicates 8-bit JIS Kanji code. When the  terminal
  code  is set to jis7, the Kana in and out sequences
       (SI/SO) are used for JIS X0201 Katakana  character  representation.

SEE ALSO    [Toc]    [Back]

      
      
       Commands: locale(1)

       Others:  ascii(5),  code_page(5),  deckanji(5),  eucJP(5),
       i18n_intro(5),      i18n_printing(5),      iconv_intro(5),
       iso2022jp(5),  Japanese(5),  l10n_intro(5),  sdeckanji(5),
       shiftjis(5)



                                                      jiskanji(5)
[ Back ]
 Similar pages
Name OS Title
ISO8859-1 Tru64 A character encoding system (codeset)
iso8859-1 Tru64 A character encoding system (codeset)
iso8859-9 Tru64 A character encoding system (codeset) for Turkish
iso8859-8 Tru64 A character encoding system (codeset) for Hebrew
iso8859-5 Tru64 A character encoding system (codeset) for Russian
ISO8859-8 Tru64 A character encoding system (codeset) for Hebrew
deckorean Tru64 A character encoding system (codeset) for Korean
iso8859-7 Tru64 A character encoding system (codeset) for Greek
tactis Tru64 A character encoding system (codeset) for Thai.
ISO8859-9 Tru64 A character encoding system (codeset) for Turkish
Copyright © 2004-2005 DeniX Solutions SRL
newsletter delivery service