*nix Documentation Project
·  Home
 +   man pages
·  Linux HOWTOs
·  FreeBSD Tips
·  *niX Forums

  man pages->Tru64 Unix man pages -> deckanji (5)              
Title
Content
Arch
Section
 

deckanji(5)

Contents


NAME    [Toc]    [Back]

       deckanji  -  A  character  encoding  system  (codeset) for
       Japanese

DESCRIPTION    [Toc]    [Back]

       The DEC Kanji codeset consists of  the  following  characters:
 ASCII or JIS X0201 Roman letters Katakana characters
       specified by JIS X0201 JIS X0208  characters  User-defined
       characters (UDC)

       DEC  Kanji  uses  a  combination  of  single-byte data and
       2-byte data to represent these characters.

       JIS X0201 is a single-byte character set and  consists  of
       Roman  letters and Katakana characters, which are Japanese
       phonetic symbols. The Roman letters  defined  in  the  JIS
       X0201-1976  standard  are  the same as ASCII letters. (For
       information on the ASCII  character  set,  see  ascii(5).)
       However,  JIS X0201 replaces the ASCII tilde (~) character
       with a horizontal bar (located at  the  upper  part  of  a
       character  cell),  and replaces the backslash (\) with the
       Japanese currency sign (Yen).

       The JIS XO208 standard specifies 2-byte  character  values
       that  represent  a  variety of characters, including ideographic
 symbols.

   DEC Kanji Encoding    [Toc]    [Back]
       All ASCII characters are represented by single-byte  7-bit
       values  in  DEC  Kanji.  That is, the most significant bit
       (MSB) is always set off in the  byte  that  represents  an
       ASCII  character. The Roman letters and the Katakana characters
 specified by JIS X0201 are also single-byte values,
       in which the most significant bit (MSB) is set off and on,
       respectively. Some applications and vendors  assume  Roman
       characters for the lower 7-bit values, while others assume
       ASCII. However, for Unicode conversion on Tru64 UNIX,  the
       Japanese_UCS  converters  (deckanji,  sdeckanji, SJIS, and
       eucJP) consider the lower 7-bit values to be ASCII.

       The code table for JIS X0208 characters is divided into 94
       rows, numbered from 1 to 94. Each row has 94 columns, also
       numbered from 1 to 94. JIS X0208 defines a total  of  6877
       characters,  which include the following: Special symbols,
       in rows 1 and 2 Numerals and Roman letters, in row 3 Hiragana
  characters,  in  row 4 Katakana characters, in row 5
       Greek letters, in row 6 Russian letters, in row 7  Symbols
       for  drawing  graphs, diagrams, and lines, in row 8 Firstlevel
 Kanji characters, in  rows  16  to  47  Second-level
       Kanji characters, in rows 48 to 84

       To  comply  with  the  JIS  X0208 standard, each JIS X0208
       character is a 2-byte value in the DEC Kanji codeset.  The
       MSB of both the first and second bytes is always set on to
       distinguish JIS X0208 characters from ASCII/JIS  Roman  or
       user-defined characters.

       For each JIS X0208 character, the first byte of the 2-byte
       value determines the row number and the second  determines
       the column number in the JIS X0208 code table. The following
 formula shows the code value for a JIS X0208 character
       in relation to its row and column numbers:

       1st byte = A0 + Row number
       2nd byte = A0 + Column number

       For  example,  if  a  character is positioned at the first
       column of the 36th row, its code value is C4A1,  which  is
       calculated as follows:

       1st byte = A0 (hex) + 36 = C4 (hex)
       2nd byte = A0 (hex) + 01 = A1 (hex)

       For  user-defined  character  (UDC) definitions, DEC Kanji
       provides an area of 2914 positions (from row 1 to row 31).
       Each UDC is represented by a 2-byte value, just like a JIS
       X0208 character value. However, the MSB of the second byte
       of  a  UDC  is  set off to distinguish it from a JIS X0208
       character.  The code range of the  UDC  area  is  A121  to
       BF7E.

       The  following  formula  calculates  the  code of a UDC in
       relation to its row and column numbers:

       1st byte = A0 + Row number
       2nd byte = 20 + Column number

       For example, if a UDC is positioned at the first column of
       the  16th row, its code value is B021, which is calculated
       as follows:

       1st byte = A0 (hex) + 16 = B0 (hex)
       2nd byte = 20 (hex) + 01 = 21 (hex)


   Codeset Conversion    [Toc]    [Back]
       The following codeset converter pairs  are  available  for
       converting  Japanese characters between deckanji and other
       encoding formats.  See iconv_intro(5) for an  introduction
       to  codeset  conversion.  For  more  information about the
       other codeset for which deckanji is the input  or  output,
       see  the  reference  page  specified  in  the  list  item.
       eucJP_deckanji, deckanji_eucJP

              Converting from and to Japanese Extended UNIX Code:
              eucJP(5).         iso-2022-jp_deckanji,       deckanji_iso-2022-jp


              Converting from and to  the  ISO  2022-JP  codeset:
              iso2022jp(5).     iso-2022-jpext_deckanji,    deckanji_iso-2022-jpext


              Converting from and to the ISO 2022-JPexp  codeset:
              iso2022jp(5).  JIS7_deckanji, deckanji_JIS7

              Converting   from   and   to   the   JIS7  codeset:
              jiskanji(5).   sdeckanji_deckanji,  deckanji_sdeckanji


              Converting from and to the Super DEC Kanji codeset:
              sdeckanji(5).  SJIS_deckanji, deckanji_SJIS

              Converting from  and  to  the  Shift  JIS  codeset:
              shiftjis(5).

              Shift  JIS  encoding is equivalent to the Microsoft
              code-page format used on PCs for  Japanese.  Therefore,
  you can use these converters to convert data
              between DEC Kanji  and  PC  code-page  format.  See
              code_page(5)  for  information  on  PC  code pages.
              UTF-16_deckanji, deckanji_UTF-16

              Converting from and to UTF-16  format:  Unicode(5).
              UCS-4_deckanji, deckanji_UCS-4

              Converting  from  and  to UCS-4 format: Unicode(5).
              UTF-8_deckanji, deckanji_UTF-8

              Converting from and to UTF-8 format: Unicode(5).

   Japanese Fonts    [Toc]    [Back]
       The  operating  system  provides  the  following  Japanese
       bitmap  fonts in various sizes and typefaces for 75dpi and
       100dpi  (dot-per-inch)  display  devices:  JIS  X0201-1976
       characters (Gothic family):

              -jdecw-gothic-medium-r-normal--8-80-75-75-m-40-jisx0201.1976-0
 -jdecw-gothicmedium-r-normal--14-140-75-75-m-70-jisx0201.1976-0

              -jdecw-gothic-medium-r-normal--12-120-75-75-m-60-jisx0201.1976-0
      -jdecwgothic-medium-r-nor-

              mal--24-240-75-75-m-120-jisx0201.1976-0     -jdecwgothic-medium-r-nor-

              mal--10-100-75-75-m-50-jisx0201.1976-0      -jdecwgothic-medium-r-nor-

              mal--18-180-75-75-m-90-jisx0201.1976-0      -jdecwgothic-medium-r-nor-

              mal--17-120-100-100-m-85-jisx0201.1976-0    -jdecwgothic-medium-r-nor-

              mal--34-240-100-100-m-170-jisx0201.1976-0   -jdecwgothic-medium-r-nor-

              mal--14-100-100-100-m-70-jisx0201.1976-0    -jdecwgothic-medium-r-nor-

              mal--25-180-100-100-m-125-jisx0201.1976-0   -jdecwgothic-medium-r-nor-

              mal--20-140-100-100-m-100-jisx0201.1976-0   -jdecwgothic-medium-r-nor-

              mal--11-80-100-100-m-55-jisx0201.1976-0         JIS
              X0201-1976 characters (Kmenu family)

              -jdecw-kmenu-medium-r-normal--12-120-75-75-p-70-jisx0201.1976-0
      -jdecwkmenu-medium-r-nor-

              mal--17-120-100-100-p-85-jisx0201.1976-0        JIS
              X0201-1976 characters (Mincho family)

              -jdecw-mincho-medium-r-normal--8-80-75-75-m-40-jisx0201.1976-0
 -jdecw-minchomedium-r-normal--14-140-75-75-m-70-jisx0201.1976-0

              -jdecw-mincho-medium-r-normal--24-240-75-75-m-120-jisx0201.1976-0
 -jdecw-mincho-medium-r-nor-

              mal--10-100-75-75-m-50-jisx0201.1976-0  -jdecw-mincho-medium-r-nor-

              mal--18-180-75-75-m-90-jisx0201.1976-0  -jdecw-mincho-medium-r-nor-

              mal--17-120-100-100-m-85-jisx0201.1976-0    -jdecwmincho-medium-r-nor-

              mal--34-240-100-100-m-170-jisx0201.1976-0   -jdecwmincho-medium-r-nor-

              mal--14-100-100-100-m-70-jisx0201.1976-0    -jdecwmincho-medium-r-nor-

              mal--25-180-100-100-m-125-jisx0201.1976-0   -jdecwmincho-medium-r-nor-

              mal--20-140-100-100-m-100-jisx0201.1976-0   -jdecwmincho-medium-r-nor-

              mal--11-80-100-100-m-55-jisx0201.1976-0         JIS
              X0201-1976 characters (Screen family)

              -jdecw-screen-medium-r-normal--24-240-75-75-m-120-jisx0201-romankana
  -jdecwscreen-medium-r-nor-

              mal--18-180-75-75-m-80-jisx0201-romankana   -jdecwscreen-medium-r-nor-

              mal--14-140-75-75-m-70-jisx0201-romankana   -jdecwscreen-medium-r-nor-

              mal--10-100-75-75-m-50-jisx0201-romankana JIS X0208
              characters (Gothic family)

              -jdecw-gothic-medium-r-normal--14-140-75-75-m-140-jisx0208.1983-1
     -jdecwgothic-medium-r-nor-

              mal--12-120-75-75-m-120-jisx0208.1983-1     -jdecwgothic-medium-r-nor-

              mal--24-240-75-75-m-240-jisx0208.1983-1     -jdecwgothic-medium-r-nor-

              mal--10-100-75-75-m-100-jisx0208.1983-1     -jdecwgothic-medium-r-nor-

              mal--18-180-75-75-m-180-jisx0208.1983-1     -jdecwgothic-medium-r-nor-

              mal--8-80-75-75-m-80-jisx0208.1983-1 -jdecw-gothicmedium-r-nor-

              mal--17-120-100-100-m-170-jisx0208.1983-1   -jdecwgothic-medium-r-nor-

              mal--34-240-100-100-m-340-jisx0208.1983-1   -jdecwgothic-medium-r-nor-

              mal--14-100-100-100-m-140-jisx0208.1983-1   -jdecwgothic-medium-r-nor-

              mal--25-180-100-100-m-250-jisx0208.1983-1   -jdecwgothic-medium-r-nor-

              mal--20-140-100-100-m-200-jisx0208.1983-1   -jdecwgothic-medium-r-nor-

              mal--11-80-100-100-m-110-jisx0208.1983-1 JIS  X0208
              characters (Mincho family)

              -jdecw-mincho-medium-r-normal--14-140-75-75-m-140-jisx0208.1983-1
  jdecw-mincho-medium-r-nor-

              mal--12-120-75-75-m-120-jisx0208.1983-1 -jdecw-mincho-medium-r-nor-

              mal--24-240-75-75-m-240-jisx0208.1983-1 -jdecw-mincho-medium-r-nor-

              mal--10-100-75-75-m-100-jisx0208.1983-1 -jdecw-mincho-medium-r-nor-

              mal--18-180-75-75-m-180-jisx0208.1983-1 -jdecw-mincho-medium-r-nor-

              mal--8-80-75-75-m-80-jisx0208.1983-1 -jdecw-minchomedium-r-nor-

              mal--17-120-100-100-m-170-jisx0208.1983-1   -jdecwmincho-medium-r-nor-

              mal--34-240-100-100-m-340-jisx0208.1983-1   -jdecwmincho-medium-r-nor-

              mal--14-100-100-100-m-140-jisx0208.1983-1   -jdecwmincho-medium-r-nor-

              mal--25-180-100-100-m-250-jisx0208.1983-1   -jdecwmincho-medium-r-nor-

              mal--20-140-100-100-m-200-jisx0208.1983-1   -jdecwmincho-medium-r-nor-

              mal--11-80-100-100-m-110-jisx0208.1983-1 JIS  X0208
              characters (Screen family)

              -jdecw-screen-medium-r-normal--24-240-75-75-m-240-jisx0208-kanji00
    -jdecwscreen-medium-r-nor-

              mal--10-100-75-75-m-100-jisx0208-kanji00    -jdecwscreen-medium-r-nor-

              mal--18-180-75-75-m-160-jisx0208-kanji00    -jdecwscreen-medium-r-nor-

              mal--16-160-75-75-m-160-jisx0208-kanji00    -jdecwscreen-medium-r-nor-

              mal--14-140-75-75-m-140-jisx0208-kanji00    -jdecwscreen-medium-r-nor-

              mal--24-240-75-75-m-240-jisx0208-kanji11    -jdecwscreen-medium-r-nor-

              mal--10-100-75-75-m-100-jisx0208-kanji11    -jdecwscreen-medium-r-nor-

              mal--18-180-75-75-m-160-jisx0208-kanji11    -jdecwscreen-medium-r-nor-

              mal--14-140-75-75-m-140-jisx0208-kanji11


       For printers, the operating system provides only  Japanese
       fonts  that  are  printer-resident;  that is, there are no
       Japanese fonts that can be dynamically downloaded  to  the
       printer.  See  i18n_printing(5) for general information on
       printing non-English text.

SEE ALSO    [Toc]    [Back]

      
      
       Commands: locale(1)

       Others: ascii(5), code_page(5),  eucJP(5),  i18n_intro(5),
       i18n_printing(5),       iconv_intro(5),      iso2022jp(5),
       Japanese(5), jiskanji(5), sdeckanji(5), shiftjis(5),  Unicode(5)



                                                      deckanji(5)
[ Back ]
 Similar pages
Name OS Title
ISO8859-1 Tru64 A character encoding system (codeset)
iso8859-1 Tru64 A character encoding system (codeset)
iso8859-9 Tru64 A character encoding system (codeset) for Turkish
iso8859-8 Tru64 A character encoding system (codeset) for Hebrew
iso8859-5 Tru64 A character encoding system (codeset) for Russian
ISO8859-8 Tru64 A character encoding system (codeset) for Hebrew
deckorean Tru64 A character encoding system (codeset) for Korean
iso8859-7 Tru64 A character encoding system (codeset) for Greek
tactis Tru64 A character encoding system (codeset) for Thai.
ISO8859-9 Tru64 A character encoding system (codeset) for Turkish
Copyright © 2004-2005 DeniX Solutions SRL
newsletter delivery service