deckanji - A character encoding system (codeset) for
Japanese
The DEC Kanji codeset consists of the following characters:
ASCII or JIS X0201 Roman letters Katakana characters
specified by JIS X0201 JIS X0208 characters User-defined
characters (UDC)
DEC Kanji uses a combination of single-byte data and
2-byte data to represent these characters.
JIS X0201 is a single-byte character set and consists of
Roman letters and Katakana characters, which are Japanese
phonetic symbols. The Roman letters defined in the JIS
X0201-1976 standard are the same as ASCII letters. (For
information on the ASCII character set, see ascii(5).)
However, JIS X0201 replaces the ASCII tilde (~) character
with a horizontal bar (located at the upper part of a
character cell), and replaces the backslash (\) with the
Japanese currency sign (Yen).
The JIS XO208 standard specifies 2-byte character values
that represent a variety of characters, including ideographic
symbols.
DEC Kanji Encoding [Toc] [Back]
All ASCII characters are represented by single-byte 7-bit
values in DEC Kanji. That is, the most significant bit
(MSB) is always set off in the byte that represents an
ASCII character. The Roman letters and the Katakana characters
specified by JIS X0201 are also single-byte values,
in which the most significant bit (MSB) is set off and on,
respectively. Some applications and vendors assume Roman
characters for the lower 7-bit values, while others assume
ASCII. However, for Unicode conversion on Tru64 UNIX, the
Japanese_UCS converters (deckanji, sdeckanji, SJIS, and
eucJP) consider the lower 7-bit values to be ASCII.
The code table for JIS X0208 characters is divided into 94
rows, numbered from 1 to 94. Each row has 94 columns, also
numbered from 1 to 94. JIS X0208 defines a total of 6877
characters, which include the following: Special symbols,
in rows 1 and 2 Numerals and Roman letters, in row 3 Hiragana
characters, in row 4 Katakana characters, in row 5
Greek letters, in row 6 Russian letters, in row 7 Symbols
for drawing graphs, diagrams, and lines, in row 8 Firstlevel
Kanji characters, in rows 16 to 47 Second-level
Kanji characters, in rows 48 to 84
To comply with the JIS X0208 standard, each JIS X0208
character is a 2-byte value in the DEC Kanji codeset. The
MSB of both the first and second bytes is always set on to
distinguish JIS X0208 characters from ASCII/JIS Roman or
user-defined characters.
For each JIS X0208 character, the first byte of the 2-byte
value determines the row number and the second determines
the column number in the JIS X0208 code table. The following
formula shows the code value for a JIS X0208 character
in relation to its row and column numbers:
1st byte = A0 + Row number
2nd byte = A0 + Column number
For example, if a character is positioned at the first
column of the 36th row, its code value is C4A1, which is
calculated as follows:
1st byte = A0 (hex) + 36 = C4 (hex)
2nd byte = A0 (hex) + 01 = A1 (hex)
For user-defined character (UDC) definitions, DEC Kanji
provides an area of 2914 positions (from row 1 to row 31).
Each UDC is represented by a 2-byte value, just like a JIS
X0208 character value. However, the MSB of the second byte
of a UDC is set off to distinguish it from a JIS X0208
character. The code range of the UDC area is A121 to
BF7E.
The following formula calculates the code of a UDC in
relation to its row and column numbers:
1st byte = A0 + Row number
2nd byte = 20 + Column number
For example, if a UDC is positioned at the first column of
the 16th row, its code value is B021, which is calculated
as follows:
1st byte = A0 (hex) + 16 = B0 (hex)
2nd byte = 20 (hex) + 01 = 21 (hex)
Codeset Conversion [Toc] [Back]
The following codeset converter pairs are available for
converting Japanese characters between deckanji and other
encoding formats. See iconv_intro(5) for an introduction
to codeset conversion. For more information about the
other codeset for which deckanji is the input or output,
see the reference page specified in the list item.
eucJP_deckanji, deckanji_eucJP
Converting from and to Japanese Extended UNIX Code:
eucJP(5). iso-2022-jp_deckanji, deckanji_iso-2022-jp
Converting from and to the ISO 2022-JP codeset:
iso2022jp(5). iso-2022-jpext_deckanji, deckanji_iso-2022-jpext
Converting from and to the ISO 2022-JPexp codeset:
iso2022jp(5). JIS7_deckanji, deckanji_JIS7
Converting from and to the JIS7 codeset:
jiskanji(5). sdeckanji_deckanji, deckanji_sdeckanji
Converting from and to the Super DEC Kanji codeset:
sdeckanji(5). SJIS_deckanji, deckanji_SJIS
Converting from and to the Shift JIS codeset:
shiftjis(5).
Shift JIS encoding is equivalent to the Microsoft
code-page format used on PCs for Japanese. Therefore,
you can use these converters to convert data
between DEC Kanji and PC code-page format. See
code_page(5) for information on PC code pages.
UTF-16_deckanji, deckanji_UTF-16
Converting from and to UTF-16 format: Unicode(5).
UCS-4_deckanji, deckanji_UCS-4
Converting from and to UCS-4 format: Unicode(5).
UTF-8_deckanji, deckanji_UTF-8
Converting from and to UTF-8 format: Unicode(5).
Japanese Fonts [Toc] [Back]
The operating system provides the following Japanese
bitmap fonts in various sizes and typefaces for 75dpi and
100dpi (dot-per-inch) display devices: JIS X0201-1976
characters (Gothic family):
-jdecw-gothic-medium-r-normal--8-80-75-75-m-40-jisx0201.1976-0
-jdecw-gothicmedium-r-normal--14-140-75-75-m-70-jisx0201.1976-0
-jdecw-gothic-medium-r-normal--12-120-75-75-m-60-jisx0201.1976-0
-jdecwgothic-medium-r-nor-
mal--24-240-75-75-m-120-jisx0201.1976-0 -jdecwgothic-medium-r-nor-
mal--10-100-75-75-m-50-jisx0201.1976-0 -jdecwgothic-medium-r-nor-
mal--18-180-75-75-m-90-jisx0201.1976-0 -jdecwgothic-medium-r-nor-
mal--17-120-100-100-m-85-jisx0201.1976-0 -jdecwgothic-medium-r-nor-
mal--34-240-100-100-m-170-jisx0201.1976-0 -jdecwgothic-medium-r-nor-
mal--14-100-100-100-m-70-jisx0201.1976-0 -jdecwgothic-medium-r-nor-
mal--25-180-100-100-m-125-jisx0201.1976-0 -jdecwgothic-medium-r-nor-
mal--20-140-100-100-m-100-jisx0201.1976-0 -jdecwgothic-medium-r-nor-
mal--11-80-100-100-m-55-jisx0201.1976-0 JIS
X0201-1976 characters (Kmenu family)
-jdecw-kmenu-medium-r-normal--12-120-75-75-p-70-jisx0201.1976-0
-jdecwkmenu-medium-r-nor-
mal--17-120-100-100-p-85-jisx0201.1976-0 JIS
X0201-1976 characters (Mincho family)
-jdecw-mincho-medium-r-normal--8-80-75-75-m-40-jisx0201.1976-0
-jdecw-minchomedium-r-normal--14-140-75-75-m-70-jisx0201.1976-0
-jdecw-mincho-medium-r-normal--24-240-75-75-m-120-jisx0201.1976-0
-jdecw-mincho-medium-r-nor-
mal--10-100-75-75-m-50-jisx0201.1976-0 -jdecw-mincho-medium-r-nor-
mal--18-180-75-75-m-90-jisx0201.1976-0 -jdecw-mincho-medium-r-nor-
mal--17-120-100-100-m-85-jisx0201.1976-0 -jdecwmincho-medium-r-nor-
mal--34-240-100-100-m-170-jisx0201.1976-0 -jdecwmincho-medium-r-nor-
mal--14-100-100-100-m-70-jisx0201.1976-0 -jdecwmincho-medium-r-nor-
mal--25-180-100-100-m-125-jisx0201.1976-0 -jdecwmincho-medium-r-nor-
mal--20-140-100-100-m-100-jisx0201.1976-0 -jdecwmincho-medium-r-nor-
mal--11-80-100-100-m-55-jisx0201.1976-0 JIS
X0201-1976 characters (Screen family)
-jdecw-screen-medium-r-normal--24-240-75-75-m-120-jisx0201-romankana
-jdecwscreen-medium-r-nor-
mal--18-180-75-75-m-80-jisx0201-romankana -jdecwscreen-medium-r-nor-
mal--14-140-75-75-m-70-jisx0201-romankana -jdecwscreen-medium-r-nor-
mal--10-100-75-75-m-50-jisx0201-romankana JIS X0208
characters (Gothic family)
-jdecw-gothic-medium-r-normal--14-140-75-75-m-140-jisx0208.1983-1
-jdecwgothic-medium-r-nor-
mal--12-120-75-75-m-120-jisx0208.1983-1 -jdecwgothic-medium-r-nor-
mal--24-240-75-75-m-240-jisx0208.1983-1 -jdecwgothic-medium-r-nor-
mal--10-100-75-75-m-100-jisx0208.1983-1 -jdecwgothic-medium-r-nor-
mal--18-180-75-75-m-180-jisx0208.1983-1 -jdecwgothic-medium-r-nor-
mal--8-80-75-75-m-80-jisx0208.1983-1 -jdecw-gothicmedium-r-nor-
mal--17-120-100-100-m-170-jisx0208.1983-1 -jdecwgothic-medium-r-nor-
mal--34-240-100-100-m-340-jisx0208.1983-1 -jdecwgothic-medium-r-nor-
mal--14-100-100-100-m-140-jisx0208.1983-1 -jdecwgothic-medium-r-nor-
mal--25-180-100-100-m-250-jisx0208.1983-1 -jdecwgothic-medium-r-nor-
mal--20-140-100-100-m-200-jisx0208.1983-1 -jdecwgothic-medium-r-nor-
mal--11-80-100-100-m-110-jisx0208.1983-1 JIS X0208
characters (Mincho family)
-jdecw-mincho-medium-r-normal--14-140-75-75-m-140-jisx0208.1983-1
jdecw-mincho-medium-r-nor-
mal--12-120-75-75-m-120-jisx0208.1983-1 -jdecw-mincho-medium-r-nor-
mal--24-240-75-75-m-240-jisx0208.1983-1 -jdecw-mincho-medium-r-nor-
mal--10-100-75-75-m-100-jisx0208.1983-1 -jdecw-mincho-medium-r-nor-
mal--18-180-75-75-m-180-jisx0208.1983-1 -jdecw-mincho-medium-r-nor-
mal--8-80-75-75-m-80-jisx0208.1983-1 -jdecw-minchomedium-r-nor-
mal--17-120-100-100-m-170-jisx0208.1983-1 -jdecwmincho-medium-r-nor-
mal--34-240-100-100-m-340-jisx0208.1983-1 -jdecwmincho-medium-r-nor-
mal--14-100-100-100-m-140-jisx0208.1983-1 -jdecwmincho-medium-r-nor-
mal--25-180-100-100-m-250-jisx0208.1983-1 -jdecwmincho-medium-r-nor-
mal--20-140-100-100-m-200-jisx0208.1983-1 -jdecwmincho-medium-r-nor-
mal--11-80-100-100-m-110-jisx0208.1983-1 JIS X0208
characters (Screen family)
-jdecw-screen-medium-r-normal--24-240-75-75-m-240-jisx0208-kanji00
-jdecwscreen-medium-r-nor-
mal--10-100-75-75-m-100-jisx0208-kanji00 -jdecwscreen-medium-r-nor-
mal--18-180-75-75-m-160-jisx0208-kanji00 -jdecwscreen-medium-r-nor-
mal--16-160-75-75-m-160-jisx0208-kanji00 -jdecwscreen-medium-r-nor-
mal--14-140-75-75-m-140-jisx0208-kanji00 -jdecwscreen-medium-r-nor-
mal--24-240-75-75-m-240-jisx0208-kanji11 -jdecwscreen-medium-r-nor-
mal--10-100-75-75-m-100-jisx0208-kanji11 -jdecwscreen-medium-r-nor-
mal--18-180-75-75-m-160-jisx0208-kanji11 -jdecwscreen-medium-r-nor-
mal--14-140-75-75-m-140-jisx0208-kanji11
For printers, the operating system provides only Japanese
fonts that are printer-resident; that is, there are no
Japanese fonts that can be dynamically downloaded to the
printer. See i18n_printing(5) for general information on
printing non-English text.
Commands: locale(1)
Others: ascii(5), code_page(5), eucJP(5), i18n_intro(5),
i18n_printing(5), iconv_intro(5), iso2022jp(5),
Japanese(5), jiskanji(5), sdeckanji(5), shiftjis(5), Unicode(5)
deckanji(5)
[ Back ] |