wdutil(1) wdutil(1)
NAME [Toc] [Back]
wdutil - manipulate Native Language I/O word dictionary
SYNOPSIS [Toc] [Back]
wdutil [ -c | -i[kcap][,dcap] | -jjfile ] file
wdutil [ -pd[desig] | -pk[desig] ] file
wdutil [ -sd[[+|-]val] | -sk[[+|-]val] ] file
wdutil [ -ud | -uk | -ut ] file
wdutil [ -d | -l ] file
DESCRIPTION [Toc] [Back]
wdutil is used to manipulate the word dictionary used by Native
Language I/O for phrase and word conversion. The word dictionary
consists of a key entries block and a data entries block. The key
entries block holds the designations, and the data entries block holds
the words corresponding to each designation. wdutil also functions as
a filter for transforming a word dictionary to a text file, and vice
versa. See the Text File section for the layout of a text file.
wdutil recognizes one of the options below. If no option is specified
and the file is a valid word dictionary, the capacity of the key and
data entries blocks in the file is displayed. Otherwise, an error
message is printed.
The capacity of the key entries block determines the maximum number of
designations. The capacity of the data entries block determines the
maximum number of words.
Options [Toc] [Back]
-c Condense the data entries block in the file to obtain a
larger contiguous free area. If the format version of
the file is old, it is updated.
-i[kcap][,dcap]
Initialize the file as a word dictionary which has key
entries block capacity specified by kcap and data
entries block capacity specified by dcap. If the file
does not exist, it is created. The default values are
499 for kcap and 650 for dcap.
-jjfile Join the dictionary jfile into the file. The capacity
of the resulting file is the sum of the capacities of
the original file and the jfile.
-pk[desig] Display the designations in the order of their code
value. If desig ends with *, designations starting
with desig are printed. If desig is * or omitted, all
designations in the file are printed.
-pd[desig] Display the designations and their corresponding words
and part of speeches. The string desig has the same
Hewlett-Packard Company - 1 - HP-UX 11i Version 2.0 : Jul 2003
wdutil(1) wdutil(1)
format as in -pk.
-sd[[+|-]val] Change the capacity of the data entries block in the
file. If + or - precedes val, the current value is
incremented or decremented by val. Otherwise, the
capacity is changed to val. The default value for val
is 650.
-sk[[+|-]val] Change the capacity of the key entries block in the
file. The number val has the same format as in -sd
option. The default value for val is 499.
-ud Display the capacity and usage of the data entries
block, and the size of contiguous free area.
-uk Display the capacity and usage of the key entries
block.
-ut Display the capacity and usage of both the key and data
entries blocks, and the size of contiguous free area of
the data entries block.
-d Read a word dictionary, transform it into text form,
and dump it to the standard output. If the word
includes a character whose code is undefined in $LANG
code set, its internal code is dumped in hexadecimal
notation.
-l Load the entry lines in text form from the standard
input into the specified word dictionary. If specified
dictionary exists, wdutil overwrites it with loaded
entry lines; otherwise wdutil creates a new one
containing them. If an entry line is invalid, it is
rejected and an error message is displayed on the
standard error.
Text File [Toc] [Back]
Each entry line in the text file consists of the following fields
terminated by \n. White space can be used as field separator. The
3rd field is effective only if LANG=japanese, japanese.euc,
ja_JP.SJIS, or ja_JP.eucJP
designation word hinshi(part of speech)
designation
Consists of up to sixteen characters excluding special
characters. However, after being transformed by the -d option,
all characters in designation are 2-byte characters in a text
file.
Hewlett-Packard Company - 2 - HP-UX 11i Version 2.0 : Jul 2003
wdutil(1) wdutil(1)
word The word corresponding to designation consists of up to 50 bytes
of multi-byte characters. The word may have hexadecimal notation
instead of multi-byte characters. For example, the hexadecimal
notation '\x7e7e' is recognized as a character whose internal
code is 0x7e7e.
hinshi
Specify a part of speech which is one of noun, sa-hen verb,
surname, personal name, and address. Filling conventions are
FUTSUUMEISHI(or simply MEISHI), SAHENDOUSHI(or simply SAHEN),
SEI, MEI and CHIMEI in kanji character. If nothing is specified,
wdutil sets it FUTSUUMEISHI automatically.
EXTERNAL INFLUENCES [Toc] [Back]
International Code Set Support
Single byte and multibyte character code sets are supported.
WARNINGS [Toc] [Back]
The smallest prime number not smaller than the given value is used as
the capacity of a key entries block. However, if the given value is
smaller than 5, 5 is used.
Voiced plosive or non_voiced plosive in a designation is counted as 1
character in a text file.
User dictionaries with old format version are supported on HP-UX 10.0,
but they will not be supported in the future. To update them, use -c
option:
$ wdutil -c file
AUTHOR [Toc] [Back]
wdutil was developed by HP.
Hewlett-Packard Company - 3 - HP-UX 11i Version 2.0 : Jul 2003 [ Back ] |