August 10, 2001
This directory contains the Unicode Character Database
data files.
Currently, the Unicode Character Database files are at
the version level:
Unicode Standard, Version 3.1.1
For information about the standard itself, see
UAX #27, Unicode 3.1.
and the Unicode 3.1.1 Update Notice.
Detailed documentation of the files constituting the
Unicode Character Database (contributory data files for
the standard itself) can now be found in
UnicodeCharacterDatabase.html. See also UnicodeData.html,
PropList.html, NamesList.html, and DerivedProperties.html
for specific details about particular files or sets of
files.
Unihan.txt is a very large file. For convenience, the current
Unicode 3.1.1 version of Unihan.txt is also available in
two compressed formats in the Unicode 3.1.1 update directory.
See: or
Unihan-3.1.1.zip for Windows. (Use winzip)
Unihan-3.1.1.txt.gz for Unix. (Use gzip or gunzip)
Note that the files are zipped in
exactly the same format they have on the server (with Unix
line endings). From a browser, right-clicking on
Unihan-3.1.1.zip will allow automatic download and unzip on a
Windows system with winzip installed.