Commit | Line | Data |
afc46004 |
1 | August 10, 2001 |
d357d9fe |
2 | |
e2be6f07 |
3 | This directory contains the Unicode Character Database |
4 | data files. |
d357d9fe |
5 | |
e2be6f07 |
6 | Currently, the Unicode Character Database files are at |
7 | the version level: |
8 | |
afc46004 |
9 | Unicode Standard, Version 3.1.1 |
e2be6f07 |
10 | |
11 | For information about the standard itself, see |
12 | UAX #27, Unicode 3.1 <http://www.unicode.org/unicode/reports/tr27/> |
afc46004 |
13 | and the Unicode 3.1.1 Update Notice. |
14 | <http://www.unicode.org/versions/Unicode3.1.1.html> |
d357d9fe |
15 | |
16 | Detailed documentation of the files constituting the |
17 | Unicode Character Database (contributory data files for |
18 | the standard itself) can now be found in |
50fc4248 |
19 | UnicodeCharacterDatabase.html. See also UnicodeData.html, |
20 | PropList.html, NamesList.html, and DerivedProperties.html |
21 | for specific details about particular files or sets of |
22 | files. |
23 | |
e2be6f07 |
24 | Unihan.txt is a very large file. For convenience, the current |
afc46004 |
25 | Unicode 3.1.1 version of Unihan.txt is also available in |
26 | two compressed formats in the Unicode 3.1.1 update directory. |
27 | See: <http://www.unicode.org/Public/3.1-Update1/> or |
28 | <ftp://ftp.unicode.org/Public/3.1-Update1/> |
50fc4248 |
29 | |
afc46004 |
30 | Unihan-3.1.1.zip for Windows. (Use WinZip.) |
31 | Unihan-3.1.1.txt.gz for Unix. (Use gzip or gunzip.) |
e2be6f07 |
32 | |
33 | Note that the files are zipped in |
50fc4248 |
34 | exactly the same format they have on the server (with Unix |
e2be6f07 |
35 | line endings). From a browser, right-clicking on |
afc46004 |
36 | Unihan-3.1.1.zip will allow automatic download and unzip on a |
50fc4248 |
37 | Windows system with WinZip installed. |
38 | |
39 | |
40 | |
41 | |
d357d9fe |
42 | |
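The README says Unihan-3.1.1.txt.gz is meant to be unpacked with gzip or gunzip, and that the compressed files keep the Unix line endings they have on the server. For readers without those tools, the following is a minimal sketch of the same step using Python's standard library. It assumes the archive has already been downloaded to a local path. The function name `gunzip_file` and the paths in the usage note are hypothetical and do not come from the README.

```python
import gzip
import shutil
from pathlib import Path


def gunzip_file(src: Path, dest: Path) -> int:
    """Decompress the gzip file at src into dest and return the bytes written.

    Both files are handled in binary mode, so the Unix line endings
    inside the archive are copied through unchanged.
    """
    with gzip.open(src, "rb") as fin, open(dest, "wb") as fout:
        # Stream in chunks; Unihan.txt is large, so avoid reading it all at once.
        shutil.copyfileobj(fin, fout)
    return dest.stat().st_size
```

For example, assuming the archive sits in the current directory, `gunzip_file(Path("Unihan-3.1.1.txt.gz"), Path("Unihan-3.1.1.txt"))` would write the uncompressed file next to it.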