The *.txt files were copied from
- http://www.unicode.org/Public/4.1.0/ucd
-as of Unicode 4.1.0 (April 2005).
+ http://www.unicode.org/Public/5.1.0/ucd
-The two big files, NormalizationTest.txt (2.1 MB) and Unihan.txt
-(26.7 MB) were not included due to space considerations. Also NOT
+as of Unicode 5.1.0 (March 2008).
+
+The two big files, NormalizationTest.txt (2 MB) and Unihan.txt (28 MB,
+5.8 MB zip) were not included due to space considerations. Also NOT
included were any *.html files and the Derived*.txt files
DerivedAge.txt
DerivedCoreProperties.txt
DerivedNormalizationProps.txt
-To be 8.3-friendly, the lib/unicore/PropertyValueAliases.txt was
-renamed to be lib/unicore/PropValueAliases.txt, since otherwise
-it would have conflicted with lib/unicore/PropertyAliases.txt.
+or any files from subdirectories.
+
+To be 8.3 filesystem friendly, the lib/unicore/PropertyValueAliases.txt was
+renamed to be lib/unicore/PropValueAliases.txt and the
+lib/unicore/NamedSequencesProv.txt was renamed to be
+lib/unicore/NamedSqProv.txt, since otherwise they would have
+conflicted with lib/unicore/PropertyAliases.txt and
+lib/unicore/NamedSequences.txt.
NOTE: If you modify the input file set you should also run
FOR PUMPKINS
+The files are inter-related. If you take the latest UnicodeData.txt, for example,
+but leave the older versions of other files, there can be subtle problems.
+
The *.pl files are generated from the *.txt files by the mktables script,
more recently done during the Perl build process, but if you want to try
the old manual way: