Add missing files from Unicode 5.1 Character Database
[p5sagit/p5-mst-13.2.git] / lib / unicore / README.perl
CommitLineData
1911be83 1The *.txt files were copied from
8836d2a5 2
b6922eda 3 http://www.unicode.org/Public/5.1.0/ucd
4
61131c94 5and subdirectories 'extracted' and 'auxiliary' as of Unicode 5.1.0 (March 2008).
6
7The big file, Unihan.txt (28 MB, 5.8 MB zip) was not included due to space
8considerations. Also NOT included were any *.html files and *Test.txt files.
9
10To be 8.3 filesystem friendly, the names of some of the input files have been
11changed from the values that are in the Unicode DB:
12
13mv PropertyValueAliases.txt PropValueAliases.txt
14mv NamedSequencesProv.txt NamedSqProv.txt
15mv DerivedAge.txt DAge.txt
16mv DerivedCoreProperties.txt DCoreProperties.txt
17mv DerivedNormalizationProps.txt DNormalizationProps.txt
18mv extracted/DerivedBidiClass.txt extracted/DBidiClass.txt
19mv extracted/DerivedBinaryProperties.txt extracted/DBinaryProperties.txt
20mv extracted/DerivedCombiningClass.txt extracted/DCombiningClass.txt
21mv extracted/DerivedDecompositionType.txt extracted/DDecompositionType.txt
22mv extracted/DerivedEastAsianWidth.txt extracted/DEastAsianWidth.txt
23mv extracted/DerivedGeneralCategory.txt extracted/DGeneralCategory.txt
24mv extracted/DerivedJoiningGroup.txt extracted/DJoinGroup.txt
25mv extracted/DerivedJoiningType.txt extracted/DJoinType.txt
26mv extracted/DerivedLineBreak.txt extracted/DLineBreak.txt
27mv extracted/DerivedNumericType.txt extracted/DNumType.txt
28mv extracted/DerivedNumericValues.txt extracted/DNumValues.txt
29
30The names of files, such as test files, that are not used by mktables are not
31changed, and will not work correctly on 8.3 filesystems.
32
33The file 'version' should exist and be a single line with the Unicode version,
34like 5.1.0
dbe75581 35
97050450 36NOTE: If you modify the input file set you should also run
37
38 mktables -makelist
39
40which will recreate the mktables.lst file which is used to speed up
41the build process.
42
0fa75b59 43FOR PUMPKINS
44
b6922eda 45The files are inter-related. If you take the latest UnicodeData.txt, for example,
46but leave the older versions of other files, there can be subtle problems.
47
a2bd7410 48The *.pl files are generated from the *.txt files by the mktables script,
49more recently done during the Perl build process, but if you want to try
50the old manual way:
0fa75b59 51
52 cd lib/unicore
53 cp .../UnicodeOriginal/*.txt .
54 rm NormalizationTest.txt Unihan.txt Derived*.txt
e1aef32f 55 p4 edit Properties *.pl */*.pl
0fa75b59 56 perl ./mktables
57 p4 revert -a
58 cd ../..
59 perl Porting/manicheck
60
e1aef32f 61You need to update version by hand
62
63 p4 edit version
64 ...
65
0fa75b59 66If any new (or deleted, unlikely but not impossible) *.pl files are indicated:
67
68 cd lib/unicore
69 p4 add ...
70 p4 delete ...
71 cd ../...
72 p4 edit MANIFEST
73 ...
74
75And finally:
76
77 p4 submit
8836d2a5 78
79--
e1aef32f 80jhi@iki.fi; updated by nick@ccl4.org
61131c94 81
82mktables is used to generate the tables used by the rest of Perl. It will warn
83you about any *.txt files in the directory substructure that it doesn't know
84about. You should remove any so-identified, or edit mktables to add them to
85its lists. You can run
86
87 mktables -globlist
88
89to have it try to process these tables generically.
90
91Also, for each new release, you should run
92
93 mktables -v -makelist
94
95to see all warnings and to recreate the mktables.lst file which is used to
96speed up the build process.
97
98The files are inter-related. If you take the latest UnicodeData.txt, for example,
99but leave the older versions of other files, there can be subtle problems.
100
101jhi@iki.fi; updated by nick@ccl4.org, public@khwilliamson.com