SYN SYN
[p5sagit/p5-mst-13.2.git] / lib / unicode / UCD301.html
CommitLineData
c529f79d 1<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"
2
3 "http://www.w3.org/TR/REC-html40/loose.dtd">
4
5<html>
6
c529f79d 7<head>
c529f79d 8<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
c529f79d 9<meta http-equiv="Content-Language" content="en-us">
c529f79d 10<meta name="GENERATOR" content="Microsoft FrontPage 4.0">
c529f79d 11<meta name="ProgId" content="FrontPage.Editor.Document">
c529f79d 12<link rel="stylesheet" href="http://www.unicode.org/unicode.css" type="text/css">
c529f79d 13<title>Unicode Character Database</title>
c529f79d 14</head>
15
c529f79d 16<body>
17
22d4bb9c 18<h1>UNICODE CHARACTER DATABASE<br>
19Version 3.0.1</h1>
c529f79d 20<table border="1" cellspacing="2" cellpadding="0" height="87" width="100%">
c529f79d 21 <tr>
c529f79d 22 <td valign="TOP" width="144">Revision</td>
22d4bb9c 23 <td valign="TOP">3.0.1</td>
c529f79d 24 </tr>
c529f79d 25 <tr>
c529f79d 26 <td valign="TOP" width="144">Authors</td>
c529f79d 27 <td valign="TOP">Mark Davis and Ken Whistler</td>
c529f79d 28 </tr>
c529f79d 29 <tr>
c529f79d 30 <td valign="TOP" width="144">Date</td>
22d4bb9c 31 <td valign="TOP">2000-08-17</td>
c529f79d 32 </tr>
c529f79d 33 <tr>
c529f79d 34 <td valign="TOP" width="144">This Version</td>
22d4bb9c 35 <td valign="TOP"><a
36 href="http://www.unicode.org/Public/3.0-Update1/UnicodeCharacterDatabase-3.0.1.html">http://www.unicode.org/Public/3.0-Update1/UnicodeCharacterDatabase-3.0.1.html</a></td>
c529f79d 37 </tr>
c529f79d 38 <tr>
c529f79d 39 <td valign="TOP" width="144">Previous Version</td>
22d4bb9c 40 <td valign="TOP"><a
41 href="http://www.unicode.org/Public/3.0-Update/UnicodeCharacterDatabase-3.0.0.html">http://www.unicode.org/Public/3.0-Update/UnicodeCharacterDatabase-3.0.0.html</a></td>
c529f79d 42 </tr>
c529f79d 43 <tr>
c529f79d 44 <td valign="TOP" width="144">Latest Version</td>
22d4bb9c 45 <td valign="TOP"><a
46 href="http://www.unicode.org/Public/UNIDATA/UnicodeCharacterDatabase.html">http://www.unicode.org/Public/UNIDATA/UnicodeCharacterDatabase.html</a></td>
c529f79d 47 </tr>
c529f79d 48</table>
22d4bb9c 49<p align="center">Copyright © 1995-2000 Unicode, Inc. All Rights reserved.</p>
50<h2>Disclaimer</h2>
51<p>The Unicode Character Database is provided as is by Unicode, Inc. No claims
52are made as to fitness for any particular purpose. No warranties of any kind are
53expressed or implied. The recipient agrees to determine applicability of
54information provided. If this file has been purchased on magnetic or optical
55media from Unicode, Inc., the sole remedy for any claim will be exchange of
56defective media within 90 days of receipt.</p>
57<p>This disclaimer is applicable for all other data files accompanying the
58Unicode Character Database, some of which have been compiled by the Unicode
59Consortium, and some of which have been supplied by other sources.</p>
60<h2>Limitations on Rights to Redistribute This Data</h2>
61<p>Recipient is granted the right to make copies in any form for internal
62distribution and to freely use the information supplied in the creation of
63products supporting the Unicode<sup>TM</sup> Standard. The files in the Unicode
64Character Database can be redistributed to third parties or other organizations
65(whether for profit or not) as long as this notice and the disclaimer notice are
66retained. Information can be extracted from these files and used in
67documentation or programs, as long as there is an accompanying notice indicating
68the source.</p>
69<h2>Introduction</h2>
70<p>The Unicode Character Database is a set of files that define the Unicode
71character properties and internal mappings. For more information about character
72properties and mappings, see <i><a
73href="http://www.unicode.org/unicode/uni2book/u2.html">The Unicode Standard</a></i>.</p>
74<p>The Unicode Character Database has been updated to reflect Version 3.0 of the
75Unicode Standard, with many characters added to those published in Version 2.0.
76A number of corrections have also been made to case mappings or other errors in
77the database noted since the publication of Version 2.0. Normative bidirectional
78properties have also been modified to reflect decisions of the Unicode Technical
79Committee.</p>
80<p>For more information on versions of the Unicode Standard and how to reference
81them, see <a href="http://www.unicode.org/unicode/standard/versions/">http://www.unicode.org/unicode/standard/versions/</a>.</p>
82<h2>Conformance</h2>
83<p>Character properties may be either normative or informative. <i>Normative</i>
84means that implementations that claim conformance to the Unicode Standard (at a
85particular version) and which make use of a particular property or field must
86follow the specifications of the standard for that property or field in order to
87be conformant. The term <i>normative</i> when applied to a property or field of
88the Unicode Character Database, does <i>not</i> mean that the value of that
89field will never change. Corrections and extensions to the standard in the
90future may require minor changes to normative values, even though the Unicode
91Technical Committee strives to minimize such changes. An<i> informative </i>property
92or field is strongly recommended, but a conformant implementation is free to use
93or change such values as it may require while still being conformant to the
94standard. Particular implementations may choose to override the properties and
95mappings that are not normative. In that case, it is up to the implementer to
96establish a protocol to convey that information.</p>
97<h2>Files</h2>
98<p>The following summarizes the files in the Unicode Character Database. &nbsp;For
99more information about these files, see the referenced technical report(s) or
100section of Unicode Standard, Version 3.0.</p>
101<p><b>UnicodeData.txt (Chapter 4, <a
102href="http://www.unicode.org/unicode/reports/tr21/">UTR #21: Case Mappings</a>, <a
103href="http://www.unicode.org/unicode/reports/tr15/">UAX #15 Unicode Normalization
104Forms</a>)</b>
105<ul>
106 <li>The main file in the Unicode Character Database.</li>
107 <li>For detailed information on the format, see <a href="UnicodeData.html">UnicodeData.html</a>.
108 This file also characterizes which properties are normative and which are
109 informative.</li>
110</ul>
111<p><b>PropList.txt (Chapter 4)</b>
112<ul>
113 <li>Additional informative properties list: <i>Alphabetic, Ideographic,</i>
114 and <i>Mathematical</i>, among others.</li>
115</ul>
116<p><b>SpecialCasing.txt (Chapter 4, <a
117href="http://www.unicode.org/unicode/reports/tr21/">UTR #21: Case Mappings</a>)</b>
118<ul>
119 <li>List of informative special casing properties, including one-to-many
120 mappings such as SHARP S =&gt; &quot;SS&quot;, and locale-specific mappings,
121 such as for Turkish <i>dotless i</i>.</li>
122</ul>
123<p><b>Blocks.txt (Chapter 14)</b>
124<ul>
125 <li>List of normative block names.</li>
126</ul>
127<p><b>Jamo.txt (Chapter 4)</b>
128<ul>
129 <li>List of normative Jamo short names, used in deriving HANGUL SYLLABLE names
130 algorithmically.</li>
131</ul>
132<p><b>ArabicShaping.txt (Section 8.2)</b>
133<ul>
134 <li>Basic Arabic and Syriac character shaping properties, such as initial,
135 medial and final shapes. These properties are normative for minimal shaping
136 of Arabic and Syriac.</li>
137</ul>
138<p><b>NamesList.txt (Chapter 14)</b>
139<ul>
140 <li>This file duplicates some of the material in the UnicodeData file, and
141 adds informative annotations uses in the character charts, as printed in the
142 Unicode Standard.</li>
143 <li><b>Note: </b>The information in NamesList.txt and Index.txt files matches
144 the appropriate version of the book. Changes in the Unicode Character
145 Database since then may not be reflected in these files, since they are
146 primarily of archival interest.</li>
147</ul>
148<p><b>Index.txt (Chapter 14)</b>
149<ul>
150 <li>Informative index to Unicode characters, as printed in the Unicode
151 Standard</li>
152 <li><b>Note: </b>The information in NamesList.txt and Index.txt files matches
153 the appropriate version of the book. Changes in the Unicode Character
154 Database since then may not be reflected in these files, since they are
155 primarily of archival interest.</li>
156</ul>
157<p><b>CompositionExclusions.txt (<a
158href="http://www.unicode.org/unicode/reports/tr15/">UAX #15 Unicode Normalization
159Forms</a>)</b>
160<ul>
161 <li>Normative properties for normalization.</li>
162</ul>
163<p><b>LineBreak.txt (<a href="http://www.unicode.org/unicode/reports/tr14/">UAX
164#14: Line Breaking Properties</a>)</b>
165<ul>
166 <li>Normative and informative properties for line breaking. To see which
167 properties are informative and which are normative, consult UAX #14.</li>
168</ul>
169<p><b>EastAsianWidth.txt (<a href="http://www.unicode.org/unicode/reports/tr11/">UAX
170#11: East Asian Character Width</a>)</b>
171<ul>
172 <li>Informative properties for determining the choice of wide vs. narrow
173 glyphs in East Asian contexts.</li>
174</ul>
175<p><b>BidiMirroring.txt</b><b> (<a
176href="http://www.unicode.org/unicode/reports/tr9/">UAX #9:&nbsp;The
177Bidirectional Algorithm</a>)</b></p>
178<ul>
179 <li>Informative properties for substituting characters in an implementation of
180 bidirectional mirroring.</li>
181</ul>
182<p><b>CaseFolding.txt (<a href="http://www.unicode.org/unicode/reports/tr21/">UTR
183#21: Case Mappings</a>)</b></p>
184<ul>
185 <li>Informative file mapping characters to their case-folded form.</li>
186</ul>
187<p><b>NormalizationTest.txt (<a
188href="http://www.unicode.org/unicode/reports/tr15/">UAX #15 Unicode Normalization
189Forms</a>)</b></p>
190<ul>
191 <li>Normative test file for conformance to Unicode Normalization Forms.</li>
192</ul>
193<p><b>diffXvY.txt</b>
194<ul>
195 <li>Mechanically-generated informative files containing accumulated
196 differences between successive versions of UnicodeData.txt</li>
197</ul>
198
199</body>
200
201</html>