1 # PropertyValueAliases-3.2.0.txt
2 # Date: 2002-03-19,23:31:21 GMT [MD]
4 # This file contains aliases for property values used in the UCD.
5 # These names can be used for XML formats of UCD data, for regular-expression
6 # property tests, and other programmatic textual descriptions of Unicode data.
7 # The names are not normative, except where they correspond to normative property
8 # values in the UCD. For information on which properties are normative, see
9 # UnicodeCharacterDatabase.html.
11 # The names may be translated in appropriate environments, and additional
12 # aliases may be useful.
16 # Each line describes a property value name.
17 # This consists of three fields, separated by semicolons.
19 # First Field: The first field describes the property for which that
20 # property value name is used.
21 # There is one special pseudo-property: "qc" stands for any quick-check property
23 # Second Field: The second field is an abbreviated name.
24 # If there is no abbreviated name available, the field is marked with "n/a".
26 # Third Field: The third field is a long name.
28 # In the case of ccc, their are 4 fields. The second field is numeric, third
29 # is abbreviated, and fourth is long.
31 # With loose matching of property names, the case distinctions, whitespace,
32 # and '_' are ignored.
34 # NOTE: The Block property values are in Blocks.txt, and not repeated here.
35 # For more information on the use of blocks, see UTR #24: Regular Expression Guidelines
37 # NOTE: Currently there is at most one abbreviated name and one long name for
38 # property value. However, in the future additional aliases
39 # may be added. In such a case, the first line for the property value
40 # would have the preferred alias for output.
42 # NOTE: The property value names are NOT unique across properties, especially
43 # with loose matches. For example,
44 # AL means Arabic Letter for the Bidi_Class property, and
45 # AL means Alpha_Left for the Combining_Class property, and
46 # AL means Alphabetic for the Line_Break property.
48 # In addition, some property names may be the same as some property value names:
49 # cc means Combining_Class property, and
50 # cc means the General_Category property value Control (cc)
52 # The combination of property value and property name is, however, unique.
53 # For more information, see UTR #24: Regular Expression Guidelines
54 # ================================================
57 bc ; AL ; Arabic_Letter
58 bc ; AN ; Arabic_Number
59 bc ; B ; Paragraph_Separator
60 bc ; BN ; Boundary_Neutral
61 bc ; CS ; Common_Separator
62 bc ; EN ; European_Number
63 bc ; ES ; European_Separator
64 bc ; ET ; European_Terminator
65 bc ; L ; Left_To_Right
66 bc ; LRE ; Left_To_Right_Embedding
67 bc ; LRO ; Left_To_Right_Override
68 bc ; NSM ; Nonspacing_Mark
69 bc ; ON ; Other_Neutral
70 bc ; PDF ; Pop_Directional_Format
71 bc ; R ; Right_To_Left
72 bc ; RLE ; Right_To_Left_Embedding
73 bc ; RLO ; Right_To_Left_Override
74 bc ; S ; Segment_Separator
77 ccc; 0; NR ; Not_Reordered
79 ccc; 202; ATBL ; Attached_Below_Left
80 ccc; 216; ATAR ; Attached_Above_Right
81 ccc; 218; BL ; Below_Left
83 ccc; 222; BR ; Below_Right
86 ccc; 228; AL ; Above_Left
88 ccc; 232; AR ; Above_Right
89 ccc; 233; DB ; Double_Below
90 ccc; 234; DA ; Double_Above
91 ccc; 240; IS ; Iota_Subscript
93 ccc; 8; KV ; Kana_Voicing
122 gc ; C ; Other # Cc | Cf | Cn | Co | Cs
126 gc ; Co ; Private_Use
128 gc ; L ; Letter # Ll | Lm | Lo | Lt | Lu
129 gc ; LC ; Cased_Letter # Ll | Lt | Lu
130 gc ; Ll ; Lowercase_Letter
131 gc ; Lm ; Modifier_Letter
132 gc ; Lo ; Other_Letter
133 gc ; Lt ; Titlecase_Letter
134 gc ; Lu ; Uppercase_Letter
135 gc ; M ; Mark # Mc | Me | Mn
136 gc ; Mc ; Spacing_Mark
137 gc ; Me ; Enclosing_Mark
138 gc ; Mn ; Nonspacing_Mark
139 gc ; N ; Number # Nd | Nl | No
140 gc ; Nd ; Decimal_Number
141 gc ; Nl ; Letter_Number
142 gc ; No ; Other_Number
143 gc ; P ; Punctuation # Pc | Pd | Pe | Pf | Pi | Po | Ps
144 gc ; Pc ; Connector_Punctuation
145 gc ; Pd ; Dash_Punctuation
146 gc ; Pe ; Close_Punctuation
147 gc ; Pf ; Final_Punctuation
148 gc ; Pi ; Initial_Punctuation
149 gc ; Po ; Other_Punctuation
150 gc ; Ps ; Open_Punctuation
151 gc ; S ; Symbol # Sc | Sk | Sm | So
152 gc ; Sc ; Currency_Symbol
153 gc ; Sk ; Modifier_Symbol
154 gc ; Sm ; Math_Symbol
155 gc ; So ; Other_Symbol
156 gc ; Z ; Separator # Zl | Zp | Zs
157 gc ; Zl ; Line_Separator
158 gc ; Zp ; Paragraph_Separator
159 gc ; Zs ; Space_Separator
167 jg ; n/a ; DALATH_RISH
170 jg ; n/a ; FINAL_SEMKATH
174 jg ; n/a ; HAMZA_ON_HEH_GOAL
181 jg ; n/a ; KNOTTED_HEH
186 jg ; n/a ; NO_JOINING_GROUP
193 jg ; n/a ; REVERSED_PE
200 jg ; n/a ; SYRIAC_WAW
203 jg ; n/a ; TEH_MARBUTA
207 jg ; n/a ; YEH_BARREE
208 jg ; n/a ; YEH_WITH_TAIL
213 jt ; C ; Join_Causing
214 jt ; D ; Dual_Joining
215 jt ; L ; Left_Joining
216 jt ; R ; Right_Joining
223 lb ; BA ; Break_After
224 lb ; BB ; Break_Before
225 lb ; BK ; Mandatory_Break
226 lb ; CB ; Contingent_Break
227 lb ; CL ; Close_Punctuation
228 lb ; CM ; Combining_Mark
229 lb ; CR ; Carriage_Return
230 lb ; EX ; Exclamation
233 lb ; ID ; Ideographic
234 lb ; IN ; Inseperable
235 lb ; IS ; Infix_Numeric
239 lb ; OP ; Open_Punctuation
240 lb ; PO ; Postfix_Numeric
241 lb ; PR ; Prefix_Numeric
243 lb ; SA ; Complex_Context
246 lb ; SY ; Break_Symbols
264 sc ; Cans ; Canadian_Aboriginal
267 sc ; Deva ; Devanagari
280 sc ; Ital ; Old_Italic
286 sc ; Mlym ; Malayalam
287 sc ; Mong ; Mongolian
291 sc ; Qaai ; Inherited