From: Jarkko Hietaniemi Date: Tue, 15 Jan 2002 02:14:29 +0000 (+0000) Subject: The Unicode categories doc patch to go with #14254, X-Git-Url: http://git.shadowcat.co.uk/gitweb/gitweb.cgi?a=commitdiff_plain;h=eb0cc9e3552a0fa3abde76a3fd73dea2d3b4e730;p=p5sagit%2Fp5-mst-13.2.git The Unicode categories doc patch to go with #14254, from Jeffrey. p4raw-id: //depot/perl@14263 --- diff --git a/lib/Unicode/UCD.pm b/lib/Unicode/UCD.pm index ff9cc8f..3f8b896 100644 --- a/lib/Unicode/UCD.pm +++ b/lib/Unicode/UCD.pm @@ -108,7 +108,7 @@ as defined by the Unicode standard: title titlecase equivalent mapping block block the character belongs to (used in \p{In...}) - script script the character belongs to + script script the character belongs to If no match is found, a reference to an empty hash is returned. @@ -280,13 +280,12 @@ positions within all blocks are defined. See also L. -If supplied with an argument that can't be a code point, charblock() -tries to do the opposite and interpret the argument as a character -block. The return value is a I: an anonymous list that -contains anonymous lists, which in turn contain I, -I code point pairs. You can test whether a code point -is in a range using the L function. If the argument is -not a known charater block, C is returned. +If supplied with an argument that can't be a code point, charblock() tries +to do the opposite and interpret the argument as a character block. The +return value is a I: an anonymous list of lists that contain +I, I code point pairs. You can test whether a +code point is in a range using the L function. If the +argument is not a known charater block, C is returned. =cut @@ -342,13 +341,12 @@ character belongs to, e.g. C, C, C. See also L. -If supplied with an argument that can't be a code point, charscript() -tries to do the opposite and interpret the argument as a character -script. The return value is a I: an anonymous list that -contains anonymous lists, which in turn contain I, -I code point pairs. You can test whether a code point -is in a range using the L function. If the argument is -not a known charater script, C is returned. +If supplied with an argument that can't be a code point, charscript() tries +to do the opposite and interpret the argument as a character script. The +return value is a I: an anonymous list of lists that contain +I, I code point pairs. You can test whether a +code point is in a range using the L function. If the +argument is not a known charater script, C is returned. =cut @@ -433,13 +431,13 @@ sub charscripts { The difference between a block and a script is that scripts are closer to the linguistic notion of a set of characters required to present languages, while block is more of an artifact of the Unicode character -numbering and separation into blocks of 256 characters. +numbering and separation into blocks of (mostly) 256 characters. For example the Latin B