From: Jarkko Hietaniemi Date: Sat, 14 Jun 2003 13:45:01 +0000 (+0000) Subject: Mention the Unicode::Regex::Set module. X-Git-Url: http://git.shadowcat.co.uk/gitweb/gitweb.cgi?a=commitdiff_plain;h=5ca1ac52233afde3fa5135257b2e37cba75b1c11;p=p5sagit%2Fp5-mst-13.2.git Mention the Unicode::Regex::Set module. p4raw-id: //depot/perl@19782 --- diff --git a/pod/perlunicode.pod b/pod/perlunicode.pod index 4508de7..91bb0f8 100644 --- a/pod/perlunicode.pod +++ b/pod/perlunicode.pod @@ -780,13 +780,13 @@ Level 1 - Basic Unicode Support capital letters with certain modifiers: the Full case-folding decomposes the letter, while the Simple case-folding would map it to a single character. - [ 9] see UTR#13 Unicode Newline Guidelines + [ 9] see UTR #13 Unicode Newline Guidelines [10] should do ^ and $ also on \x{85}, \x{2028} and \x{2029} (should also affect <>, $., and script line numbers) (the \x{85}, \x{2028} and \x{2029} do match \s) [a] You can mimic class subtraction using lookahead. -For example, what TR18 might write as +For example, what UTR #18 might write as [{Greek}-[{UNASSIGNED}]] @@ -801,6 +801,9 @@ But in this particular example, you probably really want which will match assigned characters known to be part of the Greek script. +Also see the Unicode::Regex::Set module, it does implement the full +UTR #18 grouping, intersection, union, and removal (subtraction) syntax. + [b] See L. =item *