X-Git-Url: http://git.shadowcat.co.uk/gitweb/gitweb.cgi?a=blobdiff_plain;f=pod%2Fperlebcdic.pod;h=44ad6b9f0d77349d0aa859b2a8a4f82b5d93d19b;hb=14aaa8fc27b8350048cdee657c0128eb979d0b2a;hp=0053d91a3881b36c3d2b4803c7fbc7ed0642aebe;hpb=64c66fb6d001b6ad9c6dcec93084b647d4c6eb13;p=p5sagit%2Fp5-mst-13.2.git

diff --git a/pod/perlebcdic.pod b/pod/perlebcdic.pod
index 0053d91..44ad6b9 100644
--- a/pod/perlebcdic.pod
+++ b/pod/perlebcdic.pod
@@ -117,6 +117,24 @@ all the possible code points, pack("U",ord("A")) would in EBCDIC
 equal I<A with acute> or chr(101), and unpack("U", "A") would equal
 65, or I<non-breaking space>, not 193, or ord "A".)
 
+=head2 Remaining Perl Unicode problems in EBCDIC
+
+=over 4
+
+=item *
+
+Many of the remaining seem to be related to case-insensitive matching:
+for example, C<< /[\x{131}]/ >> (LATIN SMALL LETTER DOTLESS I) does
+not match "I" case-insensitively, as it should under Unicode.
+(The match succeeds in ASCII-derived platforms.)
+
+=item *
+
+The extensions Unicode::Collate and Unicode::Normalized are not
+supported under EBCDIC, likewise for the encoding pragma.
+
+=back
+
 =head2 Unicode and UTF
 
 UTF is a Unicode Transformation Format.  UTF-8 is a Unicode conforming
@@ -819,7 +837,11 @@ As of perl 5.005_03 the letter range regular expression such as
 [A-Z] and [a-z] have been especially coded to not pick up gap 
 characters.  For example, characters such as E<ocirc> C<o WITH CIRCUMFLEX> 
 that lie between I and J would not be matched by the 
-regular expression range C</[H-K]/>.  
+regular expression range C</[H-K]/>.  This works in
+the other direction, too, if either of the range end points is
+explicitly numeric: C<[\x89-\x91]> will match C<\x8e>, even
+though C<\x89> is C<i> and C<\x91 > is C<j>, and C<\x8e>
+is a gap character from the alphabetic viewpoint.
 
 If you do want to match the alphabet gap characters in a single octet 
 regular expression try matching the hex or octal code such