X-Git-Url: http://git.shadowcat.co.uk/gitweb/gitweb.cgi?a=blobdiff_plain;f=lib%2Futf8.pm;h=5ddd4ba21a572f0718622e9d632fd9227bbe3821;hb=5dce09b1b12bc4dff9713d5dbb5291ac967ccf63;hp=269a1c2177f423540b327270e5fc30cf358af890;hpb=423cee853811c26846bd1948939b85f9866dfb4a;p=p5sagit%2Fp5-mst-13.2.git diff --git a/lib/utf8.pm b/lib/utf8.pm index 269a1c2..5ddd4ba 100644 --- a/lib/utf8.pm +++ b/lib/utf8.pm @@ -58,7 +58,7 @@ C<\x>. For instance, a Unicode smiley face is C<\x{263A}>. A character in the Latin-1 range (128..255) should be written C<\x{ab}> rather than C<\xab>, since the former will turn into a two-byte UTF-8 code, while the latter will continue to be interpreted as generating a -8-bit byte rather than a character. In fact, if -w is turned on, it will +8-bit byte rather than a character. In fact, if C<-w> is turned on, it will produce a warning that you might be generating invalid UTF-8. =item * @@ -71,8 +71,9 @@ attempt to canonicalize variable names for you.) =item * Regular expressions match characters instead of bytes. For instance, -"." matches a character instead of a byte. (However, the C<\O> pattern -is provided to force a match a single byte ("octet", hence C<\O>).) +"." matches a character instead of a byte. (However, the C<\C> pattern +is provided to force a match a single byte ("C" in C, hence +C<\C>).) =item *