From: Ævar Arnfjörð Bjarmason Date: Fri, 6 Apr 2007 21:41:57 +0000 (+0000) Subject: Re: [PATCH] perlreguts.pod: use the unicode name for ß and show the codepoint X-Git-Url: http://git.shadowcat.co.uk/gitweb/gitweb.cgi?a=commitdiff_plain;h=d38f6844f6115251e4fe0705e9b721f08bebf22b;p=p5sagit%2Fp5-mst-13.2.git Re: [PATCH] perlreguts.pod: use the unicode name for ß and show the codepoint Message-ID: <51dd1af80704061441v4b972257ta4c95230bdbc47c5@mail.gmail.com> p4raw-id: //depot/perl@30920 --- diff --git a/pod/perlreguts.pod b/pod/perlreguts.pod index c61a9cf..577f672 100644 --- a/pod/perlreguts.pod +++ b/pod/perlreguts.pod @@ -694,8 +694,8 @@ Unicode. For instance, in ASCII, it is safe to assume that C, but in UTF-8 it isn't. Unicode case folding is vastly more complex than the simple rules of ASCII, and even when not using Unicode but only localised single byte encodings, things can get -tricky (for example, GERMAN-SHARP-ESS should match 'SS' in localised -case-insensitive matching). +tricky (for example, B (U+00DF, E) +should match 'SS' in localised case-insensitive matching). Making things worse is that UTF-8 support was a later addition to the regex engine (as it was to perl) and this necessarily made things a lot