From: Nicholas Clark Date: Tue, 18 Apr 2006 13:18:27 +0000 (+0000) Subject: Note that Rafael has done the first cut of state variables. X-Git-Url: http://git.shadowcat.co.uk/gitweb/gitweb.cgi?a=commitdiff_plain;h=16fc99ce68b5ddd31da4fb77619f09a2db75bf97;p=p5sagit%2Fp5-mst-13.2.git Note that Rafael has done the first cut of state variables. Document that UTF-8 cleansing the tokeniser and the pads is TODO. p4raw-id: //depot/perl@27886 --- diff --git a/pod/perltodo.pod b/pod/perltodo.pod index 15995bc..3ff9942 100644 --- a/pod/perltodo.pod +++ b/pod/perltodo.pod @@ -532,6 +532,7 @@ Study the possibility of adding a new prototype character, C<_>, meaning C is deprecated, and should be replaced with C the syntax from Perl 6. +Rafael has sent a first cut patch to perl5-porters. =head2 regexp optimiser optional @@ -590,6 +591,14 @@ investigate what hooks would need to be added to make it possible to provide the full assertion support from a CPAN module, so that we aren't constraining the imagination of future CPAN authors. +=head2 Properly Unicode safe tokeniser and pads. + +The tokeniser isn't actually very UTF-8 clean. C is a hack - +variable names are stored in stashes as raw bytes, without the utf-8 flag +set. The pad API only takes a C pointer, so that's all bytes too. The +tokeniser ignores the UTF-8-ness of C, or any SVs returned from +source filters. All this could be fixed. +