diff options
author | Nicholas Clark <nick@ccl4.org> | 2006-04-18 13:18:27 +0000 |
---|---|---|
committer | Nicholas Clark <nick@ccl4.org> | 2006-04-18 13:18:27 +0000 |
commit | 16fc99ce68b5ddd31da4fb77619f09a2db75bf97 (patch) | |
tree | 40380e64d9d0a1fd8870d9985fadb71abd686cbf | |
parent | e3e6d68743ed761cb0544e3d813089f7cf291cd8 (diff) | |
download | perl-16fc99ce68b5ddd31da4fb77619f09a2db75bf97.tar.gz |
Note that Rafael has done the first cut of state variables.
Document that UTF-8 cleansing the tokeniser and the pads is TODO.
p4raw-id: //depot/perl@27886
-rw-r--r-- | pod/perltodo.pod | 9 |
1 files changed, 9 insertions, 0 deletions
diff --git a/pod/perltodo.pod b/pod/perltodo.pod index 15995bc24a..3ff99421d3 100644 --- a/pod/perltodo.pod +++ b/pod/perltodo.pod @@ -532,6 +532,7 @@ Study the possibility of adding a new prototype character, C<_>, meaning C<my $foo if 0;> is deprecated, and should be replaced with C<state $x = "initial value\n";> the syntax from Perl 6. +Rafael has sent a first cut patch to perl5-porters. =head2 regexp optimiser optional @@ -590,6 +591,14 @@ investigate what hooks would need to be added to make it possible to provide the full assertion support from a CPAN module, so that we aren't constraining the imagination of future CPAN authors. +=head2 Properly Unicode safe tokeniser and pads. + +The tokeniser isn't actually very UTF-8 clean. C<use utf8;> is a hack - +variable names are stored in stashes as raw bytes, without the utf-8 flag +set. The pad API only takes a C<char *> pointer, so that's all bytes too. The +tokeniser ignores the UTF-8-ness of C<PL_rsfp>, or any SVs returned from +source filters. All this could be fixed. + |