author: Nicholas Clark <nick@ccl4.org> 2006-04-18 13:18:27 +0000
committer: Nicholas Clark <nick@ccl4.org> 2006-04-18 13:18:27 +0000
commit: 16fc99ce68b5ddd31da4fb77619f09a2db75bf97
tree: 40380e64d9d0a1fd8870d9985fadb71abd686cbf
parent: e3e6d68743ed761cb0544e3d813089f7cf291cd8
Note that Rafael has done the first cut of state variables.
Document that UTF-8 cleansing the tokeniser and the pads is TODO.

p4raw-id: //depot/perl@27886
-rw-r--r-- pod/perltodo.pod 9
1 file changed, 9 insertions, 0 deletions
diff --git a/pod/perltodo.pod b/pod/perltodo.pod
index 15995bc24a..3ff99421d3 100644
--- a/pod/perltodo.pod
+++ b/pod/perltodo.pod
@@ -532,6 +532,7 @@ Study the possibility of adding a new prototype character, C<_>, meaning
C<my $foo if 0;> is deprecated, and should be replaced with
C<state $x = "initial value\n";> the syntax from Perl 6.
+Rafael has sent a first cut patch to perl5-porters.
=head2 regexp optimiser optional
@@ -590,6 +591,14 @@ investigate what hooks would need to be added to make it possible to provide
the full assertion support from a CPAN module, so that we aren't constraining
the imagination of future CPAN authors.
+=head2 Properly Unicode safe tokeniser and pads.
+
+The tokeniser isn't actually very UTF-8 clean. C<use utf8;> is a hack -
+variable names are stored in stashes as raw bytes, without the utf-8 flag
+set. The pad API only takes a C<char *> pointer, so that's all bytes too. The
+tokeniser ignores the UTF-8-ness of C<PL_rsfp>, or any SVs returned from
+source filters. All this could be fixed.
+