diff options
author | Karl Williamson <khw@cpan.org> | 2015-09-04 11:32:26 -0600 |
---|---|---|
committer | Karl Williamson <khw@cpan.org> | 2015-09-08 11:35:03 -0600 |
commit | f424037916111e114a9c424abacc210aac685dff (patch) | |
tree | bbdc980dfb7bf663725e9233d1e7ca0f03a19250 /pod/perlhacktips.pod | |
parent | 992001bfb28aa89a918dfb566d0413ea40d9b0f5 (diff) | |
download | perl-f424037916111e114a9c424abacc210aac685dff.tar.gz |
Refactor tr/// parsing to work on EBCDIC, fix other bug
This expands the concept introduced for regular expressions in v5.22 of
a portable range, to the transliteration operators. A portable range
has at least one endpoint expressed as \N{} that indicates that the
Unicode definition is desired, or has the endpoints expressed as both
uppercase ASCII alphabetic letters or both lowercase ASCII alphabetics.
The refactor fixes several EBCDIC problems, and it fixes the problem in
all platforms wherein the first endpoint of a range was not checked to
be <= the final endpoint in UTF-8 strings.
There remains a bug in which if any transliterated code point is larger
than IV_MAX, perl loops.
Diffstat (limited to 'pod/perlhacktips.pod')
-rw-r--r-- | pod/perlhacktips.pod | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/pod/perlhacktips.pod b/pod/perlhacktips.pod index 498a58dde1..5a3fb25db8 100644 --- a/pod/perlhacktips.pod +++ b/pod/perlhacktips.pod @@ -299,7 +299,7 @@ Also, the range 'A' - 'Z' in ASCII is an unbroken sequence of 26 upper case alphabetic characters. That is not true in EBCDIC. Nor for 'a' to 'z'. But '0' - '9' is an unbroken range in both systems. Don't assume anything about other ranges. (Note that special handling of ranges in -regular expression patterns makes it appear to Perl +regular expression patterns and transliterations makes it appear to Perl code that the aforementioned ranges are all unbroken.) Many of the comments in the existing code ignore the possibility of |