RT #130907: Fix the Unicode Bug in split " "

author: Aaron Crane <arc@cpan.org> 2017-03-04 12:50:58 +0000
committer: Aaron Crane <arc@cpan.org> 2017-07-15 19:35:43 +0100
commit: 20ae58f7a9bbf84d043d6e90f5988b6e3ca4ee3d (patch)
tree: 9cc42614061f228b5e0ba9ff22dc8b79f096595d /pod/perlunicode.pod
parent: 36f05cd79a2d07c67e7a1ce57cb3d0a1f632bee7 (diff)
download: perl-20ae58f7a9bbf84d043d6e90f5988b6e3ca4ee3d.tar.gz
1 files changed, 11 insertions, 0 deletions
diff --git a/pod/perlunicode.pod b/pod/perlunicode.pod
index 43f145918b..ef02b0a1f5 100644
--- a/pod/perlunicode.pod
+++ b/pod/perlunicode.pod
@@ -1836,6 +1836,17 @@ outside its scope, it could produce strings whose length in characters
 exceeded that of the right-hand side, where the right-hand side took up more
 bytes than the correct range endpoint.
 
+=item *
+
+In L<< C<split>'s special-case whitespace splitting|perlfunc/split >>.
+
+Starting in Perl 5.28.0, the C<split> function with a pattern specified as
+a string containing a single space handles whitespace characters consistently
+within the scope of of C<unicode_strings>. Prior to that, or outside its scope,
+characters that are whitespace according to Unicode rules but not according to
+ASCII rules were treated as field contents rather than field separators when
+they appear in byte-encoded strings.
+
 =back
 
 You can see from the above that the effect of C<unicode_strings>
author	Aaron Crane <arc@cpan.org>	2017-03-04 12:50:58 +0000
committer	Aaron Crane <arc@cpan.org>	2017-07-15 19:35:43 +0100
commit	20ae58f7a9bbf84d043d6e90f5988b6e3ca4ee3d (patch)
tree	9cc42614061f228b5e0ba9ff22dc8b79f096595d /pod/perlunicode.pod
parent	36f05cd79a2d07c67e7a1ce57cb3d0a1f632bee7 (diff)
download	perl-20ae58f7a9bbf84d043d6e90f5988b6e3ca4ee3d.tar.gz