author | Karl Williamson <khw@cpan.org> | 2016-01-05 16:12:55 -0700 |
---|---|---|
committer | Karl Williamson <khw@cpan.org> | 2016-01-08 14:17:11 -0700 |
commit | f1f6961f5a6fd77a3e3c36f242f1b72ce5dfe205 (patch) | |
tree | 52365bdb2759341217eb979be04a61f5b351eb2f /regcharclass.h | |
parent | cbdbe9d466e0d26852ca1ace0825220c8ca7d215 (diff) | |
Tailor \b{wb} for Perl

The Unicode \b{wb} matches the boundary between space characters in a
span of them. This is the opposite of what \b does, and is
counterintuitive to Perl expectations. This commit tailors \b{wb} so
that it does not split up spans of white space.

I have submitted a request to Unicode to re-examine their algorithm, and
this has been assigned to a subcommittee to look at, but the result
won't be available until after 5.24 is done. In any event, Unicode
encourages tailoring for local conditions.
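
The effect described in the message can be pictured with a small toy model. The sketch below is written in Python and is not Perl's implementation or the Unicode word-break algorithm. It only mimics the one behaviour at issue: whether a run of white space is cut into one segment per character or kept as a single segment. The function names `segments_untailored` and `segments_tailored` are hypothetical, and the regular expressions are a deliberate simplification.

```python
import re


def segments_untailored(text):
    # Toy model of the default behaviour described above: every white-space
    # character is its own segment, so a run of spaces is split up.
    return re.findall(r"\w+|\s|[^\w\s]", text)


def segments_tailored(text):
    # Toy model of the tailoring: a whole run of white space stays together
    # as one segment.
    return re.findall(r"\w+|\s+|[^\w\s]", text)


sample = "hello   world"
print(segments_untailored(sample))  # ['hello', ' ', ' ', ' ', 'world']
print(segments_tailored(sample))    # ['hello', '   ', 'world']
```

In the tailored variant the three spaces come back as one element, which is the behaviour the commit message says \b{wb} should have after this change.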
Diffstat (limited to 'regcharclass.h')
-rw-r--r-- | regcharclass.h | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/regcharclass.h b/regcharclass.h
index 7420800873..1402044dbf 100644
--- a/regcharclass.h
+++ b/regcharclass.h
@@ -2514,7 +2514,7 @@
  * 1a0687fb9c6c4567e853913549df0944fe40821279a3e9cdaa6ab8679bc286fd lib/unicore/extracted/DLineBreak.txt
  * 40bcfed3ca727c19e1331f6c33806231d5f7eeeabd2e6a9e06a3740c85d0c250 lib/unicore/extracted/DNumType.txt
  * a18d502bad39d527ac5586d7bc93e29f565859e3bcc24ada627eff606d6f5fed lib/unicore/extracted/DNumValues.txt
- * 21f614a12bfde0478588228d46f1b594bf7e23c7d1f51492c70b13f7c9b8de09 lib/unicore/mktables
+ * 2b18fcfeafc8e8a26ff1124ad4ca94020f287bc4651be7ea199d69ecd5dcf9c5 lib/unicore/mktables
  * 462c9aaa608fb2014cd9649af1c5c009485c60b9c8b15b89401fdc10cf6161c6 lib/unicore/version
  * 996abda3c0fbc2bfd575092af09e3b9b0331e624eb2e969a268457f8fd31ecbb regen/charset_translations.pl
  * d9c04ac46bdd81bb3e26519f2b8eb6242cb12337205add3f7cf092b0c58dccc4 regen/regcharclass.pl