author | Karl Williamson <khw@cpan.org> | 2018-07-01 19:23:35 -0600 |
---|---|---|
committer | Karl Williamson <khw@cpan.org> | 2018-07-05 14:47:18 -0600 |
commit | 6c12993c67cedb7be68d2e7c7f52fbd9a39b92c8 (patch) | |
tree | 65bf0d61393e5122fa3f31d2fc8b24c3c6e420a1 /regen/regcharclass.pl | |
parent | 123deead4a6c7a01eafe0fb40c1b820f44301931 (diff) | |
download | perl-6c12993c67cedb7be68d2e7c7f52fbd9a39b92c8.tar.gz |
Use strict dfa to translate from UTF-8 to code point
With this commit, if a sequence passes the dfa, the result can be
returned immediately. Previously, some rare, potentially problematic
sequences could pass, and these needed further checking, so that
checking had to be done every time. This speeds up the general case.
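To illustrate why strict validation lets the decoder return at once, here is a minimal Python sketch. It is not the table-driven dfa this commit refers to: it is a hand-written range check, and the name `decode_strict` is hypothetical. It assumes that "strict" means at least rejecting overlong forms, surrogates, and code points above 0x10FFFF; the exact set rejected by Perl's dfa is not stated in the commit message.

```python
def decode_strict(buf: bytes, pos: int = 0):
    """Decode one UTF-8 sequence starting at buf[pos].

    Returns (code_point, length) on success, or None if the sequence is
    rejected. Hypothetical helper for illustration only.
    """
    if pos >= len(buf):
        return None
    b0 = buf[pos]
    if b0 < 0x80:                      # ASCII: one byte, always acceptable
        return b0, 1

    # For each lead byte, pick the number of continuation bytes and the
    # allowed range of the FIRST continuation byte. Narrowing that range
    # is what excludes overlongs, surrogates and values above 0x10FFFF.
    if 0xC2 <= b0 <= 0xDF:
        need, cp, lo, hi = 1, b0 & 0x1F, 0x80, 0xBF
    elif 0xE0 <= b0 <= 0xEF:
        need, cp = 2, b0 & 0x0F
        lo = 0xA0 if b0 == 0xE0 else 0x80   # E0 80..9F would be overlong
        hi = 0x9F if b0 == 0xED else 0xBF   # ED A0..BF would be a surrogate
    elif 0xF0 <= b0 <= 0xF4:
        need, cp = 3, b0 & 0x07
        lo = 0x90 if b0 == 0xF0 else 0x80   # F0 80..8F would be overlong
        hi = 0x8F if b0 == 0xF4 else 0xBF   # F4 90..BF would exceed 0x10FFFF
    else:
        return None                    # C0, C1, F5..FF, or a stray continuation

    if pos + need >= len(buf):         # sequence is truncated
        return None

    for i in range(1, need + 1):
        b = buf[pos + i]
        if not lo <= b <= hi:
            return None
        cp = (cp << 6) | (b & 0x3F)
        lo, hi = 0x80, 0xBF            # later continuation bytes use the full range

    # Everything that reaches this point is already known to be acceptable,
    # so no follow-up check is needed before returning the code point.
    return cp, need + 1
```

Because every problematic form is refused during the scan itself, the success path ends with a plain `return`. A more permissive validator would have to re-inspect each decoded value afterwards, which is the extra work the commit message describes removing.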
Diffstat (limited to 'regen/regcharclass.pl')
-rwxr-xr-x | regen/regcharclass.pl | 4 |
1 files changed, 4 insertions, 0 deletions
diff --git a/regen/regcharclass.pl b/regen/regcharclass.pl
index b837152a55..4884d1abf1 100755
--- a/regen/regcharclass.pl
+++ b/regen/regcharclass.pl
@@ -1736,3 +1736,7 @@ PROBLEMATIC_LOCALE_FOLDEDS_START : The first folded character of folds which are
 PATWS: pattern white space
 => generic cp : safe
 \p{_Perl_PatWS}
+
+HANGUL_ED: Hangul syllables whose first character is \xED
+=> UTF8 :only_ascii_platform safe
+0xD000 - 0xD7FF
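
The range in the new HANGUL_ED entry can be checked independently. The short Python sketch below (the helper name `utf8_byte_sets` is hypothetical) encodes every code point in 0xD000 - 0xD7FF and collects the first and second bytes that occur. It relies only on standard UTF-8 encoding on an ASCII platform, which matches the `:only_ascii_platform` flag in the entry.

```python
def utf8_byte_sets(first_cp: int, last_cp: int):
    """Return (set of first bytes, set of second bytes) over a code point range."""
    firsts, seconds = set(), set()
    for cp in range(first_cp, last_cp + 1):
        encoded = chr(cp).encode("utf-8")
        firsts.add(encoded[0])
        seconds.add(encoded[1])
    return firsts, seconds


firsts, seconds = utf8_byte_sets(0xD000, 0xD7FF)

# Every code point in the range starts with the byte 0xED ...
assert firsts == {0xED}
# ... and its second byte stays within 0x80..0x9F.
assert (min(seconds), max(seconds)) == (0x80, 0x9F)

# The code point just below the range uses a different first byte.
assert chr(0xCFFF).encode("utf-8")[0] == 0xEC
```

The remaining second-byte values after 0xED, namely 0xA0 through 0xBF, are where the surrogates 0xD800 - 0xDFFF would fall, so 0xD7FF is the last code point with this first byte that well-formed UTF-8 can carry.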