summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorKarl Williamson <khw@cpan.org>2019-02-05 11:30:05 -0700
committerKarl Williamson <khw@cpan.org>2019-02-05 11:44:29 -0700
commit0c8d0e83aeda5c8f4d2b739cf7288a43432b702c (patch)
treeb004ec20c54193129c2cf4dd32715a1df4bb1d57
parent30d8090de81085bd3dff00c83a7ab6d3ff8dfc8d (diff)
downloadperl-0c8d0e83aeda5c8f4d2b739cf7288a43432b702c.tar.gz
Docs for new Turkic UTF-8 locale support
-rw-r--r--pod/perldelta.pod10
-rw-r--r--pod/perllocale.pod11
2 files changed, 18 insertions, 3 deletions
diff --git a/pod/perldelta.pod b/pod/perldelta.pod
index 36f0652dd9..8d33af6a2a 100644
--- a/pod/perldelta.pod
+++ b/pod/perldelta.pod
@@ -27,6 +27,16 @@ here, but most should go in the L</Performance Enhancements> section.
[ List each enhancement as a =head2 entry ]
+=head2 Turkic UTF-8 locales are now seamlessly supported
+
+Turkic languages have different casing rules than other languages for
+the characters C<"i"> and C<"I">. The uppercase of C<"i"> is LATIN
+CAPITAL LETTER I WITH DOT ABOVE (U+0130); and the lowercase of C<"I"> is LATIN
+SMALL LETTER DOTLESS I (U+0131). Unicode furnishes alternate casing
+rules for use with Turkic languages. Previously, Perl ignored these,
+but now, it uses them when it detects that it is operating under a
+Turkic UTF-8 locale.
+
=head1 Security
XXX Any security-related notices go here. In particular, any security
diff --git a/pod/perllocale.pod b/pod/perllocale.pod
index 63f8947f96..e2a73e8484 100644
--- a/pod/perllocale.pod
+++ b/pod/perllocale.pod
@@ -40,9 +40,14 @@ Unicode, encoded in UTF-8. Starting in v5.20, Perl fully supports
UTF-8 locales, except for sorting and string comparisons like C<lt> and
C<ge>. Starting in v5.26, Perl can handle these reasonably as well,
depending on the platform's implementation. However, for earlier
-releases or for better control, use L<Unicode::Collate>. Perl continues to
-support the old non UTF-8 locales as well. There are currently no UTF-8
-locales for EBCDIC platforms.
+releases or for better control, use L<Unicode::Collate>. There are
+actually two slightly different types of UTF-8 locales: one for Turkic
+languages and one for everything else. Starting in Perl v5.30, Perl
+seamlessly handles both types; previously only the non-Turkic one was
+supported.
+
+Perl continues to support the old non UTF-8 locales as well. There are
+currently no UTF-8 locales for EBCDIC platforms.
(Unicode is also creating C<CLDR>, the "Common Locale Data Repository",
L<http://cldr.unicode.org/> which includes more types of information than