(mostly (Unicode)) pod nits

Message-Id: <20020906030157.GA28252@lyta.hut.fi> p4raw-id: //depot/perl@17850
author: Jarkko Hietaniemi <jhi@iki.fi> 2002-09-06 09:01:57 +0300
committer: Abhijit Menon-Sen <ams@wiw.org> 2002-09-06 03:31:32 +0000
commit: 63de3cb284beb0325229608ff63562933eba8f50 (patch)
tree: 1e4b9176d04cfd4cb024e1df6bc3fd9a3bea8047 /pod/perlunicode.pod
parent: 83d057904fcf43ccbeee0b8e23d13ba528a6cb6a (diff)
download: perl-63de3cb284beb0325229608ff63562933eba8f50.tar.gz
1 files changed, 15 insertions, 22 deletions
diff --git a/pod/perlunicode.pod b/pod/perlunicode.pod
index 8489702fd5..49f7432b9a 100644
--- a/pod/perlunicode.pod
+++ b/pod/perlunicode.pod
@@ -598,17 +598,8 @@ than one Unicode character.
 
 =back
 
-The following cases do not yet work:
-
-=over 8
-
-=item *
-
-the "final sigma" (Greek), and
-
-=item *
-
-anything to with locales (Lithuanian, Turkish, Azeri).
+Things to do with locales (Lithuanian, Turkish, Azeri) do B<not> work
+since Perl does not understand the concept of Unicode locales.
 
 =back
 
@@ -771,17 +762,19 @@ which will match assigned characters known to be part of the Greek script.
 
 Level 2 - Extended Unicode Support
 
-        3.1 Surrogates                          - MISSING
-        3.2 Canonical Equivalents               - MISSING       [11][12]
-        3.3 Locale-Independent Graphemes        - MISSING       [13]
-        3.4 Locale-Independent Words            - MISSING       [14]
-        3.5 Locale-Independent Loose Matches    - MISSING       [15]
-
-        [11] see UTR#15 Unicode Normalization
-        [12] have Unicode::Normalize but not integrated to regexes
-        [13] have \X but at this level . should equal that
-        [14] need three classes, not just \w and \W
-        [15] see UTR#21 Case Mappings
+        3.1 Surrogates                          - MISSING	[11]
+        3.2 Canonical Equivalents               - MISSING       [12][13]
+        3.3 Locale-Independent Graphemes        - MISSING       [14]
+        3.4 Locale-Independent Words            - MISSING       [15]
+        3.5 Locale-Independent Loose Matches    - MISSING       [16]
+
+        [11] Surrogates are solely a UTF-16 concept and Perl's internal
+             representation is UTF-8.  The Encode module does UTF-16, though.
+        [12] see UTR#15 Unicode Normalization
+        [13] have Unicode::Normalize but not integrated to regexes
+        [14] have \X but at this level . should equal that
+        [15] need three classes, not just \w and \W
+        [16] see UTR#21 Case Mappings
 
 =item *
author	Jarkko Hietaniemi <jhi@iki.fi>	2002-09-06 09:01:57 +0300
committer	Abhijit Menon-Sen <ams@wiw.org>	2002-09-06 03:31:32 +0000
commit	63de3cb284beb0325229608ff63562933eba8f50 (patch)
tree	1e4b9176d04cfd4cb024e1df6bc3fd9a3bea8047 /pod/perlunicode.pod
parent	83d057904fcf43ccbeee0b8e23d13ba528a6cb6a (diff)
download	perl-63de3cb284beb0325229608ff63562933eba8f50.tar.gz