diff options
author | Jarkko Hietaniemi <jhi@iki.fi> | 2002-09-06 09:01:57 +0300 |
---|---|---|
committer | Abhijit Menon-Sen <ams@wiw.org> | 2002-09-06 03:31:32 +0000 |
commit | 63de3cb284beb0325229608ff63562933eba8f50 (patch) | |
tree | 1e4b9176d04cfd4cb024e1df6bc3fd9a3bea8047 /pod/perlunicode.pod | |
parent | 83d057904fcf43ccbeee0b8e23d13ba528a6cb6a (diff) | |
download | perl-63de3cb284beb0325229608ff63562933eba8f50.tar.gz |
(mostly (Unicode)) pod nits
Message-Id: <20020906030157.GA28252@lyta.hut.fi>
p4raw-id: //depot/perl@17850
Diffstat (limited to 'pod/perlunicode.pod')
-rw-r--r-- | pod/perlunicode.pod | 37 |
1 files changed, 15 insertions, 22 deletions
diff --git a/pod/perlunicode.pod b/pod/perlunicode.pod index 8489702fd5..49f7432b9a 100644 --- a/pod/perlunicode.pod +++ b/pod/perlunicode.pod @@ -598,17 +598,8 @@ than one Unicode character. =back -The following cases do not yet work: - -=over 8 - -=item * - -the "final sigma" (Greek), and - -=item * - -anything to with locales (Lithuanian, Turkish, Azeri). +Things to do with locales (Lithuanian, Turkish, Azeri) do B<not> work +since Perl does not understand the concept of Unicode locales. =back @@ -771,17 +762,19 @@ which will match assigned characters known to be part of the Greek script. Level 2 - Extended Unicode Support - 3.1 Surrogates - MISSING - 3.2 Canonical Equivalents - MISSING [11][12] - 3.3 Locale-Independent Graphemes - MISSING [13] - 3.4 Locale-Independent Words - MISSING [14] - 3.5 Locale-Independent Loose Matches - MISSING [15] - - [11] see UTR#15 Unicode Normalization - [12] have Unicode::Normalize but not integrated to regexes - [13] have \X but at this level . should equal that - [14] need three classes, not just \w and \W - [15] see UTR#21 Case Mappings + 3.1 Surrogates - MISSING [11] + 3.2 Canonical Equivalents - MISSING [12][13] + 3.3 Locale-Independent Graphemes - MISSING [14] + 3.4 Locale-Independent Words - MISSING [15] + 3.5 Locale-Independent Loose Matches - MISSING [16] + + [11] Surrogates are solely a UTF-16 concept and Perl's internal + representation is UTF-8. The Encode module does UTF-16, though. + [12] see UTR#15 Unicode Normalization + [13] have Unicode::Normalize but not integrated to regexes + [14] have \X but at this level . should equal that + [15] need three classes, not just \w and \W + [16] see UTR#21 Case Mappings =item * |