diff options
author | Jarkko Hietaniemi <jhi@iki.fi> | 2002-09-06 09:01:57 +0300 |
---|---|---|
committer | Abhijit Menon-Sen <ams@wiw.org> | 2002-09-06 03:31:32 +0000 |
commit | 63de3cb284beb0325229608ff63562933eba8f50 (patch) | |
tree | 1e4b9176d04cfd4cb024e1df6bc3fd9a3bea8047 /pod | |
parent | 83d057904fcf43ccbeee0b8e23d13ba528a6cb6a (diff) | |
download | perl-63de3cb284beb0325229608ff63562933eba8f50.tar.gz |
(mostly (Unicode)) pod nits
Message-Id: <20020906030157.GA28252@lyta.hut.fi>
p4raw-id: //depot/perl@17850
Diffstat (limited to 'pod')
-rw-r--r-- | pod/perl.pod | 15 | ||||
-rw-r--r-- | pod/perlunicode.pod | 37 | ||||
-rw-r--r-- | pod/perluniintro.pod | 2 |
3 files changed, 21 insertions, 33 deletions
diff --git a/pod/perl.pod b/pod/perl.pod index 75331e1eb3..66a0821d2e 100644 --- a/pod/perl.pod +++ b/pod/perl.pod @@ -263,7 +263,8 @@ L<perlguts>, and L<xsubpp>. =item * -roll-your-own magic variables (including multiple simultaneous DBM implementations) +roll-your-own magic variables (including multiple simultaneous DBM +implementations) Described in L<perltie> and L<AnyDBM_File>. @@ -288,21 +289,15 @@ and L<perlbot>. =item * -compilability into C code or Perl bytecode - -Described in L<B> and L<B::Bytecode>. - -=item * - support for light-weight processes (threads) -Described in L<perlthrtut> and L<Thread>. +Described in L<perlthrtut> and L<threads>. =item * -support for internationalization, localization, and Unicode +support for Unicode, internationalization, and localization -Described in L<perllocale> and L<utf8>. +Described in L<perluniintro>, L<perllocale> and L<Locale::Maketext>. =item * diff --git a/pod/perlunicode.pod b/pod/perlunicode.pod index 8489702fd5..49f7432b9a 100644 --- a/pod/perlunicode.pod +++ b/pod/perlunicode.pod @@ -598,17 +598,8 @@ than one Unicode character. =back -The following cases do not yet work: - -=over 8 - -=item * - -the "final sigma" (Greek), and - -=item * - -anything to with locales (Lithuanian, Turkish, Azeri). +Things to do with locales (Lithuanian, Turkish, Azeri) do B<not> work +since Perl does not understand the concept of Unicode locales. =back @@ -771,17 +762,19 @@ which will match assigned characters known to be part of the Greek script. Level 2 - Extended Unicode Support - 3.1 Surrogates - MISSING - 3.2 Canonical Equivalents - MISSING [11][12] - 3.3 Locale-Independent Graphemes - MISSING [13] - 3.4 Locale-Independent Words - MISSING [14] - 3.5 Locale-Independent Loose Matches - MISSING [15] - - [11] see UTR#15 Unicode Normalization - [12] have Unicode::Normalize but not integrated to regexes - [13] have \X but at this level . should equal that - [14] need three classes, not just \w and \W - [15] see UTR#21 Case Mappings + 3.1 Surrogates - MISSING [11] + 3.2 Canonical Equivalents - MISSING [12][13] + 3.3 Locale-Independent Graphemes - MISSING [14] + 3.4 Locale-Independent Words - MISSING [15] + 3.5 Locale-Independent Loose Matches - MISSING [16] + + [11] Surrogates are solely a UTF-16 concept and Perl's internal + representation is UTF-8. The Encode module does UTF-16, though. + [12] see UTR#15 Unicode Normalization + [13] have Unicode::Normalize but not integrated to regexes + [14] have \X but at this level . should equal that + [15] need three classes, not just \w and \W + [16] see UTR#21 Case Mappings =item * diff --git a/pod/perluniintro.pod b/pod/perluniintro.pod index 870926ea1f..223dbae7fd 100644 --- a/pod/perluniintro.pod +++ b/pod/perluniintro.pod @@ -862,7 +862,7 @@ If you have the GNU recode installed, you can also use the Perl front-end C<Convert::Recode> for character conversions. The following are fast conversions from ISO 8859-1 (Latin-1) bytes -to UTF-8 bytes, the code works even with older Perl 5 versions. +to UTF-8 bytes and back, the code works even with older Perl 5 versions. # ISO 8859-1 to UTF-8 s/([\x80-\xFF])/chr(0xC0|ord($1)>>6).chr(0x80|ord($1)&0x3F)/eg; |