summaryrefslogtreecommitdiff
path: root/pod
diff options
context:
space:
mode:
authorJarkko Hietaniemi <jhi@iki.fi>2002-09-06 09:01:57 +0300
committerAbhijit Menon-Sen <ams@wiw.org>2002-09-06 03:31:32 +0000
commit63de3cb284beb0325229608ff63562933eba8f50 (patch)
tree1e4b9176d04cfd4cb024e1df6bc3fd9a3bea8047 /pod
parent83d057904fcf43ccbeee0b8e23d13ba528a6cb6a (diff)
downloadperl-63de3cb284beb0325229608ff63562933eba8f50.tar.gz
(mostly (Unicode)) pod nits
Message-Id: <20020906030157.GA28252@lyta.hut.fi> p4raw-id: //depot/perl@17850
Diffstat (limited to 'pod')
-rw-r--r--pod/perl.pod15
-rw-r--r--pod/perlunicode.pod37
-rw-r--r--pod/perluniintro.pod2
3 files changed, 21 insertions, 33 deletions
diff --git a/pod/perl.pod b/pod/perl.pod
index 75331e1eb3..66a0821d2e 100644
--- a/pod/perl.pod
+++ b/pod/perl.pod
@@ -263,7 +263,8 @@ L<perlguts>, and L<xsubpp>.
=item *
-roll-your-own magic variables (including multiple simultaneous DBM implementations)
+roll-your-own magic variables (including multiple simultaneous DBM
+implementations)
Described in L<perltie> and L<AnyDBM_File>.
@@ -288,21 +289,15 @@ and L<perlbot>.
=item *
-compilability into C code or Perl bytecode
-
-Described in L<B> and L<B::Bytecode>.
-
-=item *
-
support for light-weight processes (threads)
-Described in L<perlthrtut> and L<Thread>.
+Described in L<perlthrtut> and L<threads>.
=item *
-support for internationalization, localization, and Unicode
+support for Unicode, internationalization, and localization
-Described in L<perllocale> and L<utf8>.
+Described in L<perluniintro>, L<perllocale> and L<Locale::Maketext>.
=item *
diff --git a/pod/perlunicode.pod b/pod/perlunicode.pod
index 8489702fd5..49f7432b9a 100644
--- a/pod/perlunicode.pod
+++ b/pod/perlunicode.pod
@@ -598,17 +598,8 @@ than one Unicode character.
=back
-The following cases do not yet work:
-
-=over 8
-
-=item *
-
-the "final sigma" (Greek), and
-
-=item *
-
-anything to with locales (Lithuanian, Turkish, Azeri).
+Things to do with locales (Lithuanian, Turkish, Azeri) do B<not> work
+since Perl does not understand the concept of Unicode locales.
=back
@@ -771,17 +762,19 @@ which will match assigned characters known to be part of the Greek script.
Level 2 - Extended Unicode Support
- 3.1 Surrogates - MISSING
- 3.2 Canonical Equivalents - MISSING [11][12]
- 3.3 Locale-Independent Graphemes - MISSING [13]
- 3.4 Locale-Independent Words - MISSING [14]
- 3.5 Locale-Independent Loose Matches - MISSING [15]
-
- [11] see UTR#15 Unicode Normalization
- [12] have Unicode::Normalize but not integrated to regexes
- [13] have \X but at this level . should equal that
- [14] need three classes, not just \w and \W
- [15] see UTR#21 Case Mappings
+ 3.1 Surrogates - MISSING [11]
+ 3.2 Canonical Equivalents - MISSING [12][13]
+ 3.3 Locale-Independent Graphemes - MISSING [14]
+ 3.4 Locale-Independent Words - MISSING [15]
+ 3.5 Locale-Independent Loose Matches - MISSING [16]
+
+ [11] Surrogates are solely a UTF-16 concept and Perl's internal
+ representation is UTF-8. The Encode module does UTF-16, though.
+ [12] see UTR#15 Unicode Normalization
+ [13] have Unicode::Normalize but not integrated to regexes
+ [14] have \X but at this level . should equal that
+ [15] need three classes, not just \w and \W
+ [16] see UTR#21 Case Mappings
=item *
diff --git a/pod/perluniintro.pod b/pod/perluniintro.pod
index 870926ea1f..223dbae7fd 100644
--- a/pod/perluniintro.pod
+++ b/pod/perluniintro.pod
@@ -862,7 +862,7 @@ If you have the GNU recode installed, you can also use the
Perl front-end C<Convert::Recode> for character conversions.
The following are fast conversions from ISO 8859-1 (Latin-1) bytes
-to UTF-8 bytes, the code works even with older Perl 5 versions.
+to UTF-8 bytes and back, the code works even with older Perl 5 versions.
# ISO 8859-1 to UTF-8
s/([\x80-\xFF])/chr(0xC0|ord($1)>>6).chr(0x80|ord($1)&0x3F)/eg;