summaryrefslogtreecommitdiff
path: root/pod/perlretut.pod
diff options
context:
space:
mode:
authorJarkko Hietaniemi <jhi@iki.fi>2001-07-06 00:14:57 +0000
committerJarkko Hietaniemi <jhi@iki.fi>2001-07-06 00:14:57 +0000
commit5e42d7b483c9b59a3bad60858cd4bfe9fb4caf8e (patch)
treee87a380270631800c69571e341b695cccb478560 /pod/perlretut.pod
parent10a6ecd25e80ad20ebf67b311125411d51e78bc0 (diff)
downloadperl-5e42d7b483c9b59a3bad60858cd4bfe9fb4caf8e.tar.gz
Unterminated C< (noticed by Richard Hatch), and few other
small Unicode doc tweaks. p4raw-id: //depot/perl@11164
Diffstat (limited to 'pod/perlretut.pod')
-rw-r--r--pod/perlretut.pod13
1 files changed, 7 insertions, 6 deletions
diff --git a/pod/perlretut.pod b/pod/perlretut.pod
index 7f8e8f5430..869a422e31 100644
--- a/pod/perlretut.pod
+++ b/pod/perlretut.pod
@@ -1751,12 +1751,10 @@ letter, the braces can be dropped. For instance, C<\pM> is the
character class of Unicode 'marks', for example accent marks.
For the full list see L<perlunicode>.
-The Unicode has also been separated into blocks of charaters which you
-can test with C<\p{In...}> (in) and C<\P{In...}> (not in), for example
-C<\p{InLatin}, C<\p{InGreek}>, or C<\P{InKatakana}>. For the full list see
-L<perlunicode>.
-
-For the the full and latest information see the latest Unicode standard.
+The Unicode has also been separated into various sets of charaters
+which you can test with C<\p{In...}> (in) and C<\P{In...}> (not in),
+for example C<\p{InLatin}>, C<\p{InGreek}>, or C<\P{InKatakana}>.
+For the full list see L<perlunicode>.
C<\X> is an abbreviation for a character class sequence that includes
the Unicode 'combining character sequences'. A 'combining character
@@ -1768,6 +1766,9 @@ S<C<COMBINING RING> >, which translates in Danish to A with the circle
atop it, as in the word Angstrom. C<\X> is equivalent to C<\PM\pM*}>,
i.e., a non-mark followed by one or more marks.
+For the the full and latest information about Unicode see the latest
+Unicode standard, or the Unicode Consortium's website http://www.unicode.org/
+
As if all those classes weren't enough, Perl also defines POSIX style
character classes. These have the form C<[:name:]>, with C<name> the
name of the POSIX class. The POSIX classes are C<alpha>, C<alnum>,