author Jeffrey Friedl <jfriedl@regex.info> 2000-07-16 10:55:29 -0700
committer Jarkko Hietaniemi <jhi@iki.fi> 2000-08-18 21:07:06 +0000
commit aaa51d5e11b8b0db616a7f939c784733b4cfef87
tree 5cc57e412e80d1006256d7edc7526dd927cbe2bd /pod/perlre.pod
parent 2a4ebaa641b7ba24b2dcfc940bb2b5da27d05b4e
Add [[:blank:]] as suggested in
Subject: [ID 20000716.024] [=cc=] / [:blank:]
Message-Id: <200007170055.RAA23528@fummy.dsl.yahoo.com>
(the [=cc=] has already been taken care of by #6439 so the whole
bug report can be closed) and make [[:space:]] to be equivalent
to isspace(3) (as opposed to \s, which is isSPACE()). The difference
is that now [[:space:]] matches the mythical vertical tab, while
\s doesn't.

p4raw-id: //depot/perl@6703
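
The distinction drawn in the commit message can be checked with a short script. This is only a minimal sketch, not part of the patch: the helper name classify and the pattern $old_s are hypothetical, and $old_s is an assumption that spells out what \s (isSPACE()) matched when this change was made, namely space, tab, newline, carriage return and form feed.

```perl
use strict;
use warnings;

# Assumption: the set that \s (isSPACE()) covered at the time of this patch.
# chr(11), the vertical tab, is deliberately absent.
my $old_s = qr/[ \t\n\r\f]/;

# Labels are illustrative; the first two patterns are the classes the patch
# documents, the third is the assumed historical \s set from above.
my %class = (
    '[[:space:]]' => qr/[[:space:]]/,
    '[[:blank:]]' => qr/[[:blank:]]/,
    'old \s'      => $old_s,
);

# Hypothetical helper: report, for one character, which classes match it.
# Returns a hash reference mapping each label to 1 (match) or 0 (no match).
sub classify {
    my ($char) = @_;
    my %result;
    for my $label (keys %class) {
        $result{$label} = ($char =~ $class{$label}) ? 1 : 0;
    }
    return \%result;
}

# Tab, vertical tab, and space: the three characters the footnotes discuss.
for my $code (9, 11, 32) {
    my $r = classify(chr $code);
    printf "chr(%2d): %s\n", $code,
        join ', ', map { "$_=$r->{$_}" } sort keys %$r;
}
```

The explicit set is used instead of \s itself because later perls (5.18 and newer) changed \s to match the vertical tab too, so a direct \s test would no longer show the difference described here.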
Diffstat (limited to 'pod/perlre.pod')
-rw-r--r-- pod/perlre.pod 19
1 files changed, 13 insertions, 6 deletions
diff --git a/pod/perlre.pod b/pod/perlre.pod
index c964be8b8f..fa4aad25fd 100644
--- a/pod/perlre.pod
+++ b/pod/perlre.pod
@@ -199,20 +199,26 @@ equivalents (if available) are as follows:
alpha
alnum
ascii
+ blank [1]
cntrl
digit \d
graph
lower
print
punct
- space \s
+ space \s [2]
upper
- word \w
+ word \w [3]
xdigit
+ [1] A GNU extension equivalent to C<[ \t]>, `all horizontal whitespace'.
+ [2] Not I<exactly equivalent> to C<\s> since the C<[[:space:]]> includes
+ also the (very rare) `vertical tabulator', "\ck", chr(11).
+ [3] A Perl extension.
+
For example use C<[:upper:]> to match all the uppercase characters.
-Note that the C<[]> are part of the C<[::]> construct, not part of the whole
-character class. For example:
+Note that the C<[]> are part of the C<[::]> construct, not part of the
+whole character class. For example:
[01[:alpha:]%]
@@ -224,6 +230,7 @@ If the C<utf8> pragma is used, the following equivalences to Unicode
alpha IsAlpha
alnum IsAlnum
ascii IsASCII
+ blank IsSpace
cntrl IsCntrl
digit IsDigit
graph IsGraph
@@ -238,8 +245,8 @@ If the C<utf8> pragma is used, the following equivalences to Unicode
For example C<[:lower:]> and C<\p{IsLower}> are equivalent.
If the C<utf8> pragma is not used but the C<locale> pragma is, the
-classes correlate with the isalpha(3) interface (except for `word',
-which is a Perl extension, mirroring C<\w>).
+classes correlate with the usual isalpha(3) interface (except for
+`word' and `blank').
The assumedly non-obviously named classes are: