summaryrefslogtreecommitdiff
path: root/pod/perlreref.pod
diff options
context:
space:
mode:
authorJarkko Hietaniemi <jhi@iki.fi>2003-08-12 19:41:50 +0000
committerJarkko Hietaniemi <jhi@iki.fi>2003-08-12 19:41:50 +0000
commite04a154e63afa686ad245e67ef7cc7496d92c1a2 (patch)
tree5fb87bd72ad55f0e3c6302e74e41405e4d9732fb /pod/perlreref.pod
parent65eba18fa4db1d6a117170ee75351b16dc792fca (diff)
downloadperl-e04a154e63afa686ad245e67ef7cc7496d92c1a2.tar.gz
perlreref tweaks from SADAHIRO Tomoyuki and Ian Truskett.
p4raw-id: //depot/perl@20661
Diffstat (limited to 'pod/perlreref.pod')
-rw-r--r--pod/perlreref.pod52
1 files changed, 28 insertions, 24 deletions
diff --git a/pod/perlreref.pod b/pod/perlreref.pod
index fc38b1372c..700814140d 100644
--- a/pod/perlreref.pod
+++ b/pod/perlreref.pod
@@ -104,15 +104,19 @@ This one works differently from normal strings:
[f-j-] Dash escaped or at start or end means 'dash'
[^f-j] Caret indicates "match any character _except_ these"
-The following work within or without a character class:
-
- \d A digit, same as [0-9]
- \D A nondigit, same as [^0-9]
- \w A word character (alphanumeric), same as [a-zA-Z0-9_]
- \W A non-word character, [^a-zA-Z0-9_]
- \s A whitespace character, same as [ \t\n\r\f]
- \S A non-whitespace character, [^ \t\n\r\f]
- \C Match a byte (with Unicode, '.' matches char)
+The following sequences work within or without a character class.
+The first six are locale aware, all are Unicode aware. The default
+character class equivalent are given. See L<perllocale> and
+L<perlunicode> for details.
+
+ \d A digit [0-9]
+ \D A nondigit [^0-9]
+ \w A word character [a-zA-Z0-9_]
+ \W A non-word character [^a-zA-Z0-9_]
+ \s A whitespace character [ \t\n\r\f]
+ \S A non-whitespace character [^ \t\n\r\f]
+
+ \C Match a byte (with Unicode, '.' matches a character)
\pP Match P-named (Unicode) property
\p{...} Match Unicode property with long name
\PP Match non-P
@@ -121,21 +125,21 @@ The following work within or without a character class:
POSIX character classes and their Unicode and Perl equivalents:
- alnum IsAlnum Alphanumeric
- alpha IsAlpha Alphabetic
- ascii IsASCII Any ASCII char
- blank IsSpace [ \t] Horizontal whitespace (GNU)
- cntrl IsCntrl Control characters
- digit IsDigit \d Digits
- graph IsGraph Alphanumeric and punctuation
- lower IsLower Lowercase chars (locale aware)
- print IsPrint Alphanumeric, punct, and space
- punct IsPunct Punctuation
- space IsSpace [\s\ck] Whitespace
- IsSpacePerl \s Perl's whitespace definition
- upper IsUpper Uppercase chars (locale aware)
- word IsWord \w Alphanumeric plus _ (Perl)
- xdigit IsXDigit [\dA-Fa-f] Hexadecimal digit
+ alnum IsAlnum Alphanumeric
+ alpha IsAlpha Alphabetic
+ ascii IsASCII Any ASCII char
+ blank IsSpace [ \t] Horizontal whitespace (GNU extension)
+ cntrl IsCntrl Control characters
+ digit IsDigit \d Digits
+ graph IsGraph Alphanumeric and punctuation
+ lower IsLower Lowercase chars (locale and Unicode aware)
+ print IsPrint Alphanumeric, punct, and space
+ punct IsPunct Punctuation
+ space IsSpace [\s\ck] Whitespace
+ IsSpacePerl \s Perl's whitespace definition
+ upper IsUpper Uppercase chars (locale and Unicode aware)
+ word IsWord \w Alphanumeric plus _ (Perl extension)
+ xdigit IsXDigit [0-9A-Fa-f] Hexadecimal digit
Within a character class: