From 5db9882c5b5d071bd70347ae5b5e248fc73c24d1 Mon Sep 17 00:00:00 2001 From: Karl Williamson Date: Sat, 16 Jul 2011 18:14:46 -0600 Subject: perlrecharclass: Nits One nit is that the only difference between [\h\v] and \s is VT; it's not just one difference. Another nit is that the synonyms in the table may be to either the ASCII or full-range depending on various things. --- pod/perlrecharclass.pod | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/pod/perlrecharclass.pod b/pod/perlrecharclass.pod index aac1c27c0b..c07f5032d9 100644 --- a/pod/perlrecharclass.pod +++ b/pod/perlrecharclass.pod @@ -26,7 +26,7 @@ to mean just the bracketed form. Certainly, most Perl documentation does that. The dot (or period), C<.> is probably the most used, and certainly the most well-known character class. By default, a dot matches any -character, except for the newline. The default can be changed to +character, except for the newline. That default can be changed to add matching the newline by using the I modifier: either for the entire regular expression with the C modifier, or locally with C<(?s)>. (The experimental C<\N> backslash sequence, described @@ -89,7 +89,7 @@ names are respectively C, C<4F>, and C. =head3 Digits C<\d> matches a single character considered to be a decimal I. -If the C regular expression modifier in effect, it matches [0-9]. +If the C regular expression modifier is in effect, it matches [0-9]. Otherwise, it matches anything that is matched by C<\p{Digit}>, which includes [0-9]. (An unlikely possible exception is that under locale matching rules, the @@ -272,8 +272,8 @@ the same characters, without regard to other factors, such as whether the source string is in UTF-8 format. One might think that C<\s> is equivalent to C<[\h\v]>. This is not true. -For example, the vertical tab (C<"\x0b">) is not matched by C<\s>, it is -however considered vertical whitespace. +The difference is that the vertical tab (C<"\x0b">) is not matched by +C<\s>; it is however considered vertical whitespace. The following table is a complete listing of characters matched by C<\s>, C<\h> and C<\v> as of Unicode 6.0. @@ -653,7 +653,7 @@ appropriate characters in the full Unicode character set. For example, C<\p{Alpha}> matches not just the ASCII alphabetic characters, but any character in the entire Unicode character set considered alphabetic. An entry in the column labelled "backslash sequence" is a (short) -synonym for the Full-range Unicode form. +equivalent. [[:...:]] ASCII-range Full-range backslash Note Unicode Unicode sequence -- cgit v1.2.1