diff options
author | Karl Williamson <public@khwilliamson.com> | 2011-07-16 18:14:46 -0600 |
---|---|---|
committer | Karl Williamson <public@khwilliamson.com> | 2011-07-17 09:34:55 -0600 |
commit | 5db9882c5b5d071bd70347ae5b5e248fc73c24d1 (patch) | |
tree | 7f7bd9d3d9164286578d8fdd934b5327cb50e0a3 /pod/perlrecharclass.pod | |
parent | 516074bbdc51e536e82ee0a6d2105196e7461dd0 (diff) | |
download | perl-5db9882c5b5d071bd70347ae5b5e248fc73c24d1.tar.gz |
perlrecharclass: Nits
One nit is that the only difference between [\h\v] and \s is VT; it's
not just one difference.
Another nit is that the synonyms in the table may be to either
the ASCII or full-range depending on various things.
Diffstat (limited to 'pod/perlrecharclass.pod')
-rw-r--r-- | pod/perlrecharclass.pod | 10 |
1 files changed, 5 insertions, 5 deletions
diff --git a/pod/perlrecharclass.pod b/pod/perlrecharclass.pod index aac1c27c0b..c07f5032d9 100644 --- a/pod/perlrecharclass.pod +++ b/pod/perlrecharclass.pod @@ -26,7 +26,7 @@ to mean just the bracketed form. Certainly, most Perl documentation does that. The dot (or period), C<.> is probably the most used, and certainly the most well-known character class. By default, a dot matches any -character, except for the newline. The default can be changed to +character, except for the newline. That default can be changed to add matching the newline by using the I<single line> modifier: either for the entire regular expression with the C</s> modifier, or locally with C<(?s)>. (The experimental C<\N> backslash sequence, described @@ -89,7 +89,7 @@ names are respectively C<COLON>, C<4F>, and C<F4>. =head3 Digits C<\d> matches a single character considered to be a decimal I<digit>. -If the C</a> regular expression modifier in effect, it matches [0-9]. +If the C</a> regular expression modifier is in effect, it matches [0-9]. Otherwise, it matches anything that is matched by C<\p{Digit}>, which includes [0-9]. (An unlikely possible exception is that under locale matching rules, the @@ -272,8 +272,8 @@ the same characters, without regard to other factors, such as whether the source string is in UTF-8 format. One might think that C<\s> is equivalent to C<[\h\v]>. This is not true. -For example, the vertical tab (C<"\x0b">) is not matched by C<\s>, it is -however considered vertical whitespace. +The difference is that the vertical tab (C<"\x0b">) is not matched by +C<\s>; it is however considered vertical whitespace. The following table is a complete listing of characters matched by C<\s>, C<\h> and C<\v> as of Unicode 6.0. @@ -653,7 +653,7 @@ appropriate characters in the full Unicode character set. For example, C<\p{Alpha}> matches not just the ASCII alphabetic characters, but any character in the entire Unicode character set considered alphabetic. An entry in the column labelled "backslash sequence" is a (short) -synonym for the Full-range Unicode form. +equivalent. [[:...:]] ASCII-range Full-range backslash Note Unicode Unicode sequence |