author | Karl Williamson <public@khwilliamson.com> | 2011-06-24 12:34:46 -0600 |
---|---|---|
committer | Karl Williamson <public@khwilliamson.com> | 2011-06-24 12:53:50 -0600 |
commit | d9742aa386b81e67ce2ecd1229bb8f04bf2899ec (patch) | |
tree | 77fa67ab5104f09c60ce0ddd55077d18fe1f601b /pod/perlunicode.pod | |
parent | 2f2f3ec9c49055c69541020aa1bf644ca5f8b6f1 (diff) | |
perlunicode: Nits
a missing word, remove L<> from a verbatim block, add L<> for a module,
clarify wording
Diffstat (limited to 'pod/perlunicode.pod')
-rw-r--r-- | pod/perlunicode.pod | 8 |
1 files changed, 4 insertions, 4 deletions
diff --git a/pod/perlunicode.pod b/pod/perlunicode.pod
index 5a0dd7b8d9..5f8ef88a97 100644
--- a/pod/perlunicode.pod
+++ b/pod/perlunicode.pod
@@ -503,7 +503,7 @@ and inherit the script value of the controlling character. (Note that
 there are several different sets of digits in Unicode that are
 equivalent to 0-9 and are matchable by C<\d> in a regular expression.
 If they are used in a single language only, they are in that language's
-script. Only sets are used across several languages are in the
+script. Only sets that are used across several languages are in the
 C<Common> script.)
 
 For more about scripts versus blocks, see UAX#24 "Unicode Script Property":
@@ -1081,7 +1081,7 @@ Level 1 - Basic Unicode Support
         [1] \x{...}
         [2] \p{...} \P{...}
         [3] supports not only minimal list, but all Unicode character
-            properties (see L</Unicode Character Properties>)
+            properties (see Unicode Character Properties above)
         [4] \d \D \s \S \w \W \X [:prop:] [:^prop:]
         [5] can use regular expression look-ahead [a] or
             user-defined character properties [b] to emulate set
@@ -1089,7 +1089,7 @@ Level 1 - Basic Unicode Support
         [6] \b \B
         [7] note that Perl does Full case-folding in matching (but with
             bugs), not Simple: for example U+1F88 is equivalent to
-            U+1F00 U+03B9, not with 1F80. This difference matters
+            U+1F00 U+03B9, instead of just U+1F80. This difference matters
             mainly for certain Greek capital letters with certain modifiers:
             the Full case-folding decomposes the letter, while the Simple
             case-folding would map it to a single
@@ -1121,7 +1121,7 @@ But in this particular example, you probably really want
 
 which will match assigned characters known to be part of the Greek script.
 
-Also see the Unicode::Regex::Set module, it does implement the full
+Also see the L<Unicode::Regex::Set> module, it does implement the full
 UTS#18 grouping, intersection, union, and removal (subtraction) syntax.
 
 [b] '+' for union, '-' for removal (set-difference), '&' for intersection