diff options
author | Karl Williamson <public@khwilliamson.com> | 2011-07-05 20:27:40 -0600 |
---|---|---|
committer | Karl Williamson <public@khwilliamson.com> | 2011-07-05 20:32:05 -0600 |
commit | 615d795d657ccfcad8a25f1adf594120d271288a (patch) | |
tree | 77003d159741e7cee041e99caa76497c762ddb6a /pod/perlretut.pod | |
parent | 582da94270934217d199ef2000e655c89b0e769d (diff) | |
download | perl-615d795d657ccfcad8a25f1adf594120d271288a.tar.gz |
perlretut: Remove references to /dul
These modifiers are automatically selected when various
pragmas are in effect, and their explicit use is rarely called
for, so shouldn't be mentioned in a tutorial introduction.
Diffstat (limited to 'pod/perlretut.pod')
-rw-r--r-- | pod/perlretut.pod | 17 |
1 files changed, 8 insertions, 9 deletions
diff --git a/pod/perlretut.pod b/pod/perlretut.pod index 46d9a36701..af7ac32836 100644 --- a/pod/perlretut.pod +++ b/pod/perlretut.pod @@ -1654,7 +1654,7 @@ important not only to match what is desired, but to reject what is not desired. (There are other regexp modifiers that are available, such as -C<//o>, C<//d>, and C<//l>, but their specialized uses are beyond the +C<//o>, but their specialized uses are beyond the scope of this introduction. ) =head3 Search and replace @@ -1925,14 +1925,13 @@ One can also use short names or restrict names to a certain alphabet: A list of full names can be found in F<NamesList.txt> in the Unicode standard (available at L<http://www.unicode.org/Public/UNIDATA/>). -The answer to requirement 2), as of 5.6.0, is that a regexp (mostly) -uses Unicode characters. (For messy backward compatibility reasons, -most but not all semantics of a match will assume Unicode, unless, -starting in Perl 5.14, you tell it to use full Unicode. You can do this -explicitly by using the C<//u> modifier, or you can ask Perl to use the -modifier implicitly for all regexes in a scope by using C<use 5.012> (or -higher) or C<use feature 'unicode_strings'>.) If you want to handle -Unicode properly, you should ensure that one of these is the case.) +The answer to requirement 2) is, as of 5.6.0, that a regexp (mostly) +uses Unicode characters. (The "mostly" is for messy backward +compatibility reasons, but starting in Perl 5.14, any regex compiled in +the scope of a C<use feature 'unicode_strings'> (which is automatically +turned on within the scope of a C<use 5.012> or higher) will turn that +"mostly" into "always". If you want to handle Unicode properly, you +should ensure that C<'unicode_strings'> is turned on.) Internally, this is encoded to bytes using either UTF-8 or a native 8 bit encoding, depending on the history of the string, but conceptually it is a sequence of characters, not bytes. See L<perlunitut> for a |