summaryrefslogtreecommitdiff
path: root/pod/perlretut.pod
diff options
context:
space:
mode:
authorKarl Williamson <public@khwilliamson.com>2011-07-05 20:27:40 -0600
committerKarl Williamson <public@khwilliamson.com>2011-07-05 20:32:05 -0600
commit615d795d657ccfcad8a25f1adf594120d271288a (patch)
tree77003d159741e7cee041e99caa76497c762ddb6a /pod/perlretut.pod
parent582da94270934217d199ef2000e655c89b0e769d (diff)
downloadperl-615d795d657ccfcad8a25f1adf594120d271288a.tar.gz
perlretut: Remove references to /dul
These modifiers are automatically selected when various pragmas are in effect, and their explicit use is rarely called for, so shouldn't be mentioned in a tutorial introduction.
Diffstat (limited to 'pod/perlretut.pod')
-rw-r--r--pod/perlretut.pod17
1 files changed, 8 insertions, 9 deletions
diff --git a/pod/perlretut.pod b/pod/perlretut.pod
index 46d9a36701..af7ac32836 100644
--- a/pod/perlretut.pod
+++ b/pod/perlretut.pod
@@ -1654,7 +1654,7 @@ important not only to match what is desired, but to reject what is not
desired.
(There are other regexp modifiers that are available, such as
-C<//o>, C<//d>, and C<//l>, but their specialized uses are beyond the
+C<//o>, but their specialized uses are beyond the
scope of this introduction. )
=head3 Search and replace
@@ -1925,14 +1925,13 @@ One can also use short names or restrict names to a certain alphabet:
A list of full names can be found in F<NamesList.txt> in the Unicode standard
(available at L<http://www.unicode.org/Public/UNIDATA/>).
-The answer to requirement 2), as of 5.6.0, is that a regexp (mostly)
-uses Unicode characters. (For messy backward compatibility reasons,
-most but not all semantics of a match will assume Unicode, unless,
-starting in Perl 5.14, you tell it to use full Unicode. You can do this
-explicitly by using the C<//u> modifier, or you can ask Perl to use the
-modifier implicitly for all regexes in a scope by using C<use 5.012> (or
-higher) or C<use feature 'unicode_strings'>.) If you want to handle
-Unicode properly, you should ensure that one of these is the case.)
+The answer to requirement 2) is, as of 5.6.0, that a regexp (mostly)
+uses Unicode characters. (The "mostly" is for messy backward
+compatibility reasons, but starting in Perl 5.14, any regex compiled in
+the scope of a C<use feature 'unicode_strings'> (which is automatically
+turned on within the scope of a C<use 5.012> or higher) will turn that
+"mostly" into "always". If you want to handle Unicode properly, you
+should ensure that C<'unicode_strings'> is turned on.)
Internally, this is encoded to bytes using either UTF-8 or a native 8
bit encoding, depending on the history of the string, but conceptually
it is a sequence of characters, not bytes. See L<perlunitut> for a