Here are the long-expected Unicode/UTF-8 modifications.

p4raw-id: //depot/utfperl@1651
author: Larry Wall <larry@wall.org> 1998-07-24 05:44:33 +0000
committer: Larry Wall <larry@wall.org> 1998-07-24 05:44:33 +0000
commit: a0ed51b321531af4b47cce24205ab9656f043f0f (patch)
tree: 610356407b37a4041ea8bcaf44571579b2da5613 /pod/perlre.pod
parent: 9332a1c1d80ded85a2b1f32b1c8968a35e3b0fbb (diff)
download: perl-a0ed51b321531af4b47cce24205ab9656f043f0f.tar.gz
1 files changed, 5 insertions, 0 deletions
diff --git a/pod/perlre.pod b/pod/perlre.pod
index 924a2c4115..b7fda54061 100644
--- a/pod/perlre.pod
+++ b/pod/perlre.pod
@@ -141,6 +141,7 @@ also work:
     \e		escape (think troff)  (ESC)
     \033	octal char (think of a PDP-11)
     \x1B	hex char
+    \x{263a}	wide hex char         (Unicode SMILEY)
     \c[		control char
     \l		lowercase next char (think vi)
     \u		uppercase next char (think vi)
@@ -165,6 +166,10 @@ In addition, Perl defines the following:
     \S	Match a non-whitespace character
     \d	Match a digit character
     \D	Match a non-digit character
+    \pP	Match P, named property.  Use \p{Prop} for longer names.
+    \PP	Match non-P
+    \X	Match eXtended Unicode "combining character sequence", \pM\pm*
+    \C	Match a single C char (octet) even under utf8.
 
 A C<\w> matches a single alphanumeric character, not a whole
 word.  To match a word you'd need to say C<\w+>.  If C<use locale> is in
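
The escapes this patch documents can be tried out with a minimal sketch like the one below. It assumes a modern Perl 5 interpreter, not the 1998 utfperl branch, so behaviour may differ from what this commit shipped. The variable names are illustrative only. `\C` is left out because later Perl releases removed it, and later versions of perlre describe `\X` as roughly `(?:\PM\pM*)`, that is, a non-mark character followed by any number of marks.

```perl
use strict;
use warnings;

# \x{263a} names a code point above 0xFF (the Unicode smiley from the patch).
my $smiley = "\x{263a}";

# "e" followed by U+0301 COMBINING ACUTE ACCENT:
# two code points that a reader perceives as one character.
my $accented = "e\x{301}";

# \pL is the short form for a one-letter property name;
# \p{Lu} uses braces for a longer property name.
my $is_letter  = ("Z" =~ /^\pL$/)     ? 1 : 0;
my $is_upper   = ("Z" =~ /^\p{Lu}$/)  ? 1 : 0;

# \PL is the negation: the smiley is a symbol, not a letter.
my $not_letter = ($smiley =~ /^\PL$/) ? 1 : 0;

# \X consumes the base character together with its combining mark,
# so it spans the whole two-code-point string in a single match.
my $one_cluster = ($accented =~ /^\X$/) ? 1 : 0;

# Only ASCII is printed, so no output encoding layer is needed.
printf "smiley code point: U+%04X\n", ord $smiley;
printf "code points in accented e: %d\n", length $accented;
printf "letter=%d upper=%d not_letter=%d one_cluster=%d\n",
    $is_letter, $is_upper, $not_letter, $one_cluster;
```

Anchoring each pattern with `^` and `$` keeps the checks strict: a match succeeds only if the single escape accounts for the entire string.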