summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorph10 <ph10@2f5784b3-3f2a-0410-8824-cb99058d5e15>2009-03-18 16:38:23 +0000
committerph10 <ph10@2f5784b3-3f2a-0410-8824-cb99058d5e15>2009-03-18 16:38:23 +0000
commit42895029897bc56f99d0d9acd546a98c45961a5a (patch)
treee1119dba56a1620b7dbe4fbec49e7e038256b84c
parent401f2d4f645cdfae8ef0e1738d26532ccada17e4 (diff)
downloadpcre-42895029897bc56f99d0d9acd546a98c45961a5a.tar.gz
Add words about \b in UTF-8 mode.
git-svn-id: svn://vcs.exim.org/pcre/code/trunk@394 2f5784b3-3f2a-0410-8824-cb99058d5e15
-rw-r--r--doc/pcre.37
-rw-r--r--doc/pcrepattern.35
2 files changed, 7 insertions, 5 deletions
diff --git a/doc/pcre.3 b/doc/pcre.3
index 4f738d9..2a7b9e1 100644
--- a/doc/pcre.3
+++ b/doc/pcre.3
@@ -252,7 +252,8 @@ digits, spaces, or word characters remain the same set as before, all with
values less than 256. This remains true even when PCRE includes Unicode
property support, because to do otherwise would slow down PCRE in many common
cases. If you really want to test for a wider sense of, say, "digit", you
-must use Unicode property tests such as \ep{Nd}.
+must use Unicode property tests such as \ep{Nd}. Note that this also applies to
+\eb, because it is defined in terms of \ew and \eW.
.P
7. Similarly, characters that match the POSIX named character classes are all
low-valued characters.
@@ -289,6 +290,6 @@ two digits 10, at the domain cam.ac.uk.
.rs
.sp
.nf
-Last updated: 12 April 2008
-Copyright (c) 1997-2008 University of Cambridge.
+Last updated: 18 March 2009
+Copyright (c) 1997-2009 University of Cambridge.
.fi
diff --git a/doc/pcrepattern.3 b/doc/pcrepattern.3
index 1189b29..e948087 100644
--- a/doc/pcrepattern.3
+++ b/doc/pcrepattern.3
@@ -364,7 +364,8 @@ In UTF-8 mode, characters with values greater than 128 never match \ed, \es, or
\ew, and always match \eD, \eS, and \eW. This is true even when Unicode
character property support is available. These sequences retain their original
meanings from before UTF-8 support was available, mainly for efficiency
-reasons.
+reasons. Note that this also affects \eb, because it is defined in terms of \ew
+and \eW.
.P
The sequences \eh, \eH, \ev, and \eV are Perl 5.10 features. In contrast to the
other sequences, these do match certain high-valued codepoints in UTF-8 mode.
@@ -2244,6 +2245,6 @@ Cambridge CB2 3QH, England.
.rs
.sp
.nf
-Last updated: 08 March 2009
+Last updated: 18 March 2009
Copyright (c) 1997-2009 University of Cambridge.
.fi