diff options
author | ph10 <ph10@2f5784b3-3f2a-0410-8824-cb99058d5e15> | 2009-03-18 16:38:23 +0000 |
---|---|---|
committer | ph10 <ph10@2f5784b3-3f2a-0410-8824-cb99058d5e15> | 2009-03-18 16:38:23 +0000 |
commit | 42895029897bc56f99d0d9acd546a98c45961a5a (patch) | |
tree | e1119dba56a1620b7dbe4fbec49e7e038256b84c | |
parent | 401f2d4f645cdfae8ef0e1738d26532ccada17e4 (diff) | |
download | pcre-42895029897bc56f99d0d9acd546a98c45961a5a.tar.gz |
Add words about \b in UTF-8 mode.
git-svn-id: svn://vcs.exim.org/pcre/code/trunk@394 2f5784b3-3f2a-0410-8824-cb99058d5e15
-rw-r--r-- | doc/pcre.3 | 7 | ||||
-rw-r--r-- | doc/pcrepattern.3 | 5 |
2 files changed, 7 insertions, 5 deletions
@@ -252,7 +252,8 @@ digits, spaces, or word characters remain the same set as before, all with values less than 256. This remains true even when PCRE includes Unicode property support, because to do otherwise would slow down PCRE in many common cases. If you really want to test for a wider sense of, say, "digit", you -must use Unicode property tests such as \ep{Nd}. +must use Unicode property tests such as \ep{Nd}. Note that this also applies to +\eb, because it is defined in terms of \ew and \eW. .P 7. Similarly, characters that match the POSIX named character classes are all low-valued characters. @@ -289,6 +290,6 @@ two digits 10, at the domain cam.ac.uk. .rs .sp .nf -Last updated: 12 April 2008 -Copyright (c) 1997-2008 University of Cambridge. +Last updated: 18 March 2009 +Copyright (c) 1997-2009 University of Cambridge. .fi diff --git a/doc/pcrepattern.3 b/doc/pcrepattern.3 index 1189b29..e948087 100644 --- a/doc/pcrepattern.3 +++ b/doc/pcrepattern.3 @@ -364,7 +364,8 @@ In UTF-8 mode, characters with values greater than 128 never match \ed, \es, or \ew, and always match \eD, \eS, and \eW. This is true even when Unicode character property support is available. These sequences retain their original meanings from before UTF-8 support was available, mainly for efficiency -reasons. +reasons. Note that this also affects \eb, because it is defined in terms of \ew +and \eW. .P The sequences \eh, \eH, \ev, and \eV are Perl 5.10 features. In contrast to the other sequences, these do match certain high-valued codepoints in UTF-8 mode. @@ -2244,6 +2245,6 @@ Cambridge CB2 3QH, England. .rs .sp .nf -Last updated: 08 March 2009 +Last updated: 18 March 2009 Copyright (c) 1997-2009 University of Cambridge. .fi |