summaryrefslogtreecommitdiff
path: root/doc/html/pcreapi.html
diff options
context:
space:
mode:
authorph10 <ph10@2f5784b3-3f2a-0410-8824-cb99058d5e15>2013-03-22 16:13:13 +0000
committerph10 <ph10@2f5784b3-3f2a-0410-8824-cb99058d5e15>2013-03-22 16:13:13 +0000
commit712f2578028ec79534921d1b06f7b9d0fa1e643b (patch)
tree210216515dd0488a595b09cbe3b12bc2353f5645 /doc/html/pcreapi.html
parent54b46b0215cca9f79390afab565a31db76372d74 (diff)
downloadpcre-712f2578028ec79534921d1b06f7b9d0fa1e643b.tar.gz
Fix COMMIT in recursion; document backtracking verbs in assertions and
subroutines. git-svn-id: svn://vcs.exim.org/pcre/code/trunk@1298 2f5784b3-3f2a-0410-8824-cb99058d5e15
Diffstat (limited to 'doc/html/pcreapi.html')
-rw-r--r--doc/html/pcreapi.html23
1 files changed, 15 insertions, 8 deletions
diff --git a/doc/html/pcreapi.html b/doc/html/pcreapi.html
index 59398df..656cfc1 100644
--- a/doc/html/pcreapi.html
+++ b/doc/html/pcreapi.html
@@ -1288,9 +1288,14 @@ be used.
PCRE_INFO_MAXLOOKBEHIND
</pre>
Return the number of characters (NB not bytes) in the longest lookbehind
-assertion in the pattern. Note that the simple assertions \b and \B require a
-one-character lookbehind. This information is useful when doing multi-segment
-matching using the partial matching facilities.
+assertion in the pattern. This information is useful when doing multi-segment
+matching using the partial matching facilities. Note that the simple assertions
+\b and \B require a one-character lookbehind. \A also registers a
+one-character lookbehind, though it does not actually inspect the previous
+character. This is to ensure that at least one character from the old segment
+is retained when a new segment is processed. Otherwise, if there are no
+lookbehinds in the pattern, \A might match incorrectly at the start of a new
+segment.
<pre>
PCRE_INFO_MINLENGTH
</pre>
@@ -2332,10 +2337,12 @@ character.
The first byte of a character has the value 0xfe or 0xff. These values can
never occur in a valid UTF-8 string.
<pre>
- PCRE_UTF8_ERR2
+ PCRE_UTF8_ERR22
</pre>
-Non-character. These are the last two characters in each plane (0xfffe, 0xffff,
-0x1fffe, 0x1ffff .. 0x10fffe, 0x10ffff), and the characters 0xfdd0..0xfdef.
+This error code was formerly used when the presence of a so-called
+"non-character" caused an error. Unicode corrigendum #9 makes it clear that
+such characters should not cause a string to be rejected, and so this code is
+no longer in use and is never returned.
</P>
<br><a name="SEC18" href="#TOC1">EXTRACTING CAPTURED SUBSTRINGS BY NUMBER</a><br>
<P>
@@ -2777,9 +2784,9 @@ Cambridge CB2 3QH, England.
</P>
<br><a name="SEC26" href="#TOC1">REVISION</a><br>
<P>
-Last updated: 08 November 2012
+Last updated: 27 February 2013
<br>
-Copyright &copy; 1997-2012 University of Cambridge.
+Copyright &copy; 1997-2013 University of Cambridge.
<br>
<p>
Return to the <a href="index.html">PCRE index page</a>.