summaryrefslogtreecommitdiff
path: root/doc/pcre2syntax.3
diff options
context:
space:
mode:
authorph10 <ph10@6239d852-aaf2-0410-a92c-79f79f948069>2015-04-13 17:29:05 +0000
committerph10 <ph10@6239d852-aaf2-0410-a92c-79f79f948069>2015-04-13 17:29:05 +0000
commitf7b165394f595cce65573950cf3e27c9f027b227 (patch)
treedc177e3f10c59870383bcbd39955208d53413b47 /doc/pcre2syntax.3
parent8dee6799eb1696b1f4c5193055c2fb8c70bf7ec5 (diff)
downloadpcre2-f7b165394f595cce65573950cf3e27c9f027b227.tar.gz
Implement PCRE2_NEVER_BACKSLASH_C.
git-svn-id: svn://vcs.exim.org/pcre2/code/trunk@247 6239d852-aaf2-0410-a92c-79f79f948069
Diffstat (limited to 'doc/pcre2syntax.3')
-rw-r--r--doc/pcre2syntax.314
1 files changed, 10 insertions, 4 deletions
diff --git a/doc/pcre2syntax.3 b/doc/pcre2syntax.3
index cfc6d0f..10a1584 100644
--- a/doc/pcre2syntax.3
+++ b/doc/pcre2syntax.3
@@ -1,4 +1,4 @@
-.TH PCRE2SYNTAX 3 "15 March 2015" "PCRE2 10.20"
+.TH PCRE2SYNTAX 3 "13 April 2015" "PCRE2 10.20"
.SH NAME
PCRE2 - Perl-compatible regular expressions (revised API)
.SH "PCRE2 REGULAR EXPRESSION SYNTAX SUMMARY"
@@ -44,7 +44,7 @@ characters "8" and "9".
.sp
. any character except newline;
in dotall mode, any character whatsoever
- \eC one data unit, even in UTF mode (best avoided)
+ \eC one code unit, even in UTF mode (best avoided)
\ed a decimal digit
\eD a character that is not a decimal digit
\eh a horizontal white space character
@@ -61,6 +61,10 @@ characters "8" and "9".
\eW a "non-word" character
\eX a Unicode extended grapheme cluster
.sp
+The application can lock out the use of \eC by setting the
+PCRE2_NEVER_BACKSLASH_C option. It is dangerous because it may leave the
+current matching point in the middle of a UTF-8 or UTF-16 character.
+.P
By default, \ed, \es, and \ew match only ASCII characters, even in UTF-8 mode
or in the 16-bit and 32-bit libraries. However, if locale-specific matching is
happening, \es and \ew may also match characters with code points in the range
@@ -396,7 +400,9 @@ appear.
(*UCP) set PCRE2_UCP (use Unicode properties for \ed etc)
.sp
Note that LIMIT_MATCH and LIMIT_RECURSION can only reduce the value of the
-limits set by the caller of pcre2_match(), not increase them.
+limits set by the caller of pcre2_match(), not increase them. The application
+can lock out the use of (*UTF) and (*UCP) by setting the PCRE2_NEVER_UTF or
+PCRE2_NEVER_UCP options, respectively, at compile time.
.
.
.SH "NEWLINE CONVENTION"
@@ -543,6 +549,6 @@ Cambridge, England.
.rs
.sp
.nf
-Last updated: 15 March 2015
+Last updated: 13 April 2015
Copyright (c) 1997-2015 University of Cambridge.
.fi