summaryrefslogtreecommitdiff
path: root/README
diff options
context:
space:
mode:
authorph10 <ph10@2f5784b3-3f2a-0410-8824-cb99058d5e15>2009-03-17 21:16:01 +0000
committerph10 <ph10@2f5784b3-3f2a-0410-8824-cb99058d5e15>2009-03-17 21:16:01 +0000
commitf6cacbfecd16ac9d19ee6fe6c26c107ee4846b1c (patch)
treefb3166c26b8f211814f8302719f5bd93842aa575 /README
parent8d719b57aa95b8a2baebd8fd13c6b80c2da84680 (diff)
downloadpcre-f6cacbfecd16ac9d19ee6fe6c26c107ee4846b1c.tar.gz
Add support for UTF-8 in EBCDIC environments.
git-svn-id: svn://vcs.exim.org/pcre/code/trunk@391 2f5784b3-3f2a-0410-8824-cb99058d5e15
Diffstat (limited to 'README')
-rw-r--r--README19
1 files changed, 12 insertions, 7 deletions
diff --git a/README b/README
index e149bf4..10f73d7 100644
--- a/README
+++ b/README
@@ -161,10 +161,13 @@ library. You can read more about them in the pcrebuild man page.
it will try to find a C++ compiler and C++ header files, and if it succeeds,
it will try to build the C++ wrapper.
-. If you want to make use of the support for UTF-8 character strings in PCRE,
- you must add --enable-utf8 to the "configure" command. Without it, the code
- for handling UTF-8 is not included in the library. (Even when included, it
- still has to be enabled by an option at run time.)
+. If you want to make use of the support for UTF-8 Unicode character strings in
+ PCRE, you must add --enable-utf8 to the "configure" command. Without it, the
+ code for handling UTF-8 is not included in the library. Even when included,
+ it still has to be enabled by an option at run time. When PCRE is compiled
+ with this option, its input can only either be ASCII or UTF-8, even when
+ running on EBCDIC platforms. It is not possible to use both --enable-utf8 and
+ --enable-ebcdic at the same time.
. If, in addition to support for UTF-8 character strings, you want to include
support for the \P, \p, and \X sequences that recognize Unicode character
@@ -255,11 +258,13 @@ library. You can read more about them in the pcrebuild man page.
pcre_chartables.c.dist. See "Character tables" below for further information.
. It is possible to compile PCRE for use on systems that use EBCDIC as their
- default character code (as opposed to ASCII) by specifying
+ character code (as opposed to ASCII) by specifying
--enable-ebcdic
- This automatically implies --enable-rebuild-chartables (see above).
+ This automatically implies --enable-rebuild-chartables (see above). However,
+ when PCRE is built this way, it always operates in EBCDIC. It cannot support
+ both EBCDIC and UTF-8.
. It is possible to compile pcregrep to use libz and/or libbz2, in order to
read .gz and .bz2 files (respectively), by specifying one or both of
@@ -755,4 +760,4 @@ The distribution should contain the following files:
Philip Hazel
Email local part: ph10
Email domain: cam.ac.uk
-Last updated: 27 February 2009
+Last updated: 17 March 2009