diff options
author | ph10 <ph10@2f5784b3-3f2a-0410-8824-cb99058d5e15> | 2009-03-17 21:16:01 +0000 |
---|---|---|
committer | ph10 <ph10@2f5784b3-3f2a-0410-8824-cb99058d5e15> | 2009-03-17 21:16:01 +0000 |
commit | f6cacbfecd16ac9d19ee6fe6c26c107ee4846b1c (patch) | |
tree | fb3166c26b8f211814f8302719f5bd93842aa575 /README | |
parent | 8d719b57aa95b8a2baebd8fd13c6b80c2da84680 (diff) | |
download | pcre-f6cacbfecd16ac9d19ee6fe6c26c107ee4846b1c.tar.gz |
Add support for UTF-8 in EBCDIC environments.
git-svn-id: svn://vcs.exim.org/pcre/code/trunk@391 2f5784b3-3f2a-0410-8824-cb99058d5e15
Diffstat (limited to 'README')
-rw-r--r-- | README | 19 |
1 files changed, 12 insertions, 7 deletions
@@ -161,10 +161,13 @@ library. You can read more about them in the pcrebuild man page. it will try to find a C++ compiler and C++ header files, and if it succeeds, it will try to build the C++ wrapper. -. If you want to make use of the support for UTF-8 character strings in PCRE, - you must add --enable-utf8 to the "configure" command. Without it, the code - for handling UTF-8 is not included in the library. (Even when included, it - still has to be enabled by an option at run time.) +. If you want to make use of the support for UTF-8 Unicode character strings in + PCRE, you must add --enable-utf8 to the "configure" command. Without it, the + code for handling UTF-8 is not included in the library. Even when included, + it still has to be enabled by an option at run time. When PCRE is compiled + with this option, its input can only either be ASCII or UTF-8, even when + running on EBCDIC platforms. It is not possible to use both --enable-utf8 and + --enable-ebcdic at the same time. . If, in addition to support for UTF-8 character strings, you want to include support for the \P, \p, and \X sequences that recognize Unicode character @@ -255,11 +258,13 @@ library. You can read more about them in the pcrebuild man page. pcre_chartables.c.dist. See "Character tables" below for further information. . It is possible to compile PCRE for use on systems that use EBCDIC as their - default character code (as opposed to ASCII) by specifying + character code (as opposed to ASCII) by specifying --enable-ebcdic - This automatically implies --enable-rebuild-chartables (see above). + This automatically implies --enable-rebuild-chartables (see above). However, + when PCRE is built this way, it always operates in EBCDIC. It cannot support + both EBCDIC and UTF-8. . It is possible to compile pcregrep to use libz and/or libbz2, in order to read .gz and .bz2 files (respectively), by specifying one or both of @@ -755,4 +760,4 @@ The distribution should contain the following files: Philip Hazel Email local part: ph10 Email domain: cam.ac.uk -Last updated: 27 February 2009 +Last updated: 17 March 2009 |