author ph10 <ph10@2f5784b3-3f2a-0410-8824-cb99058d5e15> 2010-03-01 17:45:08 +0000
committer ph10 <ph10@2f5784b3-3f2a-0410-8824-cb99058d5e15> 2010-03-01 17:45:08 +0000
commit e48c6f09e052dae85940cd492b88ff5c09d16e20
tree 356712c4710ec805dc50c8e84545840f223eae14 /maint/README
parent 9b76f93a61723c197527d3d447226409b17fa27b
Update Unicode tables to Unicode version 5.2.0.
git-svn-id: svn://vcs.exim.org/pcre/code/trunk@491 2f5784b3-3f2a-0410-8824-cb99058d5e15
Diffstat (limited to 'maint/README')
-rw-r--r-- maint/README 14
1 file changed, 13 insertions, 1 deletion
diff --git a/maint/README b/maint/README
index c2e2282..aab8009 100644
--- a/maint/README
+++ b/maint/README
@@ -70,9 +70,21 @@ GenerateUtt.py scripts must be edited to add the new names. Then MultiStage2.py
can be run to generate a new version of pcre_ucd.c, and GenerateUtt.py can be
run to generate the tricky tables for inclusion in pcre_tables.c.
+If MultiStage2.py gives the error "ValueError: list.index(x): x not in list",
+the cause is usually a missing (or misspelt) name in the list of scripts. I
+couldn't find a straightforward list of scripts on the Unicode site, but
+there's a useful Wikipedia page that lists them, and notes the Unicode version
+in which they were introduced:
+
+http://en.wikipedia.org/wiki/Unicode_scripts#Table_of_Unicode_scripts
+
The ucptest program can be compiled and used to check that the new tables in
pcre_ucd.c work properly, using the data files in ucptestdata to check a number
-of test characters.
+of test characters. The source file ucptest.c must be updated whenever new
+Unicode script names are added.
+
+Note also that both the pcresyntax.3 and pcrepattern.3 man pages contain lists
+of Unicode script names.
Preparing for a PCRE release
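
The ValueError mentioned in the new README text is what Python's list.index()
raises when the value is absent. The sketch below is not taken from
MultiStage2.py; it only illustrates the failure mode and one way to make the
message name the offending script. The shortened script_names list, the
script_index helper, and the choice of "Bamum" as the missing name are all
assumptions made for illustration.

```python
# Hypothetical, shortened list of script names. The real list is maintained
# by hand in MultiStage2.py and is much longer.
script_names = ["Arabic", "Common", "Greek", "Latin"]


def script_index(name, names=script_names):
    """Return the position of a script name in the list.

    A bare names.index(name) raises ValueError without saying which list
    was searched; this wrapper re-raises with the script name in the message.
    """
    try:
        return names.index(name)
    except ValueError:
        raise ValueError(
            "script %r is missing (or misspelt) in the list of script names"
            % name
        )


if __name__ == "__main__":
    print(script_index("Greek"))
    try:
        # "Bamum" stands in for a name that has not yet been added.
        script_index("Bamum")
    except ValueError as err:
        print(err)
```

With a wrapper like this, a name that appears in the Unicode data files but
not in the hand-maintained list is reported directly, rather than having to
be tracked down from the generic list.index() message.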