| Commit message | Author | Age | Files | Lines |
|
Often, when DEBUG is set, an uninitialized variable message gets printed
as well. This fixes that.
|
This patch causes utf8_heavy.pl to know about the new data structure that
mktables now generates to indicate what substitute table to use for one that
has different results under /i matching.
Note that regcomp.c, as of this commit, does not generate the names that would
exercise this code.
|
In L<perlunicode/"User-Defined Character Properties">, it says you can
create custom properties by defining subroutines whose names begin with
"In" or "Is". However, perl doesn't actually enforce that naming
restriction, so \p{foo::bar} will call foo::Bar() if it exists.
This commit finally enforces this convention. Note that this broke a
number of existing tests for properties, since they didn't always use an
Is/In prefix.
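
The convention being enforced can be illustrated with a small language-neutral sketch. It is written in Python rather than Perl, and the helper name `is_user_property_name` is hypothetical. It is not the check that utf8_heavy.pl performs, only an illustration of the rule "the final component of the name must begin with In or Is", assuming the prefix is matched case-sensitively.

```python
import re

# Hypothetical helper (not from utf8_heavy.pl): accept a property name only
# if its last package component starts with "In" or "Is".
_PREFIX = re.compile(r"^(?:In|Is)")


def is_user_property_name(name: str) -> bool:
    # "foo::IsBar" -> inspect "IsBar"; an unqualified "IsBar" is inspected as-is.
    last_component = name.split("::")[-1]
    return bool(_PREFIX.match(last_component))
```

Under this rule a name such as `foo::bar` would be rejected, while `foo::IsBar` would be accepted.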
|
If things aren't just so, it could be that utf8_heavy calls something
which requires a pattern, such as split or just a pattern match that
ends up calling utf8_heavy again, ad infinitum. When this happens,
memory gets eaten up and the machine grinds to a halt, likely requiring a
manual forced reboot.
To prevent this undesirable situation, utf8_heavy now stacks all its
calls in progress, and if any is a repeat, panics.
|
It's possible for this to be called during the compilation phase of Perl
by miniperl before the Unicode tables have been built. This patch
checks if dynamic loading is available, and if not evals the require
needed to gain access to the tables. If it succeeds, the tables have
been built; if it doesn't, instead of dying, just return empty tables,
as currently the things being built don't require information outside
the ASCII range, which is hard-coded into Perl without needing the
tables.
In the future, that may not be the case, and then likely the tables will
have to be shipped with Perl, and make regen would be done to rebuild
them.
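
The "try to load the tables, and fall back to empty ones instead of dying" part of this change can be sketched in Python. This is an analogy only: it does not model the dynamic-loading check, and the function name `load_tables` and the `TABLES` attribute are hypothetical.

```python
import importlib


def load_tables(module_name):
    """Return the module's TABLES mapping, or {} if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # Tables are not built yet. Return empty tables rather than failing;
        # this is only safe while callers need nothing beyond the ASCII range.
        return {}
    return getattr(module, "TABLES", {})
```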
|
This fixes a bug where a spurious error was reported from utf8_heavy.
This was found by Salvador Ortiz Garcia, who suggested localizing $@;
I merely added $!.
|
Naive implementation.
p4raw-id: //depot/perl@33088
|
to avoid magic leaking and smoke failures under
utf-8 locales
p4raw-id: //depot/perl@31450
|
From: Jonathan Steinert (via RT) <perlbug-followup@perl.org>
Message-ID: <rt-3.6.HEAD-30557-1178021932-1416.42839-75-0@perl.org>
p4raw-id: //depot/perl@31119
|
So need to propagate errors with $@.
p4raw-id: //depot/perl@29723
|
Subject: Re: XS-assisted SWASHGET (esp. for t/uni/class.t speedup)
Message-Id: <20051204162508.D726.BQW10602@nifty.com>
p4raw-id: //depot/perl@26255
|
Message-Id: <20051127170016.A786.BQW10602@nifty.com>
p4raw-id: //depot/perl@26229
|
Message-Id: <20051123175603.FFD5.BQW10602@nifty.com>
And:
Message-Id: <20051123202935.4D9D.BQW10602@nifty.com>
with some nits to use U8 instead of char more consistently
p4raw-id: //depot/perl@26199
|
at build by mktables, stored in lib/unicore/PVA.pl
p4raw-id: //depot/perl@22881
|
p4raw-id: //depot/perl@22880
|
Date: Thu, 22 Apr 2004 14:31:30 -0400 (EDT)
Message-ID: <Pine.LNX.4.44.0404221429040.10466-101000@perlmonk.org>
Date: Mon, 26 Apr 2004 12:37:21 -0400 (EDT)
Message-ID: <Pine.LNX.4.44.0404261222320.7154-400000@perlmonk.org>
p4raw-id: //depot/perl@22744
|
Message-ID: <Pine.LNX.4.44.0404122011160.3038-200000@perlmonk.org>
p4raw-id: //depot/perl@22693
|
which will cause utf8_heavy.pl noise (reported by Daniel Yacob,
analysis and fix from SADAHIRO Tomoyuki)
p4raw-id: //depot/perl@19835
|
changes 18219, 18236, 18242-3, 18247-8,
18253-5, 18257, 18273-6
p4raw-id: //depot/perl@18280
p4raw-branched: from //depot/maint-5.8/perl@18279 'branch in'
t/op/lc_user.t
p4raw-integrated: from //depot/maint-5.8/perl@18279 'copy in'
lib/File/Copy.pm (@17645..) lib/utf8_heavy.pl pod/perlsec.pod
(@18080..) hints/irix_6.sh (@18173..) t/uni/tr_utf8.t
(@18197..) pod/perlunicode.pod (@18242..) t/op/pat.t (@18248..)
t/op/split.t (@18274..) 'edit in' pod/perlguts.pod (@18242..)
'merge in' pp.c (@18126..) MANIFEST (@18234..)
p4raw-integrated: from //depot/maint-5.8/perl@18254 'merge in'
pod/perldiag.pod (@18234..)
|
Message-ID: <20021016155051.GB268437@lyta.hut.fi>
p4raw-id: //depot/perl@18035
|
Message-ID: <20020920142245.GG280265@lyta.hut.fi>
p4raw-id: //depot/perl@17933
|
"fix" for utf8_heavy.pl, lexical UTF8 var crashed
in test 92 of run/fresh_perl.t on MacOS
(as pudge rightfully points out, this is voodoo
programming at its best; the real bug is somewhere else,
now we just happened to shake the chicken the right way)
p4raw-id: //depot/perl@16355
p4raw-integrated: from //depot/macperl@16354 'merge in'
lib/utf8_heavy.pl (@16123..)
|
Message-Id: <539D985A-5D1A-11D6-BB19-00039301D480@dan.co.jp>
(plus a respective perlunicode tweak)
p4raw-id: //depot/perl@16354
|
p4raw-id: //depot/perl@16054
|
removed, noticed by Dan Kogai.
p4raw-id: //depot/perl@16012
|
not good because the script context is not shown.
p4raw-id: //depot/perl@15581
|
and Is/*.pl to lib/*.pl, remove In.pl and Is.pl, introduce
Canonical.pl and Exact.pl.
p4raw-id: //depot/perl@14294
|
p4raw-id: //depot/perl@14272
|
documentation not yet updated.
p4raw-id: //depot/perl@14254
|
In* and Is* names).
p4raw-id: //depot/perl@14242
|
Message-Id: <200112160355.fBG3t1t84835@ventrue.corp.yahoo.com>
p4raw-id: //depot/perl@13710
|
p4raw-id: //depot/perl@13706
|
implement Category=, Script=, Block=
(these are based on an upcoming update of TR#18)
Fix a bug where we got two In categories named "old italic",
and another where shortcut for the Is categories wasn't taken.
p4raw-id: //depot/perl@12500
|
more complex case conversions.
p4raw-id: //depot/perl@12450
|
- Cleaner.
- Faster: 15-20 seconds as opposed to several minutes.
- More dynamic: the names of the various categories
such as the linebreak ones are dynamic, not static.
- Is.pl: long names for the general category properties
are now available.
- Ranges (<... ,First>, <..., Last>) from the general
categories work now.
- No more mktables.PL, because mktables.PL is not,
and never has been, run to create mktables.
- syllables.txt and Is/Syl*.pl removed: non-standard
(not part of Unicode), and the whole concept is
being reworked (http://syllabary.sourceforge.net/),
the old way wouldn't even work with the new Syllables.txt
(it would result in 1000+ new categories)
p4raw-id: //depot/perl@12427
|
p4raw-id: //depot/perl@12373
|
p4raw-id: //depot/perl@12335
|
p4raw-id: //depot/perl@12334
|
only IsL&) and Inherited (negative lookahead good);
add tests for Common, Inherited, and L&.
p4raw-id: //depot/perl@12320
|
(The Unicode standard uses L& quite often.)
p4raw-id: //depot/perl@12319
|
p4raw-id: //depot/perl@12286
|
Unicode script/block properties.
p4raw-id: //depot/perl@12281
|
p4raw-id: //depot/perl@12278
|
p4raw-id: //depot/perl@12272
|
between the property definition and the curlies; now can
invert the property by having a caret between the open
curly and the property.
p4raw-id: //depot/perl@12269
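
As a rough illustration of the caret form only, the text between the curlies could be split into a property name and an inversion flag as below. This is a Python sketch; the function name `parse_property_spec` is hypothetical, and it does not reflect how the Perl regex compiler actually parses `\p{...}`.

```python
def parse_property_spec(spec):
    """Split the text found between the curlies into (name, inverted)."""
    # A caret directly after the open curly requests the inverted property.
    inverted = spec.startswith("^")
    name = spec[1:] if inverted else spec
    return name, inverted
```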
|
case doesn't matter, and any space or dash can be
matched by any space, dash, underbar, or empty.
(may be going too far on leniency)
p4raw-id: //depot/perl@12264
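
One common way to get this kind of lenient name matching is to normalise both names before comparing them. The Python sketch below is an illustration under that assumption, not the code from this commit; `loose_key` and `loose_equal` are hypothetical names. Because it strips separators from both sides, it is symmetric, which may be slightly more lenient than the rule stated above.

```python
import re


def loose_key(name):
    # Drop spaces, underscores, and dashes, then ignore case.
    return re.sub(r"[\s_-]", "", name).lower()


def loose_equal(a, b):
    # Two names match loosely when their normalised keys are identical.
    return loose_key(a) == loose_key(b)
```

With this normalisation, "Old Italic", "old_italic", "Old-Italic", and "OLDITALIC" all compare equal.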
|
conflicts between core lib/unicode and Unicode::
files in case-ignoring filesystems.
p4raw-id: //depot/perl@11623
|