| Commit message (Collapse) | Author | Age | Files | Lines |
... | |
|
|
|
| |
expression also requires "use re 'eval'", just as '(?{ code })' does.
|
|
|
|
| |
using a 'regex'/'regexp' mash.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Also revert 8902bb05b18c9858efa90229ca1ee42b17277554 as it merely
masked one symptom of the deeper problems.
Also fixes RT #69973, a segfault that was exposed by
8902bb05; see the ticket for further details.
http://rt.perl.org/rt3//Public/Bug/Display.html?id=69973
At the core of this is the problem that in Unicode matching a bunch
of code points have case folding rules beyond just A-Z/a-z. Since
the case folding rules are decided at runtime by the string, we can't
use the same TRIE tables for both Unicode/non-Unicode matching.
Until this is reconciled or some other solution is found, case-insensitive
matching only gets the TRIE optimisation when the pattern is Unicode.
From CaseFolding.txt:
00B5; C; 03BC; # MICRO SIGN
00C0; C; 00E0; # LATIN CAPITAL LETTER A WITH GRAVE
00C1; C; 00E1; # LATIN CAPITAL LETTER A WITH ACUTE
00C2; C; 00E2; # LATIN CAPITAL LETTER A WITH CIRCUMFLEX
00C3; C; 00E3; # LATIN CAPITAL LETTER A WITH TILDE
00C4; C; 00E4; # LATIN CAPITAL LETTER A WITH DIAERESIS
00C5; C; 00E5; # LATIN CAPITAL LETTER A WITH RING ABOVE
00C6; C; 00E6; # LATIN CAPITAL LETTER AE
00C7; C; 00E7; # LATIN CAPITAL LETTER C WITH CEDILLA
00C8; C; 00E8; # LATIN CAPITAL LETTER E WITH GRAVE
00C9; C; 00E9; # LATIN CAPITAL LETTER E WITH ACUTE
00CA; C; 00EA; # LATIN CAPITAL LETTER E WITH CIRCUMFLEX
00CB; C; 00EB; # LATIN CAPITAL LETTER E WITH DIAERESIS
00CC; C; 00EC; # LATIN CAPITAL LETTER I WITH GRAVE
00CD; C; 00ED; # LATIN CAPITAL LETTER I WITH ACUTE
00CE; C; 00EE; # LATIN CAPITAL LETTER I WITH CIRCUMFLEX
00CF; C; 00EF; # LATIN CAPITAL LETTER I WITH DIAERESIS
00D0; C; 00F0; # LATIN CAPITAL LETTER ETH
00D1; C; 00F1; # LATIN CAPITAL LETTER N WITH TILDE
00D2; C; 00F2; # LATIN CAPITAL LETTER O WITH GRAVE
00D3; C; 00F3; # LATIN CAPITAL LETTER O WITH ACUTE
00D4; C; 00F4; # LATIN CAPITAL LETTER O WITH CIRCUMFLEX
00D5; C; 00F5; # LATIN CAPITAL LETTER O WITH TILDE
00D6; C; 00F6; # LATIN CAPITAL LETTER O WITH DIAERESIS
00D8; C; 00F8; # LATIN CAPITAL LETTER O WITH STROKE
00D9; C; 00F9; # LATIN CAPITAL LETTER U WITH GRAVE
00DA; C; 00FA; # LATIN CAPITAL LETTER U WITH ACUTE
00DB; C; 00FB; # LATIN CAPITAL LETTER U WITH CIRCUMFLEX
00DC; C; 00FC; # LATIN CAPITAL LETTER U WITH DIAERESIS
00DD; C; 00FD; # LATIN CAPITAL LETTER Y WITH ACUTE
00DE; C; 00FE; # LATIN CAPITAL LETTER THORN
00DF; F; 0073 0073; # LATIN SMALL LETTER SHARP S
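The difference between the two folding regimes can be illustrated with a small sketch. It is written in Python rather than Perl or C purely for illustration, and it is not the regex engine's code: `ascii_fold` and `unicode_fold` are hypothetical helper names, and Python's built-in `str.casefold()` stands in for the full Unicode folding rules listed above. The point is only that the same code points produce different lookup keys under the two rule sets, so one precomputed table cannot serve both.

```python
# Illustrative sketch only: helper names are hypothetical, not Perl internals.

def ascii_fold(s: str) -> str:
    """Fold only A-Z to a-z; every other code point is left unchanged."""
    return "".join(chr(ord(c) + 32) if "A" <= c <= "Z" else c for c in s)


def unicode_fold(s: str) -> str:
    """Apply full Unicode case folding via Python's str.casefold()."""
    return s.casefold()


# Three entries taken from the CaseFolding.txt excerpt above.
SAMPLES = {
    "\u00B5": "\u03BC",  # MICRO SIGN folds to GREEK SMALL LETTER MU
    "\u00C0": "\u00E0",  # LATIN CAPITAL LETTER A WITH GRAVE
    "\u00DF": "ss",      # LATIN SMALL LETTER SHARP S (full folding, two code points)
}

for source, folded in SAMPLES.items():
    # Under Unicode rules the code point changes; under ASCII-only rules it does not.
    assert unicode_fold(source) == folded
    assert ascii_fold(source) == source

# A table keyed on folded text therefore differs between the two regimes.
words = ["\u00C0B", "ab"]
ascii_keys = {ascii_fold(w) for w in words}
unicode_keys = {unicode_fold(w) for w in words}
assert ascii_keys != unicode_keys
```

A table built from `ascii_keys` would miss matches that the Unicode rules require, which is the mismatch the commit message describes.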
|
| |
|
|
|
|
|
|
| |
(There's special-case code in the core to make re.pm available early as
lib/re.pm, as it's needed by other modules in the XS toolchain, so we don't need
a special case and attendant fragility in ExtUtils::ParseXS)
|
|
|
|
|
|
|
|
|
|
| |
This is a lot simpler than the previous implementation, which would lazy load
the XS code only if it was needed, and then jump through more hoops to try to
give the user a reasonable error message if it failed.
In the real world, people either have a full perl installed with the re
extension, or their install is already horribly broken. Don't let code related
to bootstrapping the core build pollute the installation.
|
| |
|
|
|
|
|
|
|
|
|
|
|
| |
Inspired by, and in parts borrows from, Schwern's branch on github, but takes a
slightly different approach in places.
Not quite perfect yet - ext/File-Glob still runs from t, at least one FIXME
needs fixing, and the changes to dual-life modules' tests need to be filtered
back upstream, and possibly modified to suit their respective authors.
But it works.
|
| |
|
|
|
|
|
|
| |
checked for warnings here - not sure whether it was me or gcc that
didn't notice anything wrong. g++ certainly did.
p4raw-id: //depot/perl@34678
|
|
|
|
|
|
| |
From: "Jerry D. Hedden" <jdhedden@cpan.org>
Message-ID: <1ff86f510801081201q5c36f055re6165ebfe8876c2e@mail.gmail.com>
p4raw-id: //depot/perl@32911
|
|
|
|
|
| |
SVt_FORMAT - just the string buffer.
p4raw-id: //depot/perl@32863
|
|
|
| |
p4raw-id: //depot/perl@32849
|
|
|
| |
p4raw-id: //depot/perl@32842
|
|
|
|
|
|
|
| |
regcomp.c and regexec.c RXp_* where necessary] so that in future we
can maintain source compatibility when we add an extra level of
dereferencing.
p4raw-id: //depot/perl@32802
|
|
|
|
|
|
|
| |
shift 12 in two places, so replace that magic number with a macro
RXf_PMf_STD_PMMOD_SHIFT defined adjacent to the flags it interacts
with.
p4raw-id: //depot/perl@32774
|
|
|
|
|
|
| |
RX_WRAPLEN() to preserve source compatibility when they get moved
around.
p4raw-id: //depot/perl@32758
|
|
|
|
|
|
| |
the macros RX_PRECOMP() and RX_PRELEN(). This will allow us to reduce
the regexp storage overhead by computing them at retrieve time.
p4raw-id: //depot/perl@32753
|
|
|
|
|
|
|
| |
From: "Robin Barker" <Robin.Barker@npl.co.uk>
Message-ID: <46A0F33545E63740BC7563DE59CA9C6D09399D@exchsvr2.npl.ad.local>
Date: Sat, 22 Dec 2007 00:39:47 -0000
p4raw-id: //depot/perl@32703
|
|
|
|
|
|
|
| |
From: "John E. Malmberg" <wb8tyw@qsl.net>
Message-id: <46CB93A1.2090407@qsl.net>
p4raw-link: @31739 on //depot/perl: c83e6f195f905dd4809cef6ea71ef6cef8c9f7b8
p4raw-id: //depot/perl@31748
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Date: Fri, 29 Jun 2007 23:38:07 +0200
Message-ID: <20070629213807.GA14454@abigail.nl>
Subject: [PATCH pod/perlre.pod] Keeping up with the changes.
From: Abigail <abigail@abigail.be>
Date: Sat, 30 Jun 2007 01:24:36 +0200
Message-ID: <20070629232436.GA15326@abigail.nl>
Plus tweaks, and debug enhancements.
p4raw-id: //depot/perl@31506
|
|
|
|
|
|
| |
From: "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
Message-ID: <51dd1af80706172033h1908aa0ge15698204e0b79ed@mail.gmail.com>
p4raw-id: //depot/perl@31409
|
|
|
| |
p4raw-id: //depot/perl@31359
|
|
|
|
|
|
| |
From: "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
Message-ID: <51dd1af80706031324y5618d519p460da27a2e7fe712@mail.gmail.com>
p4raw-id: //depot/perl@31341
|
|
|
|
|
|
| |
report_uninit was not exported on Win32 and
my_reg_numbered_buff_length had the wrong prototype
p4raw-id: //depot/perl@31132
|
|
|
| |
p4raw-id: //depot/perl@31131
|
|
|
|
|
|
| |
From: "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
Message-ID: <51dd1af80705011658g1156e14cw4d2b21a8d772ed41@mail.gmail.com>
p4raw-id: //depot/perl@31130
|
|
|
|
|
|
| |
From: "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
Message-ID: <51dd1af80704261922j3db0615wa86ccc4cb65b2713@mail.gmail.com>
p4raw-id: //depot/perl@31106
|
|
|
| |
p4raw-id: //depot/perl@31056
|
|
|
|
|
|
| |
From: "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
Message-ID: <51dd1af80703291552y1073bcb6r954b043eb68a4459@mail.gmail.com>
p4raw-id: //depot/perl@30849
|
|
|
|
|
| |
Message-ID: <9b18b3110704031646p7ac8dbearf9e41397a5f884d8@mail.gmail.com>
p4raw-id: //depot/perl@30841
|
|
|
|
|
|
|
|
|
| |
pattern is a qr.
Message-ID: <9b18b3110703210239x540f5ad9mdb41c2ea6229ac31@mail.gmail.com>
plus two follow-up patches (minor tweaks)
p4raw-id: //depot/perl@30678
|
|
|
|
|
| |
Message-ID: <9b18b3110703071144t787e028s8a79fa1986624b54@mail.gmail.com>
p4raw-id: //depot/perl@30517
|
|
|
|
|
| |
Message-ID: <9b18b3110702280845p7860ca08taf1aead39a178aa4@mail.gmail.com>
p4raw-id: //depot/perl@30436
|
|
|
| |
p4raw-id: //depot/perl@30417
|
|
|
| |
p4raw-id: //depot/perl@30415
|
|
|
|
|
| |
Message-ID: <9b18b3110702260745u373980e8x19ba3afdf33bd424@mail.gmail.com>
p4raw-id: //depot/perl@30413
|
|
|
| |
p4raw-id: //depot/perl@30352
|
|
|
|
|
|
| |
as TODO. (This should restore compilation with g++)
p4raw-link: @30081 on //depot/perl: ded05c2a789e70bb7204e21b2aa98c6d1ac776c2
p4raw-id: //depot/perl@30351
|
|
|
|
|
| |
Message-ID: <9b18b3110702131304q370f3530j463c1a59c5ac1dfe@mail.gmail.com>
p4raw-id: //depot/perl@30278
|
|
|
|
|
| |
Message-ID: <9b18b3110702131127q79cc6df1lb1480d9a40d15213@mail.gmail.com>
p4raw-id: //depot/perl@30265
|
|
|
|
|
| |
Message-ID: <9b18b3110701301458k2f6a8254hea6c6db28489c38b@mail.gmail.com>
p4raw-id: //depot/perl@30084
|
|
|
|
|
| |
Message-ID: <9b18b3110701301451l1443a186p39df7a6e8b65ea3c@mail.gmail.com>
p4raw-id: //depot/perl@30081
|
|
|
|
|
| |
Message-ID: <9b18b3110701140624v452f7684x5e9d2890805489fd@mail.gmail.com>
p4raw-id: //depot/perl@29842
|
|
|
|
|
|
|
| |
and add support for %-
Message-ID: <9b18b3110612291245q792fe91cu69422d2b81bb4f0b@mail.gmail.com>
p4raw-id: //depot/perl@29682
|
|
|
| |
p4raw-id: //depot/perl@29474
|
|
|
|
|
|
| |
Actually the regexp engine structure only needs
one compilation function hook.
p4raw-id: //depot/perl@29459
|
|
|
| |
p4raw-id: //depot/perl@29458
|
|
|
|
|
|
|
| |
specific.
Message-ID: <9b18b3110611301306p5cad5deal4aa55559b8c8defd@mail.gmail.com>
p4raw-id: //depot/perl@29430
|
|
|
|
|
| |
Message-ID: <45687324.3040102@iki.fi>
p4raw-id: //depot/perl@29378
|