path: root/ext/re
Commit message | Author | Age | Files | Lines
...
* Document that interpolating a '(??{ code })' construct in a regular | Abigail | 2010-01-06 | 1 | -3/+5
| expression also requires "use re 'eval'", just as '(?{ code })' does.
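The requirement documented by this commit can be seen in a short script. This is a minimal sketch rather than code from the commit: the variable names and the sample pattern are illustrative assumptions. It interpolates a string containing a '(??{ code })' construct into a pattern, first without and then with "use re 'eval'" in scope.

```perl
use strict;
use warnings;

# A pattern fragment held in a string. The (??{ ... }) block returns a
# sub-pattern ("b+") that is compiled and matched at that position.
my $fragment = '(??{ "b+" })';

# Without "use re 'eval'", interpolating the fragment is refused at run
# time, so the match is wrapped in eval {} to capture the error.
my $without = eval { "abbb" =~ /a$fragment/ ? "matched" : "no match" };
my $error   = $@;

my $with;
{
    use re 'eval';    # lexically scoped permission for run-time code blocks
    $with = "abbb" =~ /a$fragment/ ? "matched" : "no match";
}

print defined $without ? "without: $without\n" : "without: refused ($error)\n";
print "with:    $with\n";
```

The pragma is lexically scoped, so the sketch confines it to the smallest block that needs it. Patterns whose code blocks are written literally in the source do not need it; the restriction only concerns code that arrives through interpolation.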
* Fix a couple of typos and standardize on using 'regexp' instead of | Abigail | 2010-01-06 | 1 | -9/+9
| using a 'regex'/'regexp' mash.
* disable non-unicode case insensitive trie matching | Yves Orton | 2009-10-25 | 1 | -6/+6
|
| Also revert 8902bb05b18c9858efa90229ca1ee42b17277554
| as it merely masked one symptom of the deeper problems.
|
| Also fixes RT #69973, a segfault that was exposed by 8902bb05;
| see the ticket for further details.
|
| http://rt.perl.org/rt3//Public/Bug/Display.html?id=69973
|
| At the core of this is the problem that in Unicode matching
| a bunch of code points have case folding rules beyond just
| A-Z/a-z. Since the case folding rules are decided at runtime
| by the string, we can't use the same TRIE tables for both
| Unicode and non-Unicode matching. Until this is reconciled or
| some other solution is found, case insensitive matching only
| gets the TRIE optimisation when the pattern is Unicode.
|
| From CaseFolding.txt:
|
| 00B5; C; 03BC; # MICRO SIGN
| 00C0; C; 00E0; # LATIN CAPITAL LETTER A WITH GRAVE
| 00C1; C; 00E1; # LATIN CAPITAL LETTER A WITH ACUTE
| 00C2; C; 00E2; # LATIN CAPITAL LETTER A WITH CIRCUMFLEX
| 00C3; C; 00E3; # LATIN CAPITAL LETTER A WITH TILDE
| 00C4; C; 00E4; # LATIN CAPITAL LETTER A WITH DIAERESIS
| 00C5; C; 00E5; # LATIN CAPITAL LETTER A WITH RING ABOVE
| 00C6; C; 00E6; # LATIN CAPITAL LETTER AE
| 00C7; C; 00E7; # LATIN CAPITAL LETTER C WITH CEDILLA
| 00C8; C; 00E8; # LATIN CAPITAL LETTER E WITH GRAVE
| 00C9; C; 00E9; # LATIN CAPITAL LETTER E WITH ACUTE
| 00CA; C; 00EA; # LATIN CAPITAL LETTER E WITH CIRCUMFLEX
| 00CB; C; 00EB; # LATIN CAPITAL LETTER E WITH DIAERESIS
| 00CC; C; 00EC; # LATIN CAPITAL LETTER I WITH GRAVE
| 00CD; C; 00ED; # LATIN CAPITAL LETTER I WITH ACUTE
| 00CE; C; 00EE; # LATIN CAPITAL LETTER I WITH CIRCUMFLEX
| 00CF; C; 00EF; # LATIN CAPITAL LETTER I WITH DIAERESIS
| 00D0; C; 00F0; # LATIN CAPITAL LETTER ETH
| 00D1; C; 00F1; # LATIN CAPITAL LETTER N WITH TILDE
| 00D2; C; 00F2; # LATIN CAPITAL LETTER O WITH GRAVE
| 00D3; C; 00F3; # LATIN CAPITAL LETTER O WITH ACUTE
| 00D4; C; 00F4; # LATIN CAPITAL LETTER O WITH CIRCUMFLEX
| 00D5; C; 00F5; # LATIN CAPITAL LETTER O WITH TILDE
| 00D6; C; 00F6; # LATIN CAPITAL LETTER O WITH DIAERESIS
| 00D8; C; 00F8; # LATIN CAPITAL LETTER O WITH STROKE
| 00D9; C; 00F9; # LATIN CAPITAL LETTER U WITH GRAVE
| 00DA; C; 00FA; # LATIN CAPITAL LETTER U WITH ACUTE
| 00DB; C; 00FB; # LATIN CAPITAL LETTER U WITH CIRCUMFLEX
| 00DC; C; 00FC; # LATIN CAPITAL LETTER U WITH DIAERESIS
| 00DD; C; 00FD; # LATIN CAPITAL LETTER Y WITH ACUTE
| 00DE; C; 00FE; # LATIN CAPITAL LETTER THORN
| 00DF; F; 0073 0073; # LATIN SMALL LETTER SHARP S
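The mismatch this commit describes can be illustrated from the user side. The sketch below is an assumption-laden illustration, not the engine's TRIE code: ascii_fold() is a hypothetical stand-in for a fold table that only knows A-Z/a-z, and fc() (available from Perl 5.16, well after this commit) stands in for Unicode case folding. It checks three entries from the CaseFolding.txt excerpt above against both.

```perl
use strict;
use warnings;
use feature qw(fc unicode_strings);    # fc() needs Perl 5.16 or later

# Three entries taken from the CaseFolding.txt excerpt:
# [ character, expected full case fold ].
my @entries = (
    [ "\x{B5}", "\x{3BC}" ],    # MICRO SIGN folds to GREEK SMALL LETTER MU
    [ "\x{C0}", "\x{E0}"  ],    # A WITH GRAVE folds to a with grave
    [ "\x{DF}", "ss"      ],    # SHARP S folds to two characters
);

# Hypothetical stand-in for a fold table limited to A-Z/a-z.
sub ascii_fold {
    my ($string) = @_;
    (my $folded = $string) =~ tr/A-Z/a-z/;
    return $folded;
}

my @report;
for my $entry (@entries) {
    my ($char, $expected) = @$entry;
    push @report, {
        code_point     => sprintf('U+%04X', ord $char),
        ascii_agrees   => ascii_fold($char) eq $expected ? 1 : 0,
        unicode_agrees => fc($char) eq $expected ? 1 : 0,
    };
}

printf "%s ascii=%d unicode=%d\n",
    @{$_}{qw(code_point ascii_agrees unicode_agrees)} for @report;
```

Each entry folds differently under the two rule sets, which is why a single precomputed table cannot serve both kinds of match. The last entry also changes length when folded, a case that a one-to-one table cannot express at all.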
* Untangle File::Basename and re - avoid hard-coding the hints value. | Nicholas Clark | 2009-09-12 | 1 | -9/+0
|
* re.pm will load under miniperl, so don't hard-code its values into ParseXS | Nicholas Clark | 2009-09-12 | 1 | -5/+0
| (There's special-case code in the core to make re.pm available early as lib/re.pm, as it's needed by other modules in the XS toolchain, so we don't need a special case and attendant fragility in ExtUtils::ParseXS)
* Always load the re.xs code, unless we're running under miniperl. | Nicholas Clark | 2009-09-11 | 1 | -35/+17
| This is a lot simpler than the previous implementation, which would lazy load the XS code only if it was needed, and then jump through more hoops to try to give the user a reasonable error message if it failed. In the real world, people either have a full perl installed with the re extension, or their install is already horribly broken. Don't let code related to bootstrapping the core build pollute the installation.
* add more positive gofs GPOS tests and fix some bugs too | Yves Orton | 2009-09-10 | 1 | -1/+2
|
* Make extensions in ext run their tests from the extension's own directory. | Nicholas Clark | 2009-08-28 | 5 | -15/+5
| Inspired by, and in parts borrows from, Schwern's branch on github, but takes a slightly different approach in places. Not quite perfect yet - ext/File-Glob still runs from t, at least one FIXME needs fixing, and the changes to dual-life modules' tests need to be filtered back upstream, and possibly modified to suit their respective authors. But it works.
* Remove now-redundant references to MAN3PODS in core modules' Makefile.PLs. | Nicholas Clark | 2009-03-26 | 1 | -1/+0
|
* Change 34672 should have removed a const from re.xs. I thought I'd | Nicholas Clark | 2008-10-31 | 1 | -1/+1
| checked for warnings here - not sure whether it was me or gcc that didn't notice anything wrong. g++ certainly did.
| p4raw-id: //depot/perl@34678
* Move re::regexp_pattern to universal.c | Jerry D. Hedden | 2008-01-09 | 3 | -129/+16
| From: "Jerry D. Hedden" <jdhedden@cpan.org>
| Message-ID: <1ff86f510801081201q5c36f055re6165ebfe8876c2e@mail.gmail.com>
| p4raw-id: //depot/perl@32911
* Allow sv_setsv_flags() to copy SVt_REGEXP much like it copies | Nicholas Clark | 2008-01-06 | 1 | -0/+4
| SVt_FORMAT - just the string buffer.
| p4raw-id: //depot/perl@32863
* Replace all reads of RXf_UTF8 with RX_UTF8(). | Nicholas Clark | 2008-01-05 | 1 | -2/+3
| p4raw-id: //depot/perl@32849
* Make ext/re/t/re_funcs.t warnings clean. | Nicholas Clark | 2008-01-05 | 1 | -1/+2
| p4raw-id: //depot/perl@32842
* Wrap all dereferences of struct regexp* in macros RX_*() [and for | Nicholas Clark | 2008-01-02 | 1 | -11/+11
| regcomp.c and regexec.c RXp_* where necessary] so that in future we can maintain source compatibility when we add an extra level of dereferencing.
| p4raw-id: //depot/perl@32802
* The position of the modifier flag bits is actually encoded by a right | Nicholas Clark | 2007-12-29 | 1 | -1/+2
| shift 12 in two places, so replace that magic number with a macro RXf_PMf_STD_PMMOD_SHIFT defined adjacent to the flags it interacts with.
| p4raw-id: //depot/perl@32774
* Wrap wrapped and wraplen from struct regexp in macros RX_WRAPPED() and | Nicholas Clark | 2007-12-29 | 1 | -1/+1
| RX_WRAPLEN() to preserve source compatibility when they get moved around.
| p4raw-id: //depot/perl@32758
* Wrap all accesses to the members precomp and prelen of struct regexp in | Nicholas Clark | 2007-12-28 | 1 | -1/+1
| the macros RX_PRECOMP() and RX_PRELEN(). This will allow us to reduce the regexp storage overhead by computing them at retrieve time.
| p4raw-id: //depot/perl@32753
* ext/ consting | Robin Barker | 2007-12-22 | 1 | -1/+1
| From: "Robin Barker" <Robin.Barker@npl.co.uk>
| Message-ID: <46A0F33545E63740BC7563DE59CA9C6D09399D@exchsvr2.npl.ad.local>
| Date: Sat, 22 Dec 2007 00:39:47 -0000
| p4raw-id: //depot/perl@32703
* [patch@31739] regop.t fix for VMS | John E. Malmberg | 2007-08-22 | 1 | -0/+2
| From: "John E. Malmberg" <wb8tyw@qsl.net>
| Message-id: <46CB93A1.2090407@qsl.net>
| p4raw-link: @31739 on //depot/perl: c83e6f195f905dd4809cef6ea71ef6cef8c9f7b8
| p4raw-id: //depot/perl@31748
* /p vs (?p) | Abigail | 2007-06-30 | 1 | -0/+1
| Date: Fri, 29 Jun 2007 23:38:07 +0200
| Message-ID: <20070629213807.GA14454@abigail.nl>
| Subject: [PATCH pod/perlre.pod] Keeping up with the changes.
| From: Abigail <abigail@abigail.be>
| Date: Sat, 30 Jun 2007 01:24:36 +0200
| Message-ID: <20070629232436.GA15326@abigail.nl>
| Plus tweaks, and debug enhancements.
| p4raw-id: //depot/perl@31506
* SvRX() and SvRXOK() macros | Ævar Arnfjörð Bjarmason | 2007-06-18 | 1 | -22/+2
| From: "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
| Message-ID: <51dd1af80706172033h1908aa0ge15698204e0b79ed@mail.gmail.com>
| p4raw-id: //depot/perl@31409
* Avoid multiply defined symbols in re extension following #31341. | Craig A. Berry | 2007-06-08 | 1 | -0/+6
| p4raw-id: //depot/perl@31359
* Re: [PATCH] Callbacks for named captures (%+ and %-) | Ævar Arnfjörð Bjarmason | 2007-06-06 | 5 | -29/+30
| From: "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
| Message-ID: <51dd1af80706031324y5618d519p460da27a2e7fe712@mail.gmail.com>
| p4raw-id: //depot/perl@31341
* Fix breakages caused by #31130: | Steve Hay | 2007-05-03 | 1 | -8/+9
| report_uninit was not exported on Win32 and my_reg_numbered_buff_length had wrong prototype
| p4raw-id: //depot/perl@31132
* Silence some VC6 compiler warnings in the regex code | Steve Hay | 2007-05-03 | 1 | -2/+2
| p4raw-id: //depot/perl@31131
* FETCH/STORE/LENGTH callbacks for numbered capture variables | Ævar Arnfjörð Bjarmason | 2007-05-03 | 2 | -6/+16
| From: "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
| Message-ID: <51dd1af80705011658g1156e14cw4d2b21a8d772ed41@mail.gmail.com>
| p4raw-id: //depot/perl@31130
* Re: [PATCH] Cleanup of the regexp API | Ævar Arnfjörð Bjarmason | 2007-04-30 | 2 | -16/+18
| From: "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
| Message-ID: <51dd1af80704261922j3db0615wa86ccc4cb65b2713@mail.gmail.com>
| p4raw-id: //depot/perl@31106
* Fix a function signature missed by #31027 to silence a VC6 warning | Steve Hay | 2007-04-24 | 1 | -1/+1
| p4raw-id: //depot/perl@31056
* Re: Proposed changes and to regular expression interfaces in core | Ævar Arnfjörð Bjarmason | 2007-04-06 | 2 | -0/+3
| From: "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
| Message-ID: <51dd1af80703291552y1073bcb6r954b043eb68a4459@mail.gmail.com>
| p4raw-id: //depot/perl@30849
* Re: pmdynflags and thread safety | Yves Orton | 2007-04-04 | 1 | -1/+1
| Message-ID: <9b18b3110704031646p7ac8dbearf9e41397a5f884d8@mail.gmail.com>
| p4raw-id: //depot/perl@30841
* Resolve PL_curpm issues with (??{}) and fix corruption of match results when pattern is a qr. | Yves Orton | 2007-03-22 | 2 | -50/+26
| Message-ID: <9b18b3110703210239x540f5ad9mdb41c2ea6229ac31@mail.gmail.com>
| plus two follow-up patches (minor tweaks)
| p4raw-id: //depot/perl@30678
* Re: [PATCH] Tweaks so that miniperl.exe doesn't croak while building perl.exe | Yves Orton | 2007-03-08 | 3 | -288/+2
| Message-ID: <9b18b3110703071144t787e028s8a79fa1986624b54@mail.gmail.com>
| p4raw-id: //depot/perl@30517
* Re: New file: t/op/regexp_email.t | Yves Orton | 2007-03-01 | 1 | -2/+8
| Message-ID: <9b18b3110702280845p7860ca08taf1aead39a178aa4@mail.gmail.com>
| p4raw-id: //depot/perl@30436
* dmq points out that %+ and %- are not arrays. Fix the doc. | Rafael Garcia-Suarez | 2007-02-27 | 1 | -3/+3
| p4raw-id: //depot/perl@30417
* Rework and fix docs for %+, %- and re::Tie::Hash::NamedCapture. | Rafael Garcia-Suarez | 2007-02-27 | 1 | -29/+41
| p4raw-id: //depot/perl@30415
* Re: [perl #41565] qr// memory corruption | Yves Orton | 2007-02-26 | 1 | -2/+2
| Message-ID: <9b18b3110702260745u373980e8x19ba3afdf33bd424@mail.gmail.com>
| p4raw-id: //depot/perl@30413
* Actually, only changes to regexec.c from #30081 needed to be reverted. | Rafael Garcia-Suarez | 2007-02-19 | 2 | -0/+2
| p4raw-id: //depot/perl@30352
* Revert change #30081 at dmq's request, and mark its tests | Rafael Garcia-Suarez | 2007-02-19 | 2 | -2/+0
| as TODO. (This should restore compilation with g++)
| p4raw-link: @30081 on //depot/perl: ded05c2a789e70bb7204e21b2aa98c6d1ac776c2
| p4raw-id: //depot/perl@30351
* Re: [PATCH] Document that m//k works | Yves Orton | 2007-02-14 | 1 | -3/+3
| Message-ID: <9b18b3110702131304q370f3530j463c1a59c5ac1dfe@mail.gmail.com>
| p4raw-id: //depot/perl@30278
* add hooks for capture buffers into regex engine. | Yves Orton | 2007-02-13 | 2 | -1/+7
| Message-ID: <9b18b3110702131127q79cc6df1lb1480d9a40d15213@mail.gmail.com>
| p4raw-id: //depot/perl@30265
* Improve regex stringification code | Yves Orton | 2007-01-31 | 2 | -13/+11
| Message-ID: <9b18b3110701301458k2f6a8254hea6c6db28489c38b@mail.gmail.com>
| p4raw-id: //depot/perl@30084
* $1 in nested regex EVAL doesn't work correctly. | Yves Orton | 2007-01-31 | 2 | -0/+2
| Message-ID: <9b18b3110701301451l1443a186p39df7a6e8b65ea3c@mail.gmail.com>
| p4raw-id: //depot/perl@30081
* Make offsets support conditional | Yves Orton | 2007-01-16 | 1 | -0/+1
| Message-ID: <9b18b3110701140624v452f7684x5e9d2890805489fd@mail.gmail.com>
| p4raw-id: //depot/perl@29842
* Re: [PATCH] Change implementation of %+ to use a proper tied hash interface and add support for %- | Yves Orton | 2007-01-04 | 4 | -28/+373
| Message-ID: <9b18b3110612291245q792fe91cu69422d2b81bb4f0b@mail.gmail.com>
| p4raw-id: //depot/perl@29682
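For readers unfamiliar with the two hashes this commit concerns, here is a minimal sketch of how they differ from the user's side. It reflects the behaviour documented in perlvar for released Perls (5.10 and later), which may not match the state of the tree at this commit; the sample string and the group name "num" are illustrative assumptions.

```perl
use strict;
use warnings;

my (@first, @all);

# The same group name is deliberately used twice in the pattern.
if ("2007-01" =~ /(?<num>\d+)-(?<num>\d+)/) {
    # %+ yields one value per name: the leftmost defined group.
    @first = ($+{num});

    # %- yields an array reference per name, holding every group
    # that carries the name, in pattern order.
    @all = @{ $-{num} };
}

print "first: @first\n";
print "all:   @all\n";
```

Both are hashes keyed by group name, not arrays, which is the point a later documentation fix in this log makes explicit.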
* Avoid my_regdupe type mismatch warning after #29430 | Craig A. Berry | 2006-12-06 | 1 | -1/+1
| p4raw-id: //depot/perl@29474
* Better version of last patch, by Yves Orton. | Rafael Garcia-Suarez | 2006-12-04 | 2 | -5/+2
| Actually the regexp engine structure only needs one compilation function hook.
| p4raw-id: //depot/perl@29459
* The new regexp compilation function must be added to the engine structure. | Rafael Garcia-Suarez | 2006-12-04 | 2 | -0/+3
| p4raw-id: //depot/perl@29458
* Continue split of perl internal regexp structures from ones that are engine specific. | Yves Orton | 2006-12-01 | 2 | -6/+4
| Message-ID: <9b18b3110611301306p5cad5deal4aa55559b8c8defd@mail.gmail.com>
| p4raw-id: //depot/perl@29430
* smoke signs suppression | Jarkko Hietaniemi | 2006-11-25 | 1 | -3/+5
| Message-ID: <45687324.3040102@iki.fi>
| p4raw-id: //depot/perl@29378