| Commit message (Collapse) | Author | Age | Files | Lines |
| |
|
|
|
|
|
|
|
|
|
|
|
| |
class matching
This also alters which Unicode properties that the POSIX character
class and the Perl "special" character classes, like \w and \d map
to. At the same time it allows a number of tests for POSIX character
class behaviour to be switched from todo to non todo. Legacy testing
is still available by changing the define and setting the
PERL_TEST_LEGACY_POSIX_CC value to true.
|
|
|
|
|
|
|
|
|
|
|
| |
Commit c74340f9 added backreferences as well as the idea of a ->swap
regex pointer to keep track of the match offsets in case of backtracking.
The problem is that when Perl re-enters the regex engine to handle
utf8::SWASHNEW, the ->swap is not saved/restored/cleared so any capture
from the utf8 (Perl) code could inadvertently modify the regex match
data that caused the utf8 swash to get built.
This change should close out RT #60508
|
| |
|
|
|
|
|
|
|
|
| |
This looks to be a simple oversight. All tests pass here.
Hugo
Signed-off-by: H.Merijn Brand <h.m.brand@xs4all.nl>
|
| |
|
|
|
|
|
|
| |
(was Re: [perl #56194] Regex: (((??{1 + $^N}))) behaves differently in
5.10.0 than in blead)
Adds some basic documentation about the test structure.
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
PL_reglastparen and PL_reglastcloseparen contains a pointer are set to & rex->lastparen and & rex->lastcloseparen.
In case END the rex var is modified but PL_reglastparen and PL_reglastcloseparen is not.
Some part of the codes access PL_reglastparen while other parts use rex->lastparen.
This patch corrects this and adds 3 assertions.
I'm currently unable to proof (with a test case) that the code in case EVAL_ab is really nessesary...
Logically speaking it is nessesary but I do not know if it can cause test failures.
Also in the patch are missing regressions between 5.8 -> 5.10 and 5.10 -> 5.11. (and a test script that contains these regressions)
Message-ID: <rt-3.6.HEAD-4802-1236806863-900.56194-15-0@perl.org>
[Includes message and patch edits by committer.]
|
|
|
|
|
| |
Message-ID: <20090215185207.gsnhhqdegckws0co@horde.wizbit.be>
Message-ID: <20090215210634.u02f15b284ogc4s4@horde.wizbit.be>
|
|
|
|
|
|
| |
(Tweaked by rgs)
Message-ID: <496D3F02.6020204@khwilliamson.com>
|
|
|
|
|
|
|
| |
(reminder)
Date: 17 Nov 2007 16:29:29 +0100
Message-ID: <87r6iohova.fsf@biokovo-amd64.herceg.de>
|
| |
|
|
|
|
|
|
|
|
|
|
|
| |
into a new test file
Subject: t/op/pat.t
Message-ID: <20081118082905.GJ3172@almanda>
With tweaks: Seems the new harness is quite picky about # signs in test names,
and doesnt like SKIP and TODO to be used together.
p4raw-id: //depot/perl@34882
|
|
|
|
|
|
|
|
|
| |
#define PERL_LEGACY_UNICODE_CHARCLASS_MAPPINGS = 0
as signalled by the environment variable REAL_POSIX_CC being true.
Otherwise test are as they used to be, or TODO'ed.
p4raw-id: //depot/perl@34785
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
the regex engine)
Perlbug #60156 and #49302 (and probably others) resolve down to the problem
that the definition of \s and \w and \d and the POSIX charclasses are different
for unicode strings and for non-unicode strings. This broke the character class
logic in the regex engine. The easiest fix to make the character class logic sane
again is to define new properties which do match.
This change creates new property classes that can be used instead of the
traditional ones (it does not change the previously defined ones). If the
define in regcomp.h:
#define PERL_LEGACY_UNICODE_CHARCLASS_MAPPINGS 1
is changed to 0, then the new mappings will be used. This will fix a bunch
of bugs that are reported as TODO items in the new reg_posixcc.t test file.
p4raw-id: //depot/perl@34769
|
|
|
|
|
|
|
|
|
|
|
|
| |
[ 34580]
Subject: Re: [perl #60034] (?>) causes wrongness on long string
Message-ID: <48FFABE4.2030507@profvince.com>
[just the changes, as TODOs]
p4raw-link: @34580 on //depot/maint-5.8/perl: 7e8969c0412490151f26944243e2767cf32aca95
p4raw-id: //depot/perl@34581
p4raw-integrated: from //depot/maint-5.8/perl@34579 'edit in'
t/op/pat.t (@34396..)
|
|
|
|
|
|
| |
From: "Robin Barker" <Robin.Barker@npl.co.uk>
Message-ID: <46A0F33545E63740BC7563DE59CA9C6D093B34@exchsvr2.npl.ad.local>
p4raw-id: //depot/perl@33776
|
|
|
|
|
| |
This makes all minitests pass on my machine.
p4raw-id: //depot/perl@33768
|
|
|
| |
p4raw-id: //depot/perl@33767
|
|
|
|
|
|
| |
From: "Robin Barker" <Robin.Barker@npl.co.uk>
Message-ID: <46A0F33545E63740BC7563DE59CA9C6D093B12@exchsvr2.npl.ad.local>
p4raw-id: //depot/perl@33752
|
|
|
|
|
|
| |
based on code by Wolf-Dietrich Moeller
p4raw-link: @33685 on //depot/perl: ef0d4e17921ee3de62e9c127fad712496e6d7810
p4raw-id: //depot/perl@33686
|
|
|
|
|
|
|
|
|
| |
-- lastcloseparen is literally the index of the last paren closed
-- lastparen is index of the highest index paren that has been closed.
In nested parens, they will be completely different.
'ab'=~/(a(b))/ will have: lastparen = 2, lastcloseparen = 1
'ab'=~/(a)(b)/ will have: lastparen = lastcloseparen = 2
p4raw-id: //depot/perl@33325
|
|
|
| |
p4raw-id: //depot/perl@33317
|
|
|
| |
p4raw-id: //depot/perl@33316
|
|
|
|
|
|
|
| |
Message-ID: <20080207165330.GA25617@abigail.be>
with tweaks (one more test marked TODO)
p4raw-id: //depot/perl@33313
|
|
|
|
|
| |
Some TODOs still remain. Bounties?
p4raw-id: //depot/perl@32940
|
|
|
|
|
| |
(Certain regexps could SEGV if cloned).
p4raw-id: //depot/perl@32932
|
|
|
|
|
| |
a new thread, to test ithread's cloning, particularly of regexps.
p4raw-id: //depot/perl@32931
|
|
|
|
|
|
|
|
| |
inside of (?{...}) blocks as reported by Moritz Lenz in
Subject: Bugs in extended regexp features
Message-ID: <477FACED.4000505@casella.verplant.org>
p4raw-id: //depot/perl@32857
|
|
|
| |
p4raw-id: //depot/perl@32761
|
|
|
| |
p4raw-id: //depot/perl@32749
|
|
|
|
|
|
| |
so don't upgrade them to ORANGE before attaching qr magic.
(And don't stop using qr magic once regexps become first class)
p4raw-id: //depot/perl@32748
|
|
|
| |
p4raw-id: //depot/perl@32747
|
|
|
|
|
| |
the rest of the test.
p4raw-id: //depot/perl@32746
|
|
|
|
|
|
|
| |
Related to [perl #36207] among others
Message-ID: <9b18b3110712170621h41de2c76k331971e3660abcb0@mail.gmail.com>
p4raw-id: //depot/perl@32628
|
|
|
|
|
|
|
| |
pattern
Message-Id: <200709221427.30425@bloodgate.com>
p4raw-id: //depot/perl@31961
|
|
|
|
|
| |
when shrinking an SV, shrink the sv_len_utf8 cache too!
p4raw-id: //depot/perl@31867
|
|
|
|
|
|
|
|
|
|
| |
ensure proper scope cleanup.
Fix and test for issue raised in:
Subject: Very strange interaction between regex and lexical array in blead
Message-ID: <20070818015537.0088db31@r2d2>
p4raw-id: //depot/perl@31733
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
http://www.nntp.perl.org/group/perl.perl5.porters/2007/06/msg125667.html
by reverting part of change #29354.
Unfortunately match vars after a /g match in scalar context will be
unsafe (again) after this, but such matches on long strings won't be
as diabolically slow.
Question: why does the new test in t/op/pat.t pass, but the same test
in t/op/reg_unsafe.t fail? (Latter is TODO for now)
p4raw-link: @29354 on //depot/perl: 58e23c8d7d24dd08c87b5d56819ad45527176c15
p4raw-id: //depot/perl@31451
|
|
|
|
|
|
| |
From: "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
Message-ID: <51dd1af80706031324y5618d519p460da27a2e7fe712@mail.gmail.com>
p4raw-id: //depot/perl@31341
|
|
|
|
|
| |
Message-ID: <9b18b3110706030821u39460f96ic342dda8ba9fcdf7@mail.gmail.com>
p4raw-id: //depot/perl@31328
|
|
|
|
|
|
|
| |
regex engine.
Message-ID: <9b18b3110704270709y50ef652ci436b3bb29abca275@mail.gmail.com>
p4raw-id: //depot/perl@31102
|
|
|
|
|
|
|
| |
regex engine.
Message-ID: <9b18b3110704240746u461e4bdcl208ef7d7f9c5ef64@mail.gmail.com>
p4raw-id: //depot/perl@31081
|
|
|
|
|
|
|
| |
PCRE and unicode tr18
Message-ID: <9b18b3110704221434g43457742p28cab00289f83639@mail.gmail.com>
p4raw-id: //depot/perl@31026
|
|
|
| |
p4raw-id: //depot/perl@30882
|
|
|
|
|
|
| |
From: "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
Message-ID: <51dd1af80704091502m19130c28y29e6e656516fabfd@mail.gmail.com>
p4raw-id: //depot/perl@30881
|
|
|
|
|
|
|
|
| |
By setting the outer saved $^R to the current $^R just at the
end of a successful match, and ensuring that that the savestack
doesn't get popped beforehand, the code is simplified and fixes a
bug.
p4raw-id: //depot/perl@30818
|
|
|
|
|
|
|
|
|
| |
pattern is a qr.
Message-ID: <9b18b3110703210239x540f5ad9mdb41c2ea6229ac31@mail.gmail.com>
plus two follow-up patches (minor tweaks)
p4raw-id: //depot/perl@30678
|
|
|
|
|
| |
Message-ID: <9b18b3110703191740m6bf21942p6521f3016ed8092f@mail.gmail.com>
p4raw-id: //depot/perl@30647
|