summaryrefslogtreecommitdiff
path: root/regcomp.c
Commit message (Collapse)AuthorAgeFilesLines
...
* leading (.*) should not be implicitly ^'d Jeff Pinyan2002-01-071-1/+1
| | | | | Message-ID: <Pine.GSO.4.21.0201061532400.27018-100000@crusoe.crusoe.net> p4raw-id: //depot/perl@14115
* More regex and utf8 debug dumping.Jarkko Hietaniemi2002-01-071-3/+7
| | | p4raw-id: //depot/perl@14114
* Unused variable.Jarkko Hietaniemi2002-01-071-1/+0
| | | p4raw-id: //depot/perl@14113
* Finish up (ha!) the Unicode case folding;Jarkko Hietaniemi2002-01-051-16/+54
| | | | | enhance regex dumping code. p4raw-id: //depot/perl@14096
* Ooops. Retract the regex parts of #14090.Jarkko Hietaniemi2002-01-051-28/+9
| | | p4raw-id: //depot/perl@14091
* [REPATCH] Re: [PATCH] Re: socketpair blip on unicos/mk, tooNicholas Clark2002-01-051-9/+28
| | | | | Message-ID: <20020105181013.I300@Bagpuss.unfortu.net> p4raw-id: //depot/perl@14090
* Do not add the first characters of multicharacter foldingsJarkko Hietaniemi2002-01-031-3/+4
| | | | | into character classes. p4raw-id: //depot/perl@14048
* Allow emitting the foldcased version of "\x{df}", "ss"Jarkko Hietaniemi2002-01-031-2/+2
| | | | | (among others, another example would be \x{fb00} -> "ff") p4raw-id: //depot/perl@14046
* Now that to_utf8_fold() was fixed.Jarkko Hietaniemi2002-01-011-3/+1
| | | p4raw-id: //depot/perl@14004
* Unused variable.Jarkko Hietaniemi2001-12-311-1/+0
| | | p4raw-id: //depot/perl@13988
* Add some comments to the recent Unicode case-folding saga.Jarkko Hietaniemi2001-12-311-2/+21
| | | p4raw-id: //depot/perl@13985
* Delay folding until necessary; start of handlingJarkko Hietaniemi2001-12-311-11/+37
| | | | | folding into several characters. p4raw-id: //depot/perl@13969
* Fix all the I case foldings as per CaseFold.txt.Jarkko Hietaniemi2001-12-301-1/+2
| | | p4raw-id: //depot/perl@13964
* Fix all the C(ommon) case foldings as per CaseFold.txt.Jarkko Hietaniemi2001-12-301-0/+11
| | | p4raw-id: //depot/perl@13963
* Must cast constants if they can be quads.Jarkko Hietaniemi2001-12-241-3/+3
| | | p4raw-id: //depot/perl@13872
* The funky final sigma casefolding.Jarkko Hietaniemi2001-12-231-1/+13
| | | p4raw-id: //depot/perl@13866
* Fix encoding pragma.Jarkko Hietaniemi2001-12-231-13/+19
| | | p4raw-id: //depot/perl@13859
* and [BUG] \X and \C fixed, \X still dorked Jeffrey Friedl2001-12-221-0/+2
| | | | | | | Message-Id: <200112220718.fBM7IHG25075@ventrue.corp.yahoo.com> (partially applied, most of the new tests need to be rethought) p4raw-id: //depot/perl@13857
* Unicode casefolding fixes.Jarkko Hietaniemi2001-12-221-12/+10
| | | p4raw-id: //depot/perl@13843
* Unadorned numbers evil.Jarkko Hietaniemi2001-12-131-1/+1
| | | p4raw-id: //depot/perl@13672
* Fixes for casting problems detected on a SuSE 7.2 Itanium,Jarkko Hietaniemi2001-12-121-1/+1
| | | | | | mostly gcc -Wall complaining about the argument of %p not being a void *. p4raw-id: //depot/perl@13659
* Apply NetBSD patch-ae: another gcc sparc64 bug.Jarkko Hietaniemi2001-12-091-0/+28
| | | p4raw-id: //depot/perl@13566
* memset() is cheaper than a loop of 256 bit-a-a-timesNicholas Clark2001-11-241-7/+3
| | | | | Message-ID: <20011124163042.R37621@plum.flirble.org> p4raw-id: //depot/perl@13236
* Re: regex oddness with \Z and $&Hugo van der Sanden2001-11-231-0/+1
| | | | | Message-Id: <200111231515.fANFFOR10341@crypt.compulink.co.uk> p4raw-id: //depot/perl@13198
* Retract #13048: that introduced a malloc() (through savepvn())Jarkko Hietaniemi2001-11-181-3/+0
| | | | | to *every* m//, which killed the performance quite badly. p4raw-id: //depot/perl@13079
* Re: [PATCH regcomp.[ch], regexp.h, pp_hot.c, t/op/re_tests] lazy $& and \Z fixJeff Pinyan2001-11-161-0/+3
| | | | | Message-ID: <Pine.GSO.4.21.0111141634020.16649-100000@crusoe.crusoe.net> p4raw-id: //depot/perl@13048
* More UTF-8 EXACT tweaking, plus a forgotten UTF-8 Jarkko Hietaniemi2001-11-061-1/+2
| | | | | toggle-on from the encoding pragma. p4raw-id: //depot/perl@12872
* Implement the encoding pragma for regex literals.Jarkko Hietaniemi2001-11-061-0/+15
| | | p4raw-id: //depot/perl@12864
* Regex debugging fixes from Hugo.Jarkko Hietaniemi2001-11-051-1/+1
| | | p4raw-id: //depot/perl@12858
* Don't bother doing POSIX charclass parsing if itJarkko Hietaniemi2001-11-031-18/+22
| | | | | | | | | | | | | | | | | | possibly cannot be so. Prepares way for charclass syntax like [[abc]||[def]] (or just [[abc][def]]) for union, [[\w]&&[$a]] for intersection, and [[a-z]&&[^def]] for subtraction. Currently /[[a]/ (or /[a[]/) parses as a character class containing two characters, "[" and "a", this may have to be broken for the syntax described above, otherwise we would have to scan the whole pattern to find out whether the square brackets match pairwise. Luckily, the special case of "[" doesn't seem to be documented (as opposed to "]" and "-"), so we may have better story for breaking it... One can always use \[ if one wants a literal "[", so there. p4raw-id: //depot/perl@12835
* Comment correction.Jarkko Hietaniemi2001-11-031-1/+1
| | | p4raw-id: //depot/perl@12834
* STRLEN != int.Jarkko Hietaniemi2001-10-251-1/+1
| | | p4raw-id: //depot/perl@12658
* This takes care of some of the re 'debug' cases butJarkko Hietaniemi2001-10-251-2/+3
| | | | | | | not all cases since the information whether the pattern or the target are utf8 seems to be either lost or not spread widely enough, sigh. p4raw-id: //depot/perl@12631
* Dump Unicode better for re 'debug'. The regprop()Jarkko Hietaniemi2001-10-241-4/+14
| | | | | | is unfinished since have to figure out how to detect Unicodeness in there. p4raw-id: //depot/perl@12621
* Make the toupper/lower/title API for Unicode not rightJarkko Hietaniemi2001-10-091-4/+4
| | | | | | but at least less wrong: prepare for the mapping being more than just one-character-to-one-character. p4raw-id: //depot/perl@12371
* Be careful to pull chars from the varargs stack whenJarkko Hietaniemi2001-10-011-3/+5
| | | | | formatting chars. p4raw-id: //depot/perl@12292
* Further tweaks to the Unicode properties.Jarkko Hietaniemi2001-10-011-2/+2
| | | p4raw-id: //depot/perl@12286
* More leniency to the \p and \P: now can have whitespaceJarkko Hietaniemi2001-09-291-5/+20
| | | | | | | between the property definition and the curlies; now can invert the property by having a caret between the open curly and the property. p4raw-id: //depot/perl@12269
* Re: the remaining bugs in \x escapes (was Re: [PATCH] oct and hex in ↵Nicholas Clark2001-09-101-4/+6
| | | | | | | glorious 64 bit (with less bugs) (was Re: hex and oct again (was Re: FreeBSD MD5 crypt? Re: crypt/hex/oct and Unicode?))) Message-ID: <20010911000031.G1512@plum.flirble.org> p4raw-id: //depot/perl@11990
* Using strlen() not good on embedded nul bytes.Jarkko Hietaniemi2001-09-101-13/+13
| | | p4raw-id: //depot/perl@11967
* [PATCH regcomp.c] zero-width assertions CAN be ?'dJeff Pinyan2001-09-091-0/+4
| | | | | | | | | | | | | | | Date: Sat, 8 Sep 2001 15:42:30 -0400 (EDT) Message-ID: <Pine.GSO.4.21.0109081535480.24489-100000@crusoe.crusoe.net> Subject: Re: [PATCH t/op/misc.t] regcomp.c patch broke test From: "Jeff 'japhy/Marillion' Pinyan" <jeffp@crusoe.net> Date: Sat, 8 Sep 2001 18:33:12 -0400 (EDT) Message-ID: <Pine.GSO.4.21.0109081832030.24489-100000@crusoe.crusoe.net> Subject: [PATCH t/lib/warnings/regcomp] (?=...)? gives no warning now From: "Jeff 'japhy/Marillion' Pinyan" <jeffp@crusoe.net> Date: Sat, 8 Sep 2001 18:37:22 -0400 (EDT) Message-ID: <Pine.GSO.4.21.0109081835340.24489-100000@crusoe.crusoe.net> p4raw-id: //depot/perl@11956
* oct and hex in glorious 64 bit (with less bugs) (was Re: hex and oct again ↵Nicholas Clark2001-09-051-14/+21
| | | | | | | (was Re: FreeBSD MD5 crypt? Re: crypt/hex/oct and Unicode?)) Message-ID: <20010904224250.P25120@plum.flirble.org> p4raw-id: //depot/perl@11874
* Rename the variable: it *used* to be (wrongly) that theJarkko Hietaniemi2001-09-021-1/+1
| | | | | | code related to PL_reg_sv (so PL_reg_sv_utf8 was logical) but that is no more the case: PL_reg_match_utf8 is better. p4raw-id: //depot/perl@11823
* remove deprecated PERL_OBJECT cruft, it has long since stoppedGurusamy Sarathy2001-08-311-13/+3
| | | | | working in 5.7.x p4raw-id: //depot/perl@11803
* Fixes bug in change 11717 that bus errored on HP-UX 10.20Artur Bergman2001-08-311-1/+1
| | | | | Might break on platforms where bool is larger than 8 bites ??? p4raw-id: //depot/perl@11800
* Change 11797 sneaked in a faulty regcomp.c change which wasn'tArtur Bergman2001-08-311-1/+1
| | | | | supposed to happen. p4raw-id: //depot/perl@11798
* Re: Problem in ext/Time/HiRest/HiRes.t John P. Linderman2001-08-311-1/+1
| | | | | | Message-Id: <200108311220.IAA54125@raptor.research.att.com> Fixes test 14 which could fail randomly in rare cases. p4raw-id: //depot/perl@11797
* New try for ID 20010407.006: detach the semanticsJarkko Hietaniemi2001-08-181-0/+1
| | | | | "was the last match target UTF8" into its own variable. p4raw-id: //depot/perl@11717
* Re: [ID 20010809.023] perlre misleads when stating that (?i) should be at ↵Hugo van der Sanden2001-08-151-0/+1
| | | | | | | front of pattern Message-Id: <200108151032.f7FAWBI30961@crypt.compulink.co.uk> p4raw-id: //depot/perl@11677
* Not quite so relicy as thought in #11651 (op/concat #4 and #5Jarkko Hietaniemi2001-08-121-1/+5
| | | | | stopped working). p4raw-id: //depot/perl@11653