path: root/regcomp.c
Commit message | Author | Age | Files | Lines
* Better place to initialize (IRIX cc noticed that the many "goto defchars" bypass the initialization) | Jarkko Hietaniemi | 2002-03-16 | 1 | -1/+2
  p4raw-id: //depot/perl@15265
* HP-UX gcc 3 warnings found by Sarathy. | Jarkko Hietaniemi | 2002-03-16 | 1 | -1/+1
  p4raw-id: //depot/perl@15255
* more warnings tidyup | Paul Marquess | 2002-03-11 | 1 | -7/+7
  From: "Paul Marquess" <paul_marquess@yahoo.co.uk>
  Message-ID: <AIEAJICLCBDNAAOLLOKLMEEGDPAA.paul_marquess@yahoo.co.uk>
  p4raw-id: //depot/perl@15155
* As suggested by Anton Tagunov, eq and cmp now obey the encoding pragma (by remapping their byte argument if the other argument is in Unicode). | Jarkko Hietaniemi | 2002-03-07 | 1 | -1/+1
  Also fix a bug found by Anton where ord undef under the encoding pragma would barf. ([ID 20020307.009] A null pointer dereference with 'use encoding')
  Finally, use the nicer form of sv_recode_to_utf8.
  p4raw-id: //depot/perl@15085
* taint + deprecated warnings | Paul Marquess | 2002-03-04 | 1 | -3/+2
  From: "Paul Marquess" <paul_marquess@yahoo.co.uk>
  Message-ID: <AIEAJICLCBDNAAOLLOKLCEKGDOAA.paul_marquess@yahoo.co.uk>
  p4raw-id: //depot/perl@15003
* Use to_uni_fold(). | Jarkko Hietaniemi | 2002-02-24 | 1 | -6/+1
  p4raw-id: //depot/perl@14855
* Using uninitialized (and wrong) variables is not good. | Jarkko Hietaniemi | 2002-02-24 | 1 | -2/+2
  p4raw-id: //depot/perl@14851
* EBCDIC: this seems to calm the last of the Malformed UTF-8 warnings. | Jarkko Hietaniemi | 2002-02-24 | 1 | -12/+20
  p4raw-id: //depot/perl@14850
* The POSIX extensions are deadly. | Jarkko Hietaniemi | 2002-02-19 | 1 | -3/+2
  p4raw-id: //depot/perl@14765
* EBCDIC: now the worst seems to be over for the "Malformed" warnings. | Jarkko Hietaniemi | 2002-02-18 | 1 | -1/+5
  Still a few of them, and plenty of test failures, but getting better.
  p4raw-id: //depot/perl@14739
* "Can't happen" - yes, it can, with buggy UTF-8/EBCDIC. | Jarkko Hietaniemi | 2002-02-11 | 1 | -12/+24
  Stepping backward in the string would be majorly bad.
  p4raw-id: //depot/perl@14633
* (??{ }) anomaly | Yitzchak Scott-Thoennes | 2002-02-04 | 1 | -0/+2
  Message-ID: <ZxgX8gzkgCXO092yn@efn.org>
  p4raw-id: //depot/perl@14547
* EBCDIC: more natification. | Jarkko Hietaniemi | 2002-02-03 | 1 | -6/+9
  p4raw-id: //depot/perl@14540
* EBCDIC: character classes must remap the low 256. | Jarkko Hietaniemi | 2002-02-03 | 1 | -2/+2
  p4raw-id: //depot/perl@14534
* Copyright++. (Not all the toplevel *.h have one, it seems.) | Jarkko Hietaniemi | 2002-01-23 | 1 | -1/+1
  p4raw-id: //depot/perl@14391
* Don't bother checking for the Greek special cases if the node is too short. | Jarkko Hietaniemi | 2002-01-13 | 1 | -1/+1
  p4raw-id: //depot/perl@14228
* Special treatment for U+03B0 and U+0390 in //i, this means that we can remove the minlen pessimisations introduced by the #14096. | Jarkko Hietaniemi | 2002-01-12 | 1 | -0/+44
  p4raw-id: //depot/perl@14219
* micropatch | Hugo van der Sanden | 2002-01-10 | 1 | -3/+1
  Message-Id: <200201101820.g0AIKw208580@crypt.compulink.co.uk>
  p4raw-id: //depot/perl@14174
* [patch] save a test | Hugo van der Sanden | 2002-01-08 | 1 | -2/+3
  Message-Id: <200201081618.g08GIlN02821@crypt.compulink.co.uk>
  p4raw-id: //depot/perl@14137
* leading (.*) should not be implicitly ^'d | Jeff Pinyan | 2002-01-07 | 1 | -1/+1
  Message-ID: <Pine.GSO.4.21.0201061532400.27018-100000@crusoe.crusoe.net>
  p4raw-id: //depot/perl@14115
* More regex and utf8 debug dumping. | Jarkko Hietaniemi | 2002-01-07 | 1 | -3/+7
  p4raw-id: //depot/perl@14114
* Unused variable. | Jarkko Hietaniemi | 2002-01-07 | 1 | -1/+0
  p4raw-id: //depot/perl@14113
* Finish up (ha!) the Unicode case folding; enhance regex dumping code. | Jarkko Hietaniemi | 2002-01-05 | 1 | -16/+54
  p4raw-id: //depot/perl@14096
* Ooops. Retract the regex parts of #14090. | Jarkko Hietaniemi | 2002-01-05 | 1 | -28/+9
  p4raw-id: //depot/perl@14091
* [REPATCH] Re: [PATCH] Re: socketpair blip on unicos/mk, too | Nicholas Clark | 2002-01-05 | 1 | -9/+28
  Message-ID: <20020105181013.I300@Bagpuss.unfortu.net>
  p4raw-id: //depot/perl@14090
* Do not add the first characters of multicharacter foldings into character classes. | Jarkko Hietaniemi | 2002-01-03 | 1 | -3/+4
  p4raw-id: //depot/perl@14048
* Allow emitting the foldcased version of "\x{df}", "ss" (among others, another example would be \x{fb00} -> "ff") | Jarkko Hietaniemi | 2002-01-03 | 1 | -2/+2
  p4raw-id: //depot/perl@14046
* Now that to_utf8_fold() was fixed. | Jarkko Hietaniemi | 2002-01-01 | 1 | -3/+1
  p4raw-id: //depot/perl@14004
* Unused variable. | Jarkko Hietaniemi | 2001-12-31 | 1 | -1/+0
  p4raw-id: //depot/perl@13988
* Add some comments to the recent Unicode case-folding saga. | Jarkko Hietaniemi | 2001-12-31 | 1 | -2/+21
  p4raw-id: //depot/perl@13985
* Delay folding until necessary; start of handling folding into several characters. | Jarkko Hietaniemi | 2001-12-31 | 1 | -11/+37
  p4raw-id: //depot/perl@13969
* Fix all the I case foldings as per CaseFold.txt. | Jarkko Hietaniemi | 2001-12-30 | 1 | -1/+2
  p4raw-id: //depot/perl@13964
* Fix all the C(ommon) case foldings as per CaseFold.txt. | Jarkko Hietaniemi | 2001-12-30 | 1 | -0/+11
  p4raw-id: //depot/perl@13963
* Must cast constants if they can be quads. | Jarkko Hietaniemi | 2001-12-24 | 1 | -3/+3
  p4raw-id: //depot/perl@13872
* The funky final sigma casefolding. | Jarkko Hietaniemi | 2001-12-23 | 1 | -1/+13
  p4raw-id: //depot/perl@13866
* Fix encoding pragma. | Jarkko Hietaniemi | 2001-12-23 | 1 | -13/+19
  p4raw-id: //depot/perl@13859
* and [BUG] \X and \C fixed, \X still dorked | Jeffrey Friedl | 2001-12-22 | 1 | -0/+2
  Message-Id: <200112220718.fBM7IHG25075@ventrue.corp.yahoo.com>
  (partially applied, most of the new tests need to be rethought)
  p4raw-id: //depot/perl@13857
* Unicode casefolding fixes. | Jarkko Hietaniemi | 2001-12-22 | 1 | -12/+10
  p4raw-id: //depot/perl@13843
* Unadorned numbers evil. | Jarkko Hietaniemi | 2001-12-13 | 1 | -1/+1
  p4raw-id: //depot/perl@13672
* Fixes for casting problems detected on a SuSE 7.2 Itanium, mostly gcc -Wall complaining about the argument of %p not being a void *. | Jarkko Hietaniemi | 2001-12-12 | 1 | -1/+1
  p4raw-id: //depot/perl@13659
* Apply NetBSD patch-ae: another gcc sparc64 bug. | Jarkko Hietaniemi | 2001-12-09 | 1 | -0/+28
  p4raw-id: //depot/perl@13566
* memset() is cheaper than a loop of 256 bit-a-a-times | Nicholas Clark | 2001-11-24 | 1 | -7/+3
  Message-ID: <20011124163042.R37621@plum.flirble.org>
  p4raw-id: //depot/perl@13236
* Re: regex oddness with \Z and $& | Hugo van der Sanden | 2001-11-23 | 1 | -0/+1
  Message-Id: <200111231515.fANFFOR10341@crypt.compulink.co.uk>
  p4raw-id: //depot/perl@13198
* Retract #13048: that introduced a malloc() (through savepvn()) to *every* m//, which killed the performance quite badly. | Jarkko Hietaniemi | 2001-11-18 | 1 | -3/+0
  p4raw-id: //depot/perl@13079
* Re: [PATCH regcomp.[ch], regexp.h, pp_hot.c, t/op/re_tests] lazy $& and \Z fix | Jeff Pinyan | 2001-11-16 | 1 | -0/+3
  Message-ID: <Pine.GSO.4.21.0111141634020.16649-100000@crusoe.crusoe.net>
  p4raw-id: //depot/perl@13048
* More UTF-8 EXACT tweaking, plus a forgotten UTF-8 toggle-on from the encoding pragma. | Jarkko Hietaniemi | 2001-11-06 | 1 | -1/+2
  p4raw-id: //depot/perl@12872
* Implement the encoding pragma for regex literals. | Jarkko Hietaniemi | 2001-11-06 | 1 | -0/+15
  p4raw-id: //depot/perl@12864
* Regex debugging fixes from Hugo. | Jarkko Hietaniemi | 2001-11-05 | 1 | -1/+1
  p4raw-id: //depot/perl@12858
* Don't bother doing POSIX charclass parsing if it possibly cannot be so. | Jarkko Hietaniemi | 2001-11-03 | 1 | -18/+22
  Prepares way for charclass syntax like [[abc]||[def]] (or just [[abc][def]]) for union, [[\w]&&[$a]] for intersection, and [[a-z]&&[^def]] for subtraction.
  Currently /[[a]/ (or /[a[]/) parses as a character class containing two characters, "[" and "a", this may have to be broken for the syntax described above, otherwise we would have to scan the whole pattern to find out whether the square brackets match pairwise. Luckily, the special case of "[" doesn't seem to be documented (as opposed to "]" and "-"), so we may have better story for breaking it... One can always use \[ if one wants a literal "[", so there.
  p4raw-id: //depot/perl@12835
* Comment correction. | Jarkko Hietaniemi | 2001-11-03 | 1 | -1/+1
  p4raw-id: //depot/perl@12834