summaryrefslogtreecommitdiff
path: root/regcomp.c
Commit message (Collapse)AuthorAgeFilesLines
* [PATCH 5.7.0] make regcomp reenterableIlya Zakharevich2000-11-181-578/+624
| | | | | | | | | | | | | Date: Fri, 17 Nov 2000 20:35:11 -0500 Message-ID: <20001117203511.A13121@monk.mps.ohio-state.edu> Subject: Re: [PATCH 5.7.0] make regcomp reenterable From: Ilya Zakharevich <ilya@math.ohio-state.edu> Date: Fri, 17 Nov 2000 21:03:47 -0500 Message-ID: <20001117210347.A16570@monk.mps.ohio-state.edu> Plus a little bit of tweaking in pregcomp(). p4raw-id: //depot/perl@7741
* restore match data on backtracingIlya Zakharevich2000-11-181-10/+32
| | | | | Message-ID: <20001117172802.A1032@monk.mps.ohio-state.edu> p4raw-id: //depot/perl@7733
* Too profiler-happy: with optimization the #7590 actually makesJarkko Hietaniemi2000-11-071-3/+6
| | | | | | the test to run 0.5% _slower_. Requires much more instrumentation. Retract #7590. p4raw-id: //depot/perl@7591
* Shave off about 5% (Digital UNIX, -g, pixie) of the op/regexpJarkko Hietaniemi2000-11-071-6/+3
| | | | | | execution time in regcomp.c S_cl_any() and S_cl_is_anything() by using memset() and testing bytewise (as opposed to bitwise). p4raw-id: //depot/perl@7590
* [ID 20001031.004] Uninitialized auto variable in regcomp.cMartin Husemann2000-11-011-2/+2
| | | | | Message-Id: <200010312239.e9VMdZR01580@night-porter.duskware.de> p4raw-id: //depot/perl@7512
* Continue the internal UTF-8 API tweaking.Jarkko Hietaniemi2000-10-251-3/+3
| | | | | | | | Rename utf8_to_uv_chk() back to utf8_to_uv() because it's used much more than the simpler API, now called utf8_to_uv_simple(). Still not quite happy with API, too much partial duplication of functionality. p4raw-id: //depot/perl@7439
* Make the UTF-8 decoding stricter and more verbose whenJarkko Hietaniemi2000-10-241-8/+13
| | | | | | | | | | | | malformation happens. This involved adding an argument to utf8_to_uv_chk(), which involved changing its prototype, and prefer STRLEN over I32 for the UTF-8 length, which as a domino effect necessitated changing the prototypes of scan_bin(), scan_oct(), scan_hex(), and reg_uni(). The stricter UTF-8 decoding checking uses Markus Kuhn's UTF-8 Decode Stress Tester from http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt p4raw-id: //depot/perl@7416
* Re-instate Perl_utf8_to_uv without checking parameter - added in change 7075.Nick Ing-Simmons2000-09-301-3/+3
| | | | | | | i.e. rename Simon's function to Perl_utf8_to_uv_chk, change all calls to it to use new name and add Perl_utf8_to_uv() as a wrapper which calls it passing 0 to checking to get the warning. p4raw-id: //depot/perl@7096
* continued -Wformat supportRobin Barker2000-09-141-11/+11
| | | | | Message-Id: <200009141707.SAA13276@tempest.npl.co.uk> p4raw-id: //depot/perl@7081
* Fix for a parsing bug, not for the original bug.Spider Boardman2000-09-141-0/+3
| | | | | | Subject: Re: [ID 20000910.005] Another segfault with regexes. Message-Id: <200009132152.RAA24029@leggy.zk3.dec.com> p4raw-id: //depot/perl@7076
* Batch of UTF-8 patches from Simon Cozens.Jarkko Hietaniemi2000-09-141-3/+3
| | | p4raw-id: //depot/perl@7075
* nextchar() abuse misses an optimisationHugo van der Sanden2000-08-221-2/+2
| | | | | Message-Id: <200008221021.LAA03332@crypt.compulink.co.uk> p4raw-id: //depot/perl@6770
* Rename the macro argument because some preprocessorsJarkko Hietaniemi2000-08-211-8/+8
| | | | | | can't tell the difference and expand arguments also inside double quoted strings. p4raw-id: //depot/perl@6747
* Fix a core dump in lib/selfloader under -DDEBUGGING.Spider.Boardman@Orb.Nashua.NH.US2000-08-191-2/+7
| | | | | | Subject: PATCH @6698 for [ID 20000817.007] Not OK: perl v5.7.0 +SUIDMAIL +DEVEL6676 on alpha-dec_osf 4.0f (UNINSTALLED) Message-Id: <200008182241.SAA29667@Orb.Nashua.NH.US> p4raw-id: //depot/perl@6709
* Add [[:blank:]] as suggested inJeffrey Friedl2000-08-181-6/+55
| | | | | | | | | | | | | | Subject: [ID 20000716.024] [=cc=] / [:blank:] Message-Id: <200007170055.RAA23528@fummy.dsl.yahoo.com> (the [=cc=] has already been taken care of by #6439 so the whole bug report can be closed) and make [[:space:]] to be equivalent to isspace(3) (as opposed to \s, which is isSPACE()). The difference is that now [[:space:]] matches the mythical vertical tab, while \s doesn't. p4raw-id: //depot/perl@6703
* Tweak the regex compilation errors once more.Jarkko Hietaniemi2000-08-171-3/+3
| | | p4raw-id: //depot/perl@6663
* Change the regx compilation error markers to use = instead of <Lupe Christoph2000-08-161-2/+2
| | | | | | | | since pod makes using the latter quite messy. Reported in ID 20000814.006 by Abigail and in Subject: Unknown escape E<> ? Message-ID: <20000811003027.F17420@alanya.lupe-christoph.de> p4raw-id: //depot/perl@6653
* Get back into sync with Jeffrey on the enhanced regex warnings.Jarkko Hietaniemi2000-08-101-6/+6
| | | p4raw-id: //depot/perl@6563
* Re: enhanced(?) regex error messagesJeffrey Friedl2000-08-091-108/+303
| | | | | | | | Message-Id: <200008090759.AAA07144@ventrue.yahoo.com> (plus two small patches sent privately) (this still seems to leave few test failures) p4raw-id: //depot/perl@6560
* Re: enhanced(?) regex error messages Hugo van der Sanden2000-08-081-5/+5
| | | | | | | Message-Id: <200008080225.DAA10998@crypt.compulink.co.uk> plus Capitalize the error messages, plus perldiag them. p4raw-id: //depot/perl@6546
* Make regular expression parse error messages easier to understand.Jeffrey Friedl2000-08-071-17/+42
| | | | | | Subject: Re: enhanced(?) regex error messages Message-Id: <200008050634.XAA20360@ventrue.yahoo.com> p4raw-id: //depot/perl@6540
* [ID 20000803.001] further regexp counting problemsHugo van der Sanden2000-08-031-0/+2
| | | | | Message-Id: <200008031725.SAA10580@crypt.compulink.co.uk> p4raw-id: //depot/perl@6508
* Make the unimplemented POSIX regex features [[.cc.]] and [[=c=]]Jeffrey Friedl2000-07-261-5/+5
| | | | | | | | | | to be fatal errors (instead of by default ignoring them, and ignoring with a bug: even though -w gave an error, the opening [ was left in) Reported in: Subject: [ID 20000716.024] [=cc=] / [:blank:] Message-Id: <200007170055.RAA23528@fummy.dsl.yahoo.com> p4raw-id: //depot/perl@6439
* A missing 'break' after the [[:space:]] switch case.Stephen P. Potter2000-07-241-0/+1
| | | | | | Subject: [ID 20000718.006] Missing 'break' in regcomp.c Message-Id: <200007182025.QAA28383@spp.users.ds.net> p4raw-id: //depot/perl@6426
* integrate cfgperl contents into mainlineGurusamy Sarathy2000-07-141-0/+3
|\ | | | | p4raw-id: //depot/perl@6400
| * The {multiplier} of a fixed substring was overlooked whichHugo van der Sanden2000-07-131-0/+3
| | | | | | | | | | | | | | | | caused a wrong initial search offset for that substring. Subject: [PATCH 5.6.0] Re: [ID 20000613.001] Regex works in v5.005_03 but fails in v5.06 Message-Id: <200007131827.TAA14487@crypt.compulink.co.uk> p4raw-id: //depot/cfgperl@6392
| * Do no -warn on \_, only on \alpha.David Dyck2000-06-271-3/+6
|/ | | | | | Subject: New Unrecognized escape warning for /\_/ from activestate perl-current5.6.0 Message-ID: <Pine.LNX.4.05.10006260830280.3054-100000@dd.tc.fluke.com> p4raw-id: //depot/cfgperl@6241
* fix broken parsing of /\x{ab}/Gurusamy Sarathy2000-05-031-1/+1
| | | p4raw-id: //depot/perl@6057
* change#3798 broke the meaning of "\0_7_7", tr/\0_// etc.; fix itGurusamy Sarathy2000-05-021-1/+10
| | | | | | | | such that underscores are only ignored in literal numbers, "\x{...}", and hex/oct argument p4raw-link: @3798 on //depot/cfgperl: 252aa0820e6bce274b33bd342cfc65e18a59a165 p4raw-id: //depot/perl@6044
* fix for failure to match $foo =~ /(?i)/ (from Ilya Zakharevich)Gurusamy Sarathy2000-04-271-1/+7
| | | p4raw-id: //depot/perl@5973
* tweak to use SIZE_ONLYGurusamy Sarathy2000-03-141-1/+1
| | | p4raw-id: //depot/perl@5745
* avoid dup warning (from Simon Cozens)Gurusamy Sarathy2000-03-141-1/+3
| | | p4raw-id: //depot/perl@5734
* fix optimizer bug in /^(?p{"a"})b/ (from Ilya Zakharevich)Gurusamy Sarathy2000-03-111-1/+2
| | | p4raw-id: //depot/perl@5644
* Cpp out deadcode that IRIX compiler noticed.Jarkko Hietaniemi2000-03-111-0/+3
| | | p4raw-id: //depot/cfgperl@5638
* fixes for most warnings identified by gcc -WallGurusamy Sarathy2000-03-051-15/+6
| | | p4raw-id: //depot/perl@5540
* integrate cfgperl and vmsperl contents into mainlineGurusamy Sarathy2000-03-031-6/+6
|\ | | | | p4raw-id: //depot/perl@5486
| * Integrate with Sarathy.Jarkko Hietaniemi2000-03-031-1/+1
| |\ | | | | | | p4raw-id: //depot/cfgperl@5469
| * | /[[:alpha]/ now dies on unmatched [] instead ofJarkko Hietaniemi2000-03-031-6/+6
| | | | | | | | | | | | | | | "an unknown character class" (spotted by Tom Christiansen) p4raw-id: //depot/cfgperl@5465
* | | warn about /(?p{})/ (from Simon Cozens)Gurusamy Sarathy2000-03-031-0/+3
| |/ |/| | | p4raw-id: //depot/perl@5475
* | /(?p{})/ changed to /(??{})/, per Larry's suggestion (fromGurusamy Sarathy2000-03-031-1/+1
|/ | | | | Simon Cozens <simon@othersideofthe.earth.li>) p4raw-id: //depot/perl@5467
* Some compilers have problems with quad switch() controls.Jarkko Hietaniemi2000-02-291-3/+8
| | | p4raw-id: //depot/cfgperl@5366
* More UTF8_MAXLEN fixesGisle Aas2000-02-241-1/+1
| | | | | | To: perl5-porters@perl.org Message-ID: <m3ema2tc8q.fsf@eik.g.aas.no> p4raw-id: //depot/cfgperl@5247
* lexical warnings update, ability to inspect bitmask in callingGurusamy Sarathy2000-02-201-25/+25
| | | | | scope, among other things (from Paul Marquess) p4raw-id: //depot/perl@5170
* more purification (pp_require() could access free memory; vdie()Gurusamy Sarathy2000-02-131-0/+4
| | | | | | | | could think message was random length when passed a null argument; utilize() didn't set up the hash for the method name leading to pp_method_named() accessing random state; PL_curpm wasn't zeroed properly) p4raw-id: //depot/perl@5072
* fix uninitialized memory reads found by purifyGurusamy Sarathy2000-02-111-2/+2
| | | p4raw-id: //depot/perl@5063
* set SvUTF8 on vectors only if there are chars > 127; update copyrightGurusamy Sarathy2000-02-061-1/+1
| | | | | years (from Gisle Aas) p4raw-id: //depot/perl@5009
* HINT_UTF8 is not propagated to the op tree anymore; add aGurusamy Sarathy2000-02-011-2/+1
| | | | | perlunicode.pod that reflects changes to unicode support so far p4raw-id: //depot/perl@4941
* runtime now looks at the SVf_UTF8 bit on the SV to decideGurusamy Sarathy2000-01-311-1/+3
| | | | | | | | | whether to use widechar semantics; lexer and RE engine continue to need "use utf8" to enable unicode awareness in literals and patterns (TODO: this needs to be fixed); $1 et al are marked SvUTF8 if the pattern was compiled for utf8 (TODO: propagating it from the data is probably better) p4raw-id: //depot/perl@4930
* add patch for printf-style format typechecks (from Robin BarkerGurusamy Sarathy2000-01-221-5/+5
| | | | | <rmb1@cise.npl.co.uk>); fixes for problems so identified p4raw-id: //depot/perl@4836
* ebcdic fix for Data::Dumper from Peter PrymmerGurusamy Sarathy2000-01-021-2/+20
| | | p4raw-id: //depot/perl@4745