path: root/regcomp.c
Commit message | Author | Age | Files | Lines
* Using strlen() not good on embedded nul bytes. | Jarkko Hietaniemi | 2001-09-10 | 1 | -13/+13
  p4raw-id: //depot/perl@11967
* [PATCH regcomp.c] zero-width assertions CAN be ?'d | Jeff Pinyan | 2001-09-09 | 1 | -0/+4
  Date: Sat, 8 Sep 2001 15:42:30 -0400 (EDT)
  Message-ID: <Pine.GSO.4.21.0109081535480.24489-100000@crusoe.crusoe.net>
  Subject: Re: [PATCH t/op/misc.t] regcomp.c patch broke test
  From: "Jeff 'japhy/Marillion' Pinyan" <jeffp@crusoe.net>
  Date: Sat, 8 Sep 2001 18:33:12 -0400 (EDT)
  Message-ID: <Pine.GSO.4.21.0109081832030.24489-100000@crusoe.crusoe.net>
  Subject: [PATCH t/lib/warnings/regcomp] (?=...)? gives no warning now
  From: "Jeff 'japhy/Marillion' Pinyan" <jeffp@crusoe.net>
  Date: Sat, 8 Sep 2001 18:37:22 -0400 (EDT)
  Message-ID: <Pine.GSO.4.21.0109081835340.24489-100000@crusoe.crusoe.net>
  p4raw-id: //depot/perl@11956
* oct and hex in glorious 64 bit (with less bugs) (was Re: hex and oct again (was Re: FreeBSD MD5 crypt? Re: crypt/hex/oct and Unicode?)) | Nicholas Clark | 2001-09-05 | 1 | -14/+21
  Message-ID: <20010904224250.P25120@plum.flirble.org>
  p4raw-id: //depot/perl@11874
* Rename the variable: it *used* to be (wrongly) that the | Jarkko Hietaniemi | 2001-09-02 | 1 | -1/+1
  code related to PL_reg_sv (so PL_reg_sv_utf8 was logical)
  but that is no more the case: PL_reg_match_utf8 is better.
  p4raw-id: //depot/perl@11823
* remove deprecated PERL_OBJECT cruft, it has long since stopped | Gurusamy Sarathy | 2001-08-31 | 1 | -13/+3
  working in 5.7.x
  p4raw-id: //depot/perl@11803
* Fixes bug in change 11717 that bus errored on HP-UX 10.20 | Artur Bergman | 2001-08-31 | 1 | -1/+1
  Might break on platforms where bool is larger than 8 bites ???
  p4raw-id: //depot/perl@11800
* Change 11797 sneaked in a faulty regcomp.c change which wasn't | Artur Bergman | 2001-08-31 | 1 | -1/+1
  supposed to happen.
  p4raw-id: //depot/perl@11798
* Re: Problem in ext/Time/HiRest/HiRes.t | John P. Linderman | 2001-08-31 | 1 | -1/+1
  Message-Id: <200108311220.IAA54125@raptor.research.att.com>
  Fixes test 14 which could fail randomly in rare cases.
  p4raw-id: //depot/perl@11797
* New try for ID 20010407.006: detach the semantics | Jarkko Hietaniemi | 2001-08-18 | 1 | -0/+1
  "was the last match target UTF8" into its own variable.
  p4raw-id: //depot/perl@11717
* Re: [ID 20010809.023] perlre misleads when stating that (?i) should be at front of pattern | Hugo van der Sanden | 2001-08-15 | 1 | -0/+1
  Message-Id: <200108151032.f7FAWBI30961@crypt.compulink.co.uk>
  p4raw-id: //depot/perl@11677
* Not quite so relicy as thought in #11651 (op/concat #4 and #5 | Jarkko Hietaniemi | 2001-08-12 | 1 | -1/+5
  stopped working).
  p4raw-id: //depot/perl@11653
* More (less) regex/utf8 relics. (Toned down later in #11653.) | Jarkko Hietaniemi | 2001-08-12 | 1 | -5/+1
  p4raw-id: //depot/perl@11651
* Continuation of #11575: SANY_SEEN completely deprecated, | Jarkko Hietaniemi | 2001-08-04 | 1 | -5/+1
  plus more tests that unearthed a bug in @a = ($utf8 =~ /\C/g),
  plus a fix for the bug.
  p4raw-id: //depot/perl@11577
* Decouple SANY into SANY and CANY: the new SANY is /./s, | Daniel P. Berrange | 2001-08-04 | 1 | -3/+7
  the new CANY is the \C. The problem reported and the test case supplied in
  Subject: UTF-8 bugs in string length & single line regex matches
  Message-ID: <20010803113932.A19318@berrange.com>
  p4raw-id: //depot/perl@11575
* [patch] refcount re ops | Doug MacEachern | 2001-08-04 | 1 | -1/+8
  Message-ID: <Pine.LNX.4.21.0108031814240.23972-100000@mako.covalent.net>
  p4raw-id: //depot/perl@11568
* patch to add DEL to [:cntrl:] | Jeffrey Friedl | 2001-07-14 | 1 | -1/+1
  Message-Id: <200107140625.XAA01517@ventrue.corp.yahoo.com>
  p4raw-id: //depot/perl@11371
* Patch: document reg_data.what member | Mark-Jason Dominus | 2001-07-12 | 1 | -0/+1
  Message-ID: <20010712182532.14821.qmail@plover.com>
  p4raw-id: //depot/perl@11322
* Code cleanup based on turning off the -woffs in IRIX. | Jarkko Hietaniemi | 2001-06-30 | 1 | -1/+6
  Not all of the gripes cleaned up (hairy code in hv.c and regcomp.c;
  unused newsp, gimme, and optype from cop.h macros; unused 'key'
  arguments in ?DBM_File.xs) (and the -woffs left to the IRIX hints)
  p4raw-id: //depot/perl@11051
* Partially fix a problem noticed by IRIX compiler: | Jarkko Hietaniemi | 2001-06-24 | 1 | -1/+3
  the initialization of parse_start was bypassed by several gotos.
  Now initialized to zero, which may not be the best choice.
  p4raw-id: //depot/perl@10906
* Re: perl@10722: Bogus warnings on REs | Hugo van der Sanden | 2001-06-21 | 1 | -1/+2
  Message-Id: <200106210851.JAA01942@crypt.compulink.co.uk>
  Unroll to avoid a UTS compiler bug.
  p4raw-id: //depot/perl@10774
* RE: [PATCHES] regcomp.c, pod/perldiag.pod, t/op/pat.t | Paul Marquess | 2001-06-03 | 1 | -2/+10
  Message-ID: <000601c0ebae$77d10dc0$99dcfea9@bfs.phone.com>
  p4raw-id: //depot/perl@10410
* One less -Wall whine. | Jarkko Hietaniemi | 2001-06-03 | 1 | -1/+1
  p4raw-id: //depot/perl@10406
* -Wall cleanup continues. | Jarkko Hietaniemi | 2001-06-02 | 1 | -2/+8
  p4raw-id: //depot/perl@10392
* Re: [PATCHES] regcomp.c, pod/perldiag.pod, t/op/pat.t | Jeff Pinyan | 2001-06-01 | 1 | -2/+55
  Message-ID: <Pine.GSO.4.21.0106011032080.21027-100000@crusoe.crusoe.net>
  p4raw-id: //depot/perl@10376
* More -Wall sweeping. | Jarkko Hietaniemi | 2001-05-30 | 1 | -14/+14
  p4raw-id: //depot/perl@10338
* Medley of -Wall cleanups from Michael Schwen, Hugo van der Sanden, | Jarkko Hietaniemi | 2001-05-30 | 1 | -7/+6
  and Abhijit Menon-Sen.
  p4raw-id: //depot/perl@10321
* Re: [ID 20010506.041] segfault when matching utf8 string | Inaba Hiroto | 2001-05-25 | 1 | -0/+1
  Message-Id: <200105250124.KAA19571@toshiba.co.jp>
  p4raw-id: //depot/perl@10206
* Re: [ID 20000716.007] \G in a m//g expression causes problems | Hugo van der Sanden | 2001-05-23 | 1 | -1/+1
  Message-Id: <200105211532.QAA03999@crypt.compulink.co.uk>
  p4raw-id: //depot/perl@10187
* Re: [PATCH] HERE mark in regex | Ronald J. Kimball | 2001-05-16 | 1 | -3/+3
  Message-ID: <20010516130443.E1516273@linguist.thayer.dartmouth.edu>
  p4raw-id: //depot/perl@10136
* Remove the 'asciir' re subpragma. Should instead implement | Jarkko Hietaniemi | 2001-05-11 | 1 | -31/+7
  the 'physical vs logical' range scheme:

  \xAA-\xCC is a native physical range, you want that range of
  codepoints in your native encoding. In EBCDIC the codepoints
  in the gaps (between i-j and r-s) should be included.

  \x{AA}-\x{CC} is a physical Unicode range, you want that range
  of codepoints in Unicode.

  a-z is a logical range, you want that range of 'logical'
  codepoints in your native encoding. In EBCDIC the codepoints
  in the gaps (between i-j and r-s) should not be included.

  Mixed cases (a-\xAA, etc) should either be errors, or maybe
  the 'logical' endpoints should be converted to native/Unicode
  codepoints, and the range handled as a physical range.

  'Logical endpoints' are to be recognized only in the A-Z, a-z,
  and 0-9 ranges. Probably a warning should be given for mixed
  cases like A-z or a-9 (since such expressions are encoding
  dependent), with a recommendation to use physical ranges.
  p4raw-id: //depot/perl@10085
* Insecure regexes | Robin Houston | 2001-05-07 | 1 | -1/+1
  Message-ID: <20010507215612.A31114@penderel>
  p4raw-id: //depot/perl@10021
* -Wformat error from ext/re/re_comp.c | Robin Barker | 2001-05-04 | 1 | -6/+6
  Message-Id: <200105041709.SAA14835@tempest.npl.co.uk>
  p4raw-id: //depot/perl@9991
* The #9901 had removed one line essential for EBCDIC. | Jarkko Hietaniemi | 2001-05-04 | 1 | -0/+1
  p4raw-id: //depot/perl@9987
* Re: [PATCH bleadperl] [ID 20010426.002] Word boundry regex [...] | Hugo van der Sanden | 2001-04-30 | 1 | -1/+0
  Message-Id: <200104291609.RAA17790@crypt.compulink.co.uk>
  p4raw-id: //depot/perl@9911
* In character classes one couldn't have 0x80..0xff characters | Jarkko Hietaniemi | 2001-04-29 | 1 | -63/+40
  at the left hand side if there were 0x100.. characters in
  the character class.
  p4raw-id: //depot/perl@9901
* Re: [PATCH @9846] dumping ANYOF | Hugo van der Sanden | 2001-04-26 | 1 | -1/+5
  Message-Id: <200104262233.XAA22352@crypt.compulink.co.uk>
  p4raw-id: //depot/perl@9873
* Retract #9851, core dumps from pod2man. | Jarkko Hietaniemi | 2001-04-26 | 1 | -1/+0
  p4raw-id: //depot/perl@9852
* (Retracted by #9852.) | Hugo van der Sanden | 2001-04-26 | 1 | -0/+1
  Subject: [PATCH @9846] dumping ANYOF
  Message-Id: <200104260432.FAA12669@crypt.compulink.co.uk>
  p4raw-id: //depot/perl@9851
* Re: ANYOF_SIZE is wrong in 5.7.1 | Mark-Jason Dominus | 2001-04-22 | 1 | -2/+1
  Message-ID: <20010422012749.27024.qmail@plover.com>
  p4raw-id: //depot/perl@9778
* Re: Regex debugger patch | Mark-Jason Dominus | 2001-04-22 | 1 | -21/+194
  Message-ID: <20010421182439.16508.qmail@plover.com>
  Regex debugger backend.
  p4raw-id: //depot/perl@9776
* Integrate perlio: | Jarkko Hietaniemi | 2001-03-28 | 1 | -8/+34
  [ 9400] More EBCDIC tweaks:
  - one more swash issue &~(0xA0-1) did not do the right thing,
    for UTF-EBCDIC where &~(0x80-1) does for UTF-8.
  - add "use re 'asciirange'" to make [!-~] etc. work
    use it in MIME::QuotedPrint and t/op/regexp.t and t/op/pat.t
  - Choose a key for t/op/each.t test which gets encoded.
  - Skip utf8decode if this is UTF-EBCDIC.
  p4raw-link: @9400 on //depot/perlio: daf0f78e031c718c75590ef9ef573756f805776e
  p4raw-id: //depot/perl@9407
* More EBCDIC stuff: | Nick Ing-Simmons | 2001-03-20 | 1 | -10/+6
  - Loose the extra level of function on ASCII.
  - spotted a chr(0) issue in sv.c
  - re-work of UTF-X tr/// ranges to work in Unicode space.
    Still issues with the "0xff is illegal UTF-8" hack.
  - Yet another ad. hoc. utf8 'upgrade' in op.c recoded
    (why do it once when you can do it all over the place :-(
  - Enable HINTS_UTF8 on EBCDIC - then ignore it in toke.c,
    need utf8.pm for swashes.
  - Simplified and commented scan_const() in toke.c
  Still something wrong regexp and tr (swashes?).
  p4raw-id: //depot/perlio@9267
* Integrate changes #9137,9138,9142 from maintperl into mainline. | Jarkko Hietaniemi | 2001-03-14 | 1 | -3/+2
  fix leak in pregcomp() when RE fails to compile (e.g. m/\\/)
  remove squelch controls for "Scalars leaked" messages in most places (these are now cured)
  fix another memory leak reported by purify (tie callbacks that croak can leak when wiping out magic)
  p4raw-link: @9142 on //depot/maint-5.6/perl: 26972843796e21c404c9d13ec5ee86e7b952a2bd
  p4raw-link: @9138 on //depot/maint-5.6/perl: ad7f1144250940f9ca43bac32708ec5e718b30ff
  p4raw-link: @9137 on //depot/maint-5.6/perl: 1f35595ecca168b4f66e3399344799fdbd496d17
  p4raw-id: //depot/perl@9144
  p4raw-integrated: from //depot/maint-5.6/perl@9143
    'copy in' t/pragma/strict-vars (@7318..) t/pragma/warn/regcomp (@7887..) t/op/regexp.t (@8551..) t/op/lex_assign.t (@8987..)
    'merge in' t/op/local.t (@5902..) t/pragma/warn/op (@7846..) t/pragma/warnings.t (@7895..) t/comp/proto.t (@8173..) t/pragma/warn/toke (@8570..) regcomp.c (@8777..) scope.c (@8855..) t/op/pat.t (@9076..)
* regcomp.c is working in native space, not Unicode space (if different) | Nick Ing-Simmons | 2001-03-11 | 1 | -9/+8
  as it is doing compare against 'W' in \W etc.
  p4raw-id: //depot/perlio@9106
* Audit #ifdef EBCDIC and #ifndef ASCIIish, replace latter with former. | Nick Ing-Simmons | 2001-03-11 | 1 | -20/+7
  Use ASCII_TO_NATIVE and NATIVE_TO_ASCII to avoid some #ifs.
  p4raw-id: //depot/perlio@9105
* Fix for ID 20010306.008, UTF-8 and \w without 'use utf8' coredump. | Jarkko Hietaniemi | 2001-03-10 | 1 | -18/+0
  p4raw-id: //depot/perl@9098
* EBCDIC sanity - phase I | Nick Ing-Simmons | 2001-03-10 | 1 | -11/+11
  - rename utf8/uv functions to indicate what sort of uv they provide (uvuni/uvchr)
  - use utf8n_xxxx (c.f. pvn) for forms which take length.
  - back out vN.N and $^V exceptions to e2a/a2e
  - make "locale" isxxx macros be uvchr (may be redundant?)
  Not clear yet that toUPPER_uni et. al. return being handled correctly.
  The tr// and rexexp stuff still needs an audit, assumption is they are working in Unicode space.
  Need to provide v5.6 names for XS modules (decide is uni or chr ?).
  p4raw-id: //depot/perlio@9096
* Make /x{abcd}/ to work without use utf8. | Jarkko Hietaniemi | 2001-03-06 | 1 | -0/+2
  p4raw-id: //depot/perl@9058
* Retract #8929,8930,8932,8933 for now. | Jarkko Hietaniemi | 2001-02-25 | 1 | -31/+43
  p4raw-id: //depot/perl@8935
* (Retracted by #8395.) | Jarkko Hietaniemi | 2001-02-25 | 1 | -43/+31
  Attempt to fix the EBCDIC character range problem with //.
  p4raw-id: //depot/perl@8930