| Commit message (Collapse) | Author | Age | Files | Lines |
|
|
| |
p4raw-id: //depot/perl@11967
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Date: Sat, 8 Sep 2001 15:42:30 -0400 (EDT)
Message-ID: <Pine.GSO.4.21.0109081535480.24489-100000@crusoe.crusoe.net>
Subject: Re: [PATCH t/op/misc.t] regcomp.c patch broke test
From: "Jeff 'japhy/Marillion' Pinyan" <jeffp@crusoe.net>
Date: Sat, 8 Sep 2001 18:33:12 -0400 (EDT)
Message-ID: <Pine.GSO.4.21.0109081832030.24489-100000@crusoe.crusoe.net>
Subject: [PATCH t/lib/warnings/regcomp] (?=...)? gives no warning now
From: "Jeff 'japhy/Marillion' Pinyan" <jeffp@crusoe.net>
Date: Sat, 8 Sep 2001 18:37:22 -0400 (EDT)
Message-ID: <Pine.GSO.4.21.0109081835340.24489-100000@crusoe.crusoe.net>
p4raw-id: //depot/perl@11956
|
|
|
|
|
|
|
| |
(was Re: FreeBSD MD5 crypt? Re: crypt/hex/oct and Unicode?))
Message-ID: <20010904224250.P25120@plum.flirble.org>
p4raw-id: //depot/perl@11874
|
|
|
|
|
|
| |
code related to PL_reg_sv (so PL_reg_sv_utf8 was logical)
but that is no more the case: PL_reg_match_utf8 is better.
p4raw-id: //depot/perl@11823
|
|
|
|
|
| |
working in 5.7.x
p4raw-id: //depot/perl@11803
|
|
|
|
|
| |
Might break on platforms where bool is larger than 8 bites ???
p4raw-id: //depot/perl@11800
|
|
|
|
|
| |
supposed to happen.
p4raw-id: //depot/perl@11798
|
|
|
|
|
|
| |
Message-Id: <200108311220.IAA54125@raptor.research.att.com>
Fixes test 14 which could fail randomly in rare cases.
p4raw-id: //depot/perl@11797
|
|
|
|
|
| |
"was the last match target UTF8" into its own variable.
p4raw-id: //depot/perl@11717
|
|
|
|
|
|
|
| |
front of pattern
Message-Id: <200108151032.f7FAWBI30961@crypt.compulink.co.uk>
p4raw-id: //depot/perl@11677
|
|
|
|
|
| |
stopped working).
p4raw-id: //depot/perl@11653
|
|
|
| |
p4raw-id: //depot/perl@11651
|
|
|
|
|
|
| |
plus more tests that unearthed a bug in @a = ($utf8 =~ /\C/g),
plus a fix for the bug.
p4raw-id: //depot/perl@11577
|
|
|
|
|
|
|
|
|
| |
the new CANY is the \C. The problem reported and the
test case supplied in
Subject: UTF-8 bugs in string length & single line regex matches
Message-ID: <20010803113932.A19318@berrange.com>
p4raw-id: //depot/perl@11575
|
|
|
|
|
| |
Message-ID: <Pine.LNX.4.21.0108031814240.23972-100000@mako.covalent.net>
p4raw-id: //depot/perl@11568
|
|
|
|
|
| |
Message-Id: <200107140625.XAA01517@ventrue.corp.yahoo.com>
p4raw-id: //depot/perl@11371
|
|
|
|
|
| |
Message-ID: <20010712182532.14821.qmail@plover.com>
p4raw-id: //depot/perl@11322
|
|
|
|
|
|
|
|
| |
Not all of the gripes cleaned up (hairy code in hv.c and
regcomp.c; unused newsp, gimme, and optype from cop.h macros;
unused 'key' arguments in ?DBM_File.xs) (and the -woffs left
to the IRIX hints)
p4raw-id: //depot/perl@11051
|
|
|
|
|
|
|
| |
the initialization of parse_start was bypassed by
several gotos. Now initialized to zero, which may
not be the best choice.
p4raw-id: //depot/perl@10906
|
|
|
|
|
|
|
| |
Message-Id: <200106210851.JAA01942@crypt.compulink.co.uk>
Unroll to avoid a UTS compiler bug.
p4raw-id: //depot/perl@10774
|
|
|
|
|
| |
Message-ID: <000601c0ebae$77d10dc0$99dcfea9@bfs.phone.com>
p4raw-id: //depot/perl@10410
|
|
|
| |
p4raw-id: //depot/perl@10406
|
|
|
| |
p4raw-id: //depot/perl@10392
|
|
|
|
|
| |
Message-ID: <Pine.GSO.4.21.0106011032080.21027-100000@crusoe.crusoe.net>
p4raw-id: //depot/perl@10376
|
|
|
| |
p4raw-id: //depot/perl@10338
|
|
|
|
|
| |
and Abhijit Menon-Sen.
p4raw-id: //depot/perl@10321
|
|
|
|
|
| |
Message-Id: <200105250124.KAA19571@toshiba.co.jp>
p4raw-id: //depot/perl@10206
|
|
|
|
|
| |
Message-Id: <200105211532.QAA03999@crypt.compulink.co.uk>
p4raw-id: //depot/perl@10187
|
|
|
|
|
| |
Message-ID: <20010516130443.E1516273@linguist.thayer.dartmouth.edu>
p4raw-id: //depot/perl@10136
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
the 'physical vs logical' range scheme:
\xAA-\xCC is a native physical range, you want that range of
codepoints in your native encoding. In EBCDIC the codepoints
in the gaps (between i-j and r-s) should be included.
\x{AA}-\x{CC} is a physical Unicode range, you want that range of
codepoints in Unicode.
a-z is a logical range, you want that range of 'logical' codepoints
in your native encoding. In EBCDIC the codepoints in the gaps
(between i-j and r-s) should not be included.
Mixed cases (a-\xAA, etc) should either be errors, or maybe
the 'logical' endpoints should be converted to native/Unicode
codepoints, and the range handled as a physical range.
'Logical endpoints' are to be recognized only in the A-Z, a-z,
and 0-9 ranges. Probably a warning should be given for mixed
cases like A-z or a-9 (since such expressions are encoding
dependent), with a recommendation to use physical ranges.
p4raw-id: //depot/perl@10085
|
|
|
|
|
| |
Message-ID: <20010507215612.A31114@penderel>
p4raw-id: //depot/perl@10021
|
|
|
|
|
| |
Message-Id: <200105041709.SAA14835@tempest.npl.co.uk>
p4raw-id: //depot/perl@9991
|
|
|
| |
p4raw-id: //depot/perl@9987
|
|
|
|
|
| |
Message-Id: <200104291609.RAA17790@crypt.compulink.co.uk>
p4raw-id: //depot/perl@9911
|
|
|
|
|
|
| |
at the left hand side if there were 0x100.. characters in the
character class.
p4raw-id: //depot/perl@9901
|
|
|
|
|
| |
Message-Id: <200104262233.XAA22352@crypt.compulink.co.uk>
p4raw-id: //depot/perl@9873
|
|
|
| |
p4raw-id: //depot/perl@9852
|
|
|
|
|
|
| |
Subject: [PATCH @9846] dumping ANYOF
Message-Id: <200104260432.FAA12669@crypt.compulink.co.uk>
p4raw-id: //depot/perl@9851
|
|
|
|
|
| |
Message-ID: <20010422012749.27024.qmail@plover.com>
p4raw-id: //depot/perl@9778
|
|
|
|
|
|
|
| |
Message-ID: <20010421182439.16508.qmail@plover.com>
Regex debugger backend.
p4raw-id: //depot/perl@9776
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
[ 9400]
More EBCDIC tweaks:
- one more swash issue &~(0xA0-1) did not do the right thing,
for UTF-EBCDIC where &~(0x80-1) does for UTF-8.
- add "use re 'asciirange'" to make [!-~] etc. work
use it in MIME::QuotedPrint and t/op/regexp.t and t/op/pat.t
- Choose a key for t/op/each.t test which gets encoded.
- Skip utf8decode if this is UTF-EBCDIC.
p4raw-link: @9400 on //depot/perlio: daf0f78e031c718c75590ef9ef573756f805776e
p4raw-id: //depot/perl@9407
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
- Loose the extra level of function on ASCII.
- spotted a chr(0) issue in sv.c
- re-work of UTF-X tr/// ranges to work in Unicode
space. Still issues with the "0xff is illegal UTF-8" hack.
- Yet another ad. hoc. utf8 'upgrade' in op.c recoded
(why do it once when you can do it all over the place :-(
- Enable HINTS_UTF8 on EBCDIC - then ignore it in toke.c,
need utf8.pm for swashes.
- Simplified and commented scan_const() in toke.c
Still something wrong regexp and tr (swashes?).
p4raw-id: //depot/perlio@9267
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
fix leak in pregcomp() when RE fails to compile (e.g. m/\\/)
remove squelch controls for "Scalars leaked" messages in most places
(these are now cured)
fix another memory leak reported by purify (tie callbacks that
croak can leak when wiping out magic)
p4raw-link: @9142 on //depot/maint-5.6/perl: 26972843796e21c404c9d13ec5ee86e7b952a2bd
p4raw-link: @9138 on //depot/maint-5.6/perl: ad7f1144250940f9ca43bac32708ec5e718b30ff
p4raw-link: @9137 on //depot/maint-5.6/perl: 1f35595ecca168b4f66e3399344799fdbd496d17
p4raw-id: //depot/perl@9144
p4raw-integrated: from //depot/maint-5.6/perl@9143 'copy in'
t/pragma/strict-vars (@7318..) t/pragma/warn/regcomp (@7887..)
t/op/regexp.t (@8551..) t/op/lex_assign.t (@8987..) 'merge in'
t/op/local.t (@5902..) t/pragma/warn/op (@7846..)
t/pragma/warnings.t (@7895..) t/comp/proto.t (@8173..)
t/pragma/warn/toke (@8570..) regcomp.c (@8777..) scope.c
(@8855..) t/op/pat.t (@9076..)
|
|
|
|
|
| |
as it is doing compare against 'W' in \W etc.
p4raw-id: //depot/perlio@9106
|
|
|
|
|
| |
Use ASCII_TO_NATIVE and NATIVE_TO_ASCII to avoid some #ifs.
p4raw-id: //depot/perlio@9105
|
|
|
| |
p4raw-id: //depot/perl@9098
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
- rename utf8/uv functions to indicate what sort of uv they provide (uvuni/uvchr)
- use utf8n_xxxx (c.f. pvn) for forms which take length.
- back out vN.N and $^V exceptions to e2a/a2e
- make "locale" isxxx macros be uvchr (may be redundant?)
Not clear yet that toUPPER_uni et. al. return being handled correctly.
The tr// and rexexp stuff still needs an audit, assumption is they are working
in Unicode space.
Need to provide v5.6 names for XS modules (decide is uni or chr ?).
p4raw-id: //depot/perlio@9096
|
|
|
| |
p4raw-id: //depot/perl@9058
|
|
|
| |
p4raw-id: //depot/perl@8935
|
|
|
|
|
| |
Attempt to fix the EBCDIC character range problem with //.
p4raw-id: //depot/perl@8930
|