| Commit message (Collapse) | Author | Age | Files | Lines |
|
|
|
|
| |
Message-ID: <Pine.GSO.4.21.0109070933530.7041-100000@crusoe.crusoe.net>
p4raw-id: //depot/perl@11934
|
|
|
|
|
|
| |
Subject: Re: [PATCH regexec.c] more general .* and .*?
Message-ID: <Pine.GSO.4.21.0109070836110.7041-100000@crusoe.crusoe.net>
p4raw-id: //depot/perl@11933
|
|
|
|
|
|
| |
code related to PL_reg_sv (so PL_reg_sv_utf8 was logical)
but that is no more the case: PL_reg_match_utf8 is better.
p4raw-id: //depot/perl@11823
|
|
|
|
|
| |
working in 5.7.x
p4raw-id: //depot/perl@11803
|
|
|
|
|
|
| |
Thanks to H. Merijn Brand for the patch.
Some of the comments and or guards might be removable in perl.h now.
p4raw-id: //depot/perl@11758
|
|
|
|
|
| |
"was the last match target UTF8" into its own variable.
p4raw-id: //depot/perl@11717
|
|
|
|
|
| |
Message-Id: <200108161527.f7GFRnH09419@crypt.compulink.co.uk>
p4raw-id: //depot/perl@11697
|
|
|
|
|
| |
Message-Id: <200108150933.f7F9Xnr11222@crypt.compulink.co.uk>
p4raw-id: //depot/perl@11676
|
|
|
|
|
|
| |
plus more tests that unearthed a bug in @a = ($utf8 =~ /\C/g),
plus a fix for the bug.
p4raw-id: //depot/perl@11577
|
|
|
|
|
|
|
|
|
| |
the new CANY is the \C. The problem reported and the
test case supplied in
Subject: UTF-8 bugs in string length & single line regex matches
Message-ID: <20010803113932.A19318@berrange.com>
p4raw-id: //depot/perl@11575
|
|
|
|
|
| |
Message-ID: <Pine.LNX.4.21.0108040939050.23972-100000@mako.covalent.net>
p4raw-id: //depot/perl@11574
|
|
|
|
|
| |
Message-ID: <Pine.LNX.4.21.0108031730040.23972-100000@mako.covalent.net>
p4raw-id: //depot/perl@11567
|
|
|
|
|
| |
Message-ID: <20010721223325.3069.qmail@plover.com>
p4raw-id: //depot/perl@11437
|
|
|
|
|
| |
Message-ID: <B77322D6.22DD%artur@contiller.se>
p4raw-id: //depot/perl@11299
|
|
|
| |
p4raw-id: //depot/perl@11038
|
|
|
|
|
| |
Message-ID: <Pine.LNX.4.21.0106210657400.1693-100000@oregonnet.com>
p4raw-id: //depot/perl@10780
|
|
|
| |
p4raw-id: //depot/perl@10392
|
|
|
|
|
|
| |
patch: rename HINT_BYTE and IN_BYTE to HINT_BYTES and IN_BYTES
to match the pragma name; various robustness cleanups.
p4raw-id: //depot/perl@10339
|
|
|
| |
p4raw-id: //depot/perl@10335
|
|
|
|
|
| |
lib/complex dumped core.
p4raw-id: //depot/perl@10261
|
|
|
|
|
| |
Message-Id: <200105250124.KAA19571@toshiba.co.jp>
p4raw-id: //depot/perl@10206
|
|
|
|
|
| |
Message-Id: <200105191912.UAA23925@gizmo.fdgroup.co.uk>
p4raw-id: //depot/perl@10168
|
|
|
|
|
| |
Message-Id: <200105172307.AAA06142@crypt.compulink.co.uk>
p4raw-id: //depot/perl@10152
|
|
|
|
|
| |
Message-Id: <200104291609.RAA17790@crypt.compulink.co.uk>
p4raw-id: //depot/perl@9911
|
|
|
|
|
|
| |
at the left hand side if there were 0x100.. characters in the
character class.
p4raw-id: //depot/perl@9901
|
|
|
| |
p4raw-id: //depot/perl@9682
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
fix for bug 20010410.006, undo change#7115
port the OpenBSD glob() security patch
p4raw-link: @9676 on //depot/maint-5.6/perl: 3f3c3e312f619efa81ad88565a24e92f15dff662
p4raw-link: @9675 on //depot/maint-5.6/perl: c84593816ace2807d5ff27bb0745a28ec29187b1
p4raw-link: @7115 on //depot/perl: 5675c1a6395a0842c857fc8de159747577df6c4b
p4raw-id: //depot/perl@9677
p4raw-integrated: from //depot/maint-5.6/perl@9672 'copy in'
ext/File/Glob/bsd_glob.h (@9264..) ext/File/Glob/bsd_glob.c
(@9512..) ext/File/Glob/Glob.xs (@9545..) 'merge in' t/op/pat.t
(@9138..) regexec.c (@9288..) ext/File/Glob/Glob.pm (@9512..)
|
|
|
|
|
| |
Message-ID: <Pine.OSF.4.10.10103301805450.63762-100000@aspara.forte.com>
p4raw-id: //depot/perl@9485
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
- Loose the extra level of function on ASCII.
- spotted a chr(0) issue in sv.c
- re-work of UTF-X tr/// ranges to work in Unicode
space. Still issues with the "0xff is illegal UTF-8" hack.
- Yet another ad. hoc. utf8 'upgrade' in op.c recoded
(why do it once when you can do it all over the place :-(
- Enable HINTS_UTF8 on EBCDIC - then ignore it in toke.c,
need utf8.pm for swashes.
- Simplified and commented scan_const() in toke.c
Still something wrong regexp and tr (swashes?).
p4raw-id: //depot/perlio@9267
|
|
|
| |
p4raw-id: //depot/perl@9148
|
|
|
| |
p4raw-id: //depot/perl@9098
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
- rename utf8/uv functions to indicate what sort of uv they provide (uvuni/uvchr)
- use utf8n_xxxx (c.f. pvn) for forms which take length.
- back out vN.N and $^V exceptions to e2a/a2e
- make "locale" isxxx macros be uvchr (may be redundant?)
Not clear yet that toUPPER_uni et. al. return being handled correctly.
The tr// and rexexp stuff still needs an audit, assumption is they are working
in Unicode space.
Need to provide v5.6 names for XS modules (decide is uni or chr ?).
p4raw-id: //depot/perlio@9096
|
|
|
|
|
| |
Message-Id: <200103081206.MAA06281@tiree.fdgroup.co.uk>
p4raw-id: //depot/perl@9084
|
|
|
|
|
|
|
|
|
|
| |
Fixes 20001230.002.
What still remains broken is that the submatches that
have \C in them get their UTF8 flag on because their
parent SV has it on. This will result in malformed
UTF8 if a \C happened to match a non-ASCII byte.
p4raw-id: //depot/perl@8836
|
|
|
| |
p4raw-id: //depot/perl@8769
|
|
|
| |
p4raw-id: //depot/perl@8762
|
|
|
| |
p4raw-id: //depot/perl@8566
|
|
|
| |
p4raw-id: //depot/perl@8510
|
|
|
| |
p4raw-id: //depot/perl@8415
|
|
|
| |
p4raw-id: //depot/perl@8414
|
|
|
|
|
| |
Message-Id: <200010300133.BAA10390@crypt.compulink.co.uk>
p4raw-id: //depot/perl@8403
|
|
|
|
|
|
|
| |
Message-ID: <3A59E510.52BAB5B9@st.rim.or.jp>
UTF-8 fixes for 'x' and tr///.
p4raw-id: //depot/perl@8378
|
|
|
| |
p4raw-id: //depot/perl@8328
|
|
|
| |
p4raw-id: //depot/perl@8289
|
|
|
|
|
|
|
|
| |
Message-ID: <3A4D722D.243AFD88@st.rim.or.jp>
Just the patch part for now, and the pragma renamed
as unicode::distinct.
p4raw-id: //depot/perl@8267
|
|
|
|
|
|
|
| |
Message-Id: <p04320404b6639e7aa043@[192.168.1.4]>
This patchlet is needed in order that perl can be statically linked.
p4raw-id: //depot/perl@8191
|
|
|
|
|
|
|
| |
Fixes at least the bugs 20001028.003 (both of them...) and
20001108.001. The bugs 20001114.001 and 20001205.014 seem
also to be fixed by now, probably already before this patch.
p4raw-id: //depot/perl@8143
|
|
|
| |
p4raw-id: //depot/perl@7984
|
|
|
|
|
|
| |
used to swash_init(), makes regprop() dumps more informative
(+utf8::IsAlpha, -utf8::IsDigit, for example).
p4raw-id: //depot/perl@7969
|
|
|
|
|
| |
always use (at least) UTF8_MAXLEN + 1 U8s deep buffer.
p4raw-id: //depot/perl@7967
|