| Commit message (Collapse) | Author | Age | Files | Lines |
|
|
|
|
|
|
| |
Message-ID: <3A59E510.52BAB5B9@st.rim.or.jp>
UTF-8 fixes for 'x' and tr///.
p4raw-id: //depot/perl@8378
|
|
|
| |
p4raw-id: //depot/perl@8332
|
|
|
| |
p4raw-id: //depot/perl@8323
|
|
|
|
|
| |
in is_utf8_string().
p4raw-id: //depot/perl@8300
|
|
|
| |
p4raw-id: //depot/perl@8289
|
|
|
| |
p4raw-id: //depot/perl@8274
|
|
|
|
|
|
|
|
| |
Message-ID: <3A4D722D.243AFD88@st.rim.or.jp>
Just the patch part for now, and the pragma renamed
as unicode::distinct.
p4raw-id: //depot/perl@8267
|
|
|
|
|
| |
it revealed a bug in #8248 (the UTF8_EIGHT_BIT_LO() was wrong).
p4raw-id: //depot/perl@8249
|
|
|
|
|
| |
decoding goes awry, it should be up to the caller to decide.
p4raw-id: //depot/perl@8042
|
|
|
|
|
|
|
| |
Message-ID: <20001208133331.A11535@deep-dark-truthful-mirror.perlhacker.org>
(The pp_hot part needed a rewrite.)
p4raw-id: //depot/perl@8039
|
|
|
|
|
|
| |
everywhere because we do generate illegal UTF-8 in some situations.
This is of course naughty.
p4raw-id: //depot/perl@8033
|
|
|
| |
p4raw-id: //depot/perl@8028
|
|
|
| |
p4raw-id: //depot/perl@8024
|
|
|
| |
p4raw-id: //depot/perl@8023
|
|
|
|
|
| |
the error message.
p4raw-id: //depot/perl@7996
|
|
|
| |
p4raw-id: //depot/perl@7984
|
|
|
|
|
| |
always use (at least) UTF8_MAXLEN + 1 U8s deep buffer.
p4raw-id: //depot/perl@7967
|
|
|
| |
p4raw-id: //depot/perl@7940
|
|
|
|
|
|
|
| |
Message-ID: <Pine.OSF.4.10.10011291233120.328738-100000@aspara.forte.com>
plus regen perlapi.pod.
p4raw-id: //depot/perl@7932
|
|
|
|
|
| |
until eternity.
p4raw-id: //depot/perl@7911
|
|
|
| |
p4raw-id: //depot/perl@7872
|
|
|
|
|
| |
is unused at the moment) to be less forgiving about bad UTF-8.
p4raw-id: //depot/perl@7869
|
|
|
| |
p4raw-id: //depot/perl@7744
|
|
|
|
|
| |
Message-ID: <Pine.OSF.4.10.10011141500260.106218-100000@aspara.forte.com>
p4raw-id: //depot/perl@7695
|
|
|
| |
p4raw-id: //depot/perl@7693
|
|
|
|
|
|
| |
Subject: [ID 20001114.006] 5.7.0-7680 Solaris 8, 64 bit, utf8 patch
Message-Id: <20001114191623.G20559@Strawberry.COM>
p4raw-id: //depot/perl@7691
|
|
|
|
|
| |
Message-Id: <200011132249.eADMnek09679@garcia.efn.org>
p4raw-id: //depot/perl@7677
|
|
|
| |
p4raw-id: //depot/perl@7671
|
|
|
|
|
|
|
|
| |
char on the stack and pop an unsigned quad off the stack.
Subject: Re: [ID 20001103.002] Not OK: perl v5.7.0 +DEVEL7523 on os2-64int-ld-2.30 (UNINSTALLED)
Message-ID: <pxzB6gzkgKXY092yn@efn.org>
p4raw-id: //depot/perl@7584
|
|
|
| |
p4raw-id: //depot/perl@7509
|
|
|
| |
p4raw-id: //depot/perl@7481
|
|
|
|
|
|
|
|
| |
Rename utf8_to_uv_chk() back to utf8_to_uv() because it's
used much more than the simpler API, now called utf8_to_uv_simple().
Still not quite happy with API, too much partial duplication
of functionality.
p4raw-id: //depot/perl@7439
|
|
|
| |
p4raw-id: //depot/perl@7438
|
|
|
|
|
| |
UTF8LEN() and UTF8SKIP().
p4raw-id: //depot/perl@7437
|
|
|
|
|
|
|
|
| |
Subject: Encode bug?
Message-ID: <m3lmveqwh5.fsf@ak-71.mind.de>
Also make is_utf8_char() stricter.
p4raw-id: //depot/perl@7425
|
|
|
|
|
|
|
|
|
|
|
|
| |
malformation happens. This involved adding an argument
to utf8_to_uv_chk(), which involved changing its prototype,
and prefer STRLEN over I32 for the UTF-8 length, which as
a domino effect necessitated changing the prototypes of
scan_bin(), scan_oct(), scan_hex(), and reg_uni().
The stricter UTF-8 decoding checking uses Markus Kuhn's
UTF-8 Decode Stress Tester from
http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt
p4raw-id: //depot/perl@7416
|
|
|
| |
p4raw-id: //depot/perl@7223
|
|
|
| |
p4raw-id: //depot/perl@7222
|
|
|
| |
p4raw-id: //depot/perl@7156
|
|
|
| |
p4raw-id: //depot/perl@7152
|
|
|
|
|
|
|
| |
i.e. rename Simon's function to Perl_utf8_to_uv_chk, change all calls to it
to use new name and add Perl_utf8_to_uv() as a wrapper which calls it passing
0 to checking to get the warning.
p4raw-id: //depot/perl@7096
|
|
|
|
|
| |
Message-ID: <20000914234657.A13953@deep-dark-truthful-mirror.perlhacker.org>
p4raw-id: //depot/perl@7087
|
|
|
|
|
|
| |
Subject: Re: perl@7078
Message-Id: <200009142109.RAA03425@leggy.zk3.dec.com>
p4raw-id: //depot/perl@7085
|
|
|
|
|
|
|
|
| |
convertible to bytes by checking it doesn't go above 0xc3
Subject: Re: perl@7078
Message-ID: <20000914205919.A11098@deep-dark-truthful-mirror.perlhacker.org>
p4raw-id: //depot/perl@7084
|
|
|
| |
p4raw-id: //depot/perl@7075
|
|
|
|
|
|
|
|
| |
Subject: [ID 20000903.001] \w in utf8-strings
Message-Id: <E13VUS5-0000cv-00.pgcc-forever-2000-09-03-09-44-29@fuji>
and various related nits.
p4raw-id: //depot/perl@7030
|
|
|
|
|
| |
Message-ID: <20000903051206.A5909@cerebro.laendle>
p4raw-id: //depot/perl@7021
|
|
|
|
|
| |
Message-Id: <E13Utuf-0004Bw-00@draco.cus.cam.ac.uk>
p4raw-id: //depot/perl@6988
|
|
|
|
|
| |
on Windows)
p4raw-id: //depot/perl@6476
|
|
|
|
|
|
|
|
| |
(UTF16-LE) still fails (silently, no output) but the #22
(UTF16-BE) seems to be working now. The root of the
failure may be in sv_gets(): is it UTF-16LE-aware,
especially when it comes to line endings?
p4raw-id: //depot/perl@6469
|