path: root/utf8.c
Commit message | Author | Age | Files | Lines
* more warnings tidyup | Paul Marquess | 2002-03-11 | 1 | -4/+4
    From: "Paul Marquess" <paul_marquess@yahoo.co.uk> Message-ID: <AIEAJICLCBDNAAOLLOKLMEEGDPAA.paul_marquess@yahoo.co.uk> p4raw-id: //depot/perl@15155
* EBCDIC: this seems to calm the last of the | Jarkko Hietaniemi | 2002-02-24 | 1 | -12/+8
    Malformed UTF-8 warnings. p4raw-id: //depot/perl@14850
* In EBCDIC the UNI_TO_NATIVE() macro evaluates its argument | Jarkko Hietaniemi | 2002-02-20 | 1 | -2/+4
    twice, causing the loop to skip every other character. p4raw-id: //depot/perl@14800
* Misplaced block end. | Jarkko Hietaniemi | 2002-02-19 | 1 | -1/+1
    p4raw-id: //depot/perl@14766
* Oops. | Jarkko Hietaniemi | 2002-02-19 | 1 | -1/+1
    p4raw-id: //depot/perl@14762
* Try special casing first. | Jarkko Hietaniemi | 2002-02-19 | 1 | -27/+33
    p4raw-id: //depot/perl@14759
* Unused in ASCII, used in EBCDIC. | Jarkko Hietaniemi | 2002-02-18 | 1 | -2/+1
    p4raw-id: //depot/perl@14747
* EBCDIC: now the worst seems to be over for | Jarkko Hietaniemi | 2002-02-18 | 1 | -12/+8
    the "Malformed" warnings. Still a few of them, and plenty of test failures, but getting better. p4raw-id: //depot/perl@14739
* After much rewriting we are now pretty much | Jarkko Hietaniemi | 2002-02-18 | 1 | -44/+41
    back to where we started. p4raw-id: //depot/perl@14737
* Clearing up to_utf8_case() continues: this time use | Jarkko Hietaniemi | 2002-02-17 | 1 | -56/+58
    a single return, and EBCDICification for all paths. p4raw-id: //depot/perl@14734
* Tiny tweak. | Jarkko Hietaniemi | 2002-02-17 | 1 | -6/+2
    p4raw-id: //depot/perl@14732
* Redundant casts. | Jarkko Hietaniemi | 2002-02-17 | 1 | -18/+18
    p4raw-id: //depot/perl@14731
* EBCDIC: to_utf8_case() is supposed to get its low 256 | Jarkko Hietaniemi | 2002-02-17 | 1 | -2/+2
    input in native code points, not Unicode. p4raw-id: //depot/perl@14726
* The #14715 and #14716 were okay: they just revealed | Jarkko Hietaniemi | 2002-02-17 | 1 | -14/+30
    a bug in the EXACTF matching. p4raw-id: //depot/perl@14724
* Retreat, retreat! (retract #14715 and #14716) | Jarkko Hietaniemi | 2002-02-16 | 1 | -26/+14
    p4raw-id: //depot/perl@14723
* Tiny tweak. | Jarkko Hietaniemi | 2002-02-16 | 1 | -4/+5
    p4raw-id: //depot/perl@14716
* Restructure to_utf8_case() for simpler execution paths. | Jarkko Hietaniemi | 2002-02-16 | 1 | -14/+25
    p4raw-id: //depot/perl@14715
* Excise inexact blather. | Jarkko Hietaniemi | 2002-02-14 | 1 | -5/+0
    p4raw-id: //depot/perl@14687
* Iteration continues. | Jarkko Hietaniemi | 2002-02-13 | 1 | -17/+20
    p4raw-id: //depot/perl@14669
* Rewrite the "special mapping" part of to_utf8_case(), | Jarkko Hietaniemi | 2002-02-13 | 1 | -26/+46
    this time with fewer bugs. (See: The Law of Cybernetic Entymology.) p4raw-id: //depot/perl@14664
* EBCDIC: another "can't happen". | Jarkko Hietaniemi | 2002-02-12 | 1 | -1/+5
    p4raw-id: //depot/perl@14660
* format problem | Robin Barker | 2002-02-06 | 1 | -1/+1
    Message-Id: <200202061401.OAA25053@tempest.npl.co.uk> p4raw-id: //depot/perl@14570
* The Malformed UTF-8 Heisenbug seen by Merijn and NickC | Jarkko Hietaniemi | 2002-02-01 | 1 | -1/+5
    I got it in Tru64 + ithreads but only without -g, took some debugging by printf (which was no fun either since adding some debug printfs hid the error) p4raw-id: //depot/perl@14511
* Turn the I/O Unicode error by default on, but the | Jarkko Hietaniemi | 2002-01-31 | 1 | -1/+1
    character-generating Unicode error by default off, as Larry suggested. p4raw-id: //depot/perl@14505
* EBCDIC fix: t/op/lc.t failures 24-25, 29-30, 34-35, 39-40 | Jarkko Hietaniemi | 2002-01-30 | 1 | -0/+1
    p4raw-id: //depot/perl@14494
* EBCDIC tweaks-- no new test passes, but getting closer. | Jarkko Hietaniemi | 2002-01-29 | 1 | -3/+16
    p4raw-id: //depot/perl@14491
* Copyright++. (Not all the toplevel *.h have one, it seems.) | Jarkko Hietaniemi | 2002-01-23 | 1 | -1/+1
    p4raw-id: //depot/perl@14391
* In dumping use isPRINT() instead of isprint() so that locale | Jarkko Hietaniemi | 2002-01-09 | 1 | -2/+3
    does not come into play. p4raw-id: //depot/perl@14146
* Document the flags of pv_uni_display(). | Jarkko Hietaniemi | 2002-01-07 | 1 | -9/+17
    p4raw-id: //depot/perl@14117
* More regex and utf8 debug dumping. | Jarkko Hietaniemi | 2002-01-07 | 1 | -3/+26
    p4raw-id: //depot/perl@14114
* Finish up (ha!) the Unicode case folding; | Jarkko Hietaniemi | 2002-01-05 | 1 | -2/+6
    enhance regex dumping code. p4raw-id: //depot/perl@14096
* Missed the =head1 additions. | Jarkko Hietaniemi | 2002-01-03 | 1 | -2/+2
    p4raw-id: //depot/perl@14041
* One more iteration of the ibcmp_utf8() interface, | Jarkko Hietaniemi | 2002-01-02 | 1 | -39/+55
    hopefully this is a convergent iteration... p4raw-id: //depot/perl@14014
* Make ibcmp_utf8() optionally progress in either string for | Jarkko Hietaniemi | 2002-01-02 | 1 | -8/+33
    as long as it takes and optionally record how far it got. p4raw-id: //depot/perl@14010
* -Wall silencing. | Jarkko Hietaniemi | 2002-01-02 | 1 | -2/+2
    p4raw-id: //depot/perl@14008
* Make ibcmp_utf8() more robust and make regmatch() use it. | Jarkko Hietaniemi | 2002-01-01 | 1 | -30/+36
    p4raw-id: //depot/perl@14005
* Document the to_utf8_*() functions. | Jarkko Hietaniemi | 2002-01-01 | 1 | -1/+57
    p4raw-id: //depot/perl@14002
* Rename the variables to be a bit more consistent. | Jarkko Hietaniemi | 2002-01-01 | 1 | -24/+24
    p4raw-id: //depot/perl@13993
* Unnecessary/Lingering UTF8 flag might mess up caseless matching | Jarkko Hietaniemi | 2002-01-01 | 1 | -16/+12
    (found by Jeffrey Friedl) p4raw-id: //depot/perl@13992
* Make that six wrongs. | Jarkko Hietaniemi | 2001-12-30 | 1 | -2/+1
    p4raw-id: //depot/perl@13966
* Four wrongs can make a right. | Jarkko Hietaniemi | 2001-12-30 | 1 | -2/+3
    p4raw-id: //depot/perl@13965
* Fix all the C(ommon) case foldings as per CaseFold.txt. | Jarkko Hietaniemi | 2001-12-30 | 1 | -1/+1
    p4raw-id: //depot/perl@13963
* Make using U+FDD0..U+FDEF (noncharacters since Unicode 3.1), | Jarkko Hietaniemi | 2001-12-21 | 1 | -7/+38
    U+...FFFE, U+...FFFF, and characters beyond U+10FFFF (the Unicode maximum code point) warnable offenses. p4raw-id: //depot/perl@13823
* Fix up the APIs noone hopefully uses. | Jarkko Hietaniemi | 2001-12-20 | 1 | -3/+15
    p4raw-id: //depot/perl@13817
* Some bincompat clawbacks. | Jarkko Hietaniemi | 2001-12-19 | 1 | -0/+18
    p4raw-id: //depot/perl@13813
* Can't printf U8s as UVs. | Jarkko Hietaniemi | 2001-12-19 | 1 | -1/+1
    p4raw-id: //depot/perl@13801
* Make the utf8 malformedness messages more verbose. | Jarkko Hietaniemi | 2001-12-18 | 1 | -9/+16
    p4raw-id: //depot/perl@13757
* Demote the surrogate and non-character errors to warnings. | Jarkko Hietaniemi | 2001-12-17 | 1 | -5/+8
    p4raw-id: //depot/perl@13740
* Disallow also Unicode ranges 0xFDD0..0xFDEF and | Jarkko Hietaniemi | 2001-12-16 | 1 | -2/+5
    0xFFFE..0xFFFF. Ranges 0x...FFFE..0x...FFFF in general, and characters beyond 0x10FFF should be disallowed, too, but some tests would need changing, but more importantly some APIs would need remodeling since one can easily generate such characters either by bitwise complements, tr complements, or v-strings. p4raw-id: //depot/perl@13722
* Make surrogates illegal also on EBCDIC. | Jarkko Hietaniemi | 2001-12-16 | 1 | -2/+2
    p4raw-id: //depot/perl@13721
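
Note on the code point ranges named in the 2001-12-16 through 2001-12-21 entries: the following is a minimal C sketch of how those ranges could be classified. It is not code from utf8.c. The function names (`cp_is_surrogate`, `cp_is_nonchar`, `cp_is_beyond_unicode`, `cp_is_warnable`) are hypothetical, and the surrogate range U+D800..U+DFFF comes from the Unicode standard rather than from the log itself.

```c
#include <stdint.h>

/* Hypothetical helpers for illustration only; none of these names
 * appear in utf8.c. */

/* Surrogates occupy U+D800..U+DFFF (range taken from the Unicode
 * standard, not from the commit log). */
static int cp_is_surrogate(uint32_t cp)
{
    return cp >= 0xD800 && cp <= 0xDFFF;
}

/* Noncharacters: U+FDD0..U+FDEF, plus the last two code points of
 * every plane (U+...FFFE and U+...FFFF) up to U+10FFFF. */
static int cp_is_nonchar(uint32_t cp)
{
    if (cp > 0x10FFFF)
        return 0; /* out of range is reported separately below */
    return (cp >= 0xFDD0 && cp <= 0xFDEF) || ((cp & 0xFFFE) == 0xFFFE);
}

/* Anything above U+10FFFF is beyond the Unicode maximum code point. */
static int cp_is_beyond_unicode(uint32_t cp)
{
    return cp > 0x10FFFF;
}

/* Assumption: a caller would warn (not fail) on any of the three
 * categories, matching the "warnable offenses" wording in the log. */
static int cp_is_warnable(uint32_t cp)
{
    return cp_is_surrogate(cp) || cp_is_nonchar(cp) || cp_is_beyond_unicode(cp);
}
```

With these definitions, `cp_is_warnable(0xFDD0)`, `cp_is_warnable(0x1FFFE)` and `cp_is_warnable(0x110000)` are all true, while `cp_is_warnable(0x41)` is false. Keeping the three predicates separate makes it possible to emit a distinct message per category.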