summaryrefslogtreecommitdiff
path: root/utf8.h
Commit message (Collapse)AuthorAgeFilesLines
* Use the UTF8 macros a bit. They can't be used with abandonJarkko Hietaniemi2000-12-081-0/+5
| | | | | | everywhere because we do generate illegal UTF-8 in some situations. This is of course naughty. p4raw-id: //depot/perl@8033
* Introduce macros for UTF8 decoding.Jarkko Hietaniemi2000-12-081-1/+16
| | | p4raw-id: //depot/perl@8028
* UINT64_C() work continues.Jarkko Hietaniemi2000-11-151-2/+0
| | | p4raw-id: //depot/perl@7700
* Use UINT64_C().Jens Hamisch2000-11-151-1/+5
| | | | | | Subject: [ID 20001114.006] 5.7.0-7680 Solaris 8, 64 bit, utf8 patch Message-Id: <20001114191623.G20559@Strawberry.COM> p4raw-id: //depot/perl@7691
* [ID 20001113.003] utf8_to_uv on malformed utf returns wrong valuesYitzchak Scott-Thoennes2000-11-141-0/+2
| | | | | Message-Id: <200011132249.eADMnek09679@garcia.efn.org> p4raw-id: //depot/perl@7677
* Allow poking holes at the UTF-8 decoding strictness.Jarkko Hietaniemi2000-10-251-1/+12
| | | p4raw-id: //depot/perl@7438
* Rename UTF8LEN() to be UNISKIP(), too confusing to haveJarkko Hietaniemi2000-10-251-2/+2
| | | | | UTF8LEN() and UTF8SKIP(). p4raw-id: //depot/perl@7437
* Make the UTF-8 decoding stricter and more verbose whenJarkko Hietaniemi2000-10-241-1/+3
| | | | | | | | | | | | malformation happens. This involved adding an argument to utf8_to_uv_chk(), which involved changing its prototype, and prefer STRLEN over I32 for the UTF-8 length, which as a domino effect necessitated changing the prototypes of scan_bin(), scan_oct(), scan_hex(), and reg_uni(). The stricter UTF-8 decoding checking uses Markus Kuhn's UTF-8 Decode Stress Tester from http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt p4raw-id: //depot/perl@7416
* Make ~(chr(a).chr(b)) eq chr(~a).chr(~b) on utf8.Simon Cozens2000-10-151-0/+18
| | | | | | Subject: [PATCH] Re: [ID 20000918.005] ~ on wide chars Message-ID: <20001014205213.A9645@pembro4.pmb.ox.ac.uk> p4raw-id: //depot/perl@7235
* Tweak #7153.Jarkko Hietaniemi2000-10-061-2/+7
| | | p4raw-id: //depot/perl@7154
* Patch from Simon Cozens to avoid using utf8 routines in EBCDIC.Jarkko Hietaniemi2000-10-061-2/+2
| | | p4raw-id: //depot/perl@7153
* allocate sufficient buffer sizes for 64-bit wide utf8 charactersGurusamy Sarathy2000-02-191-0/+2
| | | | | | | permitted by change#5011 (from Gisle Aas) p4raw-link: @5011 on //depot/perl: 3c77ea2bace63b1ad27d15a6366cb938bdd158cb p4raw-id: //depot/perl@5136
* allow 64-bit utf8-encoded integers (from Ilya Zakharevich)Gurusamy Sarathy2000-02-071-1/+2
| | | p4raw-id: //depot/perl@5011
* set SvUTF8 on vectors only if there are chars > 127; update copyrightGurusamy Sarathy2000-02-061-1/+1
| | | | | years (from Gisle Aas) p4raw-id: //depot/perl@5009
* HINT_UTF8 is not propagated to the op tree anymore; add aGurusamy Sarathy2000-02-011-1/+1
| | | | | perlunicode.pod that reflects changes to unicode support so far p4raw-id: //depot/perl@4941
* runtime now looks at the SVf_UTF8 bit on the SV to decideGurusamy Sarathy2000-01-311-0/+16
| | | | | | | | | whether to use widechar semantics; lexer and RE engine continue to need "use utf8" to enable unicode awareness in literals and patterns (TODO: this needs to be fixed); $1 et al are marked SvUTF8 if the pattern was compiled for utf8 (TODO: propagating it from the data is probably better) p4raw-id: //depot/perl@4930
* Re-integrate mainlineNick Ing-Simmons1999-09-181-0/+1
| | | | | Basic SvUTF8 stuff in headers, no functional changes yet. p4raw-id: //depot/utfperl@4193
* EXTERN_C declarations for global arrays in variousGurusamy Sarathy1999-06-121-0/+4
| | | | | | headers, so perl can be built even in C++ mode; win32 build fixups; regen headers p4raw-id: //depot/perl@3537
* update copyright yearsGurusamy Sarathy1999-03-221-1/+1
| | | p4raw-id: //depot/perl@3124
* s/Perl_utf8skip/PL_utf8skip/gGurusamy Sarathy1998-11-171-3/+3
| | | p4raw-id: //depot/perl@2241
* fix globals caught by change#1927; builds and tests on SolarisGurusamy Sarathy1998-10-061-1/+1
| | | | | p4raw-link: @1927 on //depot/perl: eb07465ebe1238598e948058857ec948c6697f86 p4raw-id: //depot/perl@1936
* add new files to MANIFEST; add missing prototypes to proto.h;Gurusamy Sarathy1998-07-261-3/+3
| | | | | | s/PL_utf8skip/utf8skip/ for now, or we end up with Perl_PL_; add typecasts to silence warnings; tweaks for win32 builds p4raw-id: //depot/perl@1663
* Here are the long-expected Unicode/UTF-8 modifications.Larry Wall1998-07-241-0/+27
p4raw-id: //depot/utfperl@1651