path: root/utf8.h
Commit message | Author | Age | Files | Lines
* Bump copyright year in embed.pl and various files that were just touched | Rafael Garcia-Suarez | 2009-03-30 | 1 | -1/+1
| (and run "make regen")
| (cherry picked from commit 83706693c63eb4fe0fd171a88263c83548c89029)
* Integrate: | Nicholas Clark | 2008-01-30 | 1 | -0/+10
| [ 32688] Update the hash documentation to reflect the changes between 5.8.1 and 5.8.2, that disabled the automatic randomisation of all hashes.
| [ 32729] Subject: [PATCH] docs: replace FH by my $fh in open From: "Gabor Szabo" <szabgab@gmail.com> Date: Wed, 26 Dec 2007 06:03:29 +0200 Message-ID: <d8a74af10712252003m2d3244fbv2955fe17e683063d@mail.gmail.com>
| [ 32730] Subject: [PATCH] docs: list of places where $_ is used From: "Gabor Szabo" <szabgab@gmail.com> Date: Wed, 26 Dec 2007 06:49:34 +0200 Message-ID: <d8a74af10712252049t1b63b475mfca9225324f5fce6@mail.gmail.com>
| [ 32732] Subject: [PATCH perl5100delta.pod] change/correction to 32246 From: "Robin Barker" <Robin.Barker@npl.co.uk> Date: Fri, 9 Nov 2007 14:40:22 -0000 Message-ID: <2C2E01334A940D4792B3E115F95B722601604912@exchsvr1.npl.ad.local>
| [ 32759] Note to future self about moving the regexp flag bits around.
| [ 32793] Add editor blocks to some header files.
| [ 32805] Subject: [PATCH] Small documentation nits From: Sébastien Aperghis-Tramoni <maddingue@free.fr> Date: Mon, 31 Dec 2007 21:05:47 +0100 Message-Id: <2AF110D6-9183-47C5-BBBA-26C3FB97C3D3@free.fr>
| [ 32844] Remove a comment that is no longer necessary since Perl Object bit the dust some time around 5.6
| [ 32875] Document that sitecustomize.pl is run from a BEGIN block
| [ 32887] Subject: - perlcommunity.patch (1/1) [PATCH] perlcommunity: "Perl Mongers" and calendar corrections From: brian d foy <brian.d.foy@gmail.com> Date: Fri, 04 Jan 2008 14:07:27 -0600 Message-ID: <040120081407272341%brian.d.foy@gmail.com>
| [ 32895] Clarify the use of SVf_BREAK on PL_reg_curpm.
| [ 32904] Subject: Re: [PATCH] docs more open() and $_ related entries From: "Gabor Szabo" <szabgab@gmail.com> Date: Tue, 8 Jan 2008 22:07:54 +0200 Message-ID: <d8a74af10801081207q2637419dy59f1a3600bcc4a76@mail.gmail.com>
| [ 32916] Add a few =cuts (so configpm is displayed mostly correctly by perldoc or by search.cpan.org)
| [ 32944] Better descriptions for PL_regex_pad and PL_regex_padav.
| [ 32982] A couple of POD fixes by Steven Schubiger
| [ 32993] Add Weed out needless PERL_UNUSED_ARG to perltodo. It's a good "cage cleaner" task.
| [ 33066] Subject: [PATCH] socketpair() *is* available on Win32 From: "Jan Dubois" <jand@activestate.com> Date: Thu, 24 Jan 2008 14:17:23 -0800 Message-ID: <032201c85ed6$e56edf40$b04c9dc0$@com>
| [ 33082] fix misleading comment in op.h
| [ 33095] Minor rewording of the localtime() documentation, based on : Subject: [perl #49914] gmtime()/localtime() do not use time() From: Abe Timmerman (via RT) <perlbug-followup@perl.org> Date: Thu, 17 Jan 2008 18:05:07 -0800 Message-ID: <rt-3.6.HEAD-18434-1200621906-376.49914-75-0@perl.org>
| [ 33101] Subject: [PATCH] mistaken variable in pod From: "Yitzchak Scott-Thoennes" <sthoenna@efn.org> Date: Mon, 28 Jan 2008 13:39:29 -0800 (PST) Message-ID: <40783.64.81.167.122.1201556369.squirrel@webmail.efn.org>
| [ 33104] More wording nits about perl's time() versus the libc's time()
| p4raw-link: @33104 on //depot/perl: e3176d09175983fa5c9fa7bc6d286b6349cbc891
| p4raw-link: @33101 on //depot/perl: a0604b4cdf35e86e96dd5002c652cb32430e1735
| p4raw-link: @33095 on //depot/perl: e199845280bc7d6fec24a11bbe30c154943f9814
| p4raw-link: @33082 on //depot/perl: b900987b60cce8e0dffc3b8fd3835992d58ce664
| p4raw-link: @33066 on //depot/perl: f38e12df2246599f9c1104d703d0b8bb03353e86
| p4raw-link: @32993 on //depot/perl: 3d826b294b9e4cbc002b8f9eeaf0e88c52d9ee96
| p4raw-link: @32982 on //depot/perl: d77cdebfaf0c7eb784a132b575c93953a56db215
| p4raw-link: @32944 on //depot/perl: e68531976a39b1bb777ce9a7b75bd5a3eaf8de8b
| p4raw-link: @32916 on //depot/perl: c90cd22b2f0d69bba1abd4ca7cd7ece1531bbb93
| p4raw-link: @32904 on //depot/perl: b0169937a1833d262bdcef83fc6595ccc076cb56
| p4raw-link: @32895 on //depot/perl: 36c662f48c9a23ec4251c3cf747c3f42c63a6fd9
| p4raw-link: @32887 on //depot/perl: 10a3d64ebfb9fbd83a25907c9ad1c5709db66e15
| p4raw-link: @32875 on //depot/perl: e846cbe53fd65a133ee59961dd2b0d9ae211552a
| p4raw-link: @32844 on //depot/perl: e16f71082347dd9854a05b7cece1dc02f0f9f3a8
| p4raw-link: @32805 on //depot/perl: d991eed6a698716065b2039839dd7d319bc188db
| p4raw-link: @32793 on //depot/perl: e9a8c0991ec6634f51266c0ad53e02c328bc0761
| p4raw-link: @32759 on //depot/perl: a1cf41cd41cfa3d3c737acde06e0d3bbd02dd08d
| p4raw-link: @32732 on //depot/perl: 62ba2c934831b412d226e94c53b2ed4d0a5c0726
| p4raw-link: @32730 on //depot/perl: db1511c850bfe94373082caa16c1f91ab1ce8510
| p4raw-link: @32729 on //depot/perl: 460b70c2ae94a254c087c06a9e5a5c4c3d88a0b5
| p4raw-link: @32688 on //depot/perl: 22883ac550dc5dbe0c35890635d3c37c4084f3a1
| p4raw-id: //depot/maint-5.10/perl@33123
| p4raw-integrated: from //depot/perl@33122 'copy in' fakethr.h (@19242..) nostdio.h (@26736..) fakesdio.h (@27355..) thread.h (@27834..) pod/perlsec.pod (@28800..) ext/POSIX/POSIX.pod (@31137..) configpm (@31366..) parser.h (@31618..) pod/perlcommunity.pod (@31626..) iperlsys.h (@32013..) perliol.h (@32215..) dosish.h mg.h perlio.h perlsdio.h perlsfio.h unixish.h utf8.h utfebcdic.h util.h (@32237..) pod/perlport.pod (@32492..) pod/perlsyn.pod (@32559..) pod/perlvar.pod (@32730..) pad.h (@32793..) 'edit in' op.h (@33030..) 'merge in' pod/perltodo.pod (@32731..)
| p4raw-integrated: from //depot/perl@33095 'edit in' pod/perlfunc.pod (@32969..)
| p4raw-integrated: from //depot/perl@32982 'copy in' pod/perlintern.pod (@32148..) 'merge in' pod/perlapi.pod (@32925..) sv.c (@32969..)
| p4raw-integrated: from //depot/perl@32944 'merge in' intrpvar.h (@32681..)
| p4raw-integrated: from //depot/perl@32895 'merge in' regexec.c sv.h (@32861..)
| p4raw-integrated: from //depot/perl@32875 'copy in' pod/perlrun.pod (@32461..)
| p4raw-integrated: from //depot/perl@32844 'merge in' embed.fnc (@32840..)
| p4raw-integrated: from //depot/perl@32793 'copy in' scope.h (@29695..) regcomp.h (@32237..) 'merge in' handy.h (@32707..) perl.h (@32789..)
| p4raw-integrated: from //depot/perl@32759 'merge in' regexp.h (@32758..)
| p4raw-integrated: from //depot/perl@32732 'ignore' pod/perl5100delta.pod (@32649..)
* Fix up copyright years for files modified in 2007. | Nicholas Clark | 2007-11-07 | 1 | -1/+1
| p4raw-id: //depot/perl@32237
* Update copyright years in .h files. Also, in .pl | Rafael Garcia-Suarez | 2007-01-05 | 1 | -1/+1
| files that generate .h files, so they'll be ready next time.
| p4raw-id: //depot/perl@29695
* Re: [perl #38293] chr(65535) should be allowed in regexes | SADAHIRO Tomoyuki | 2006-04-02 | 1 | -4/+6
| Message-Id: <20060402224657.B942.BQW10602@nifty.com>
| p4raw-id: //depot/perl@27688
* Change 27677 missed two direct accesses to op_private in COPs. | Nicholas Clark | 2006-04-02 | 1 | -1/+1
| I believe that all are now found, as redefining CopHINTS_get(c) to (~(c)->op_private) (with corresponding changes to CopHINTS_set() and the initialisation of PL_compiling) works.
| p4raw-id: //depot/perl@27687
* Compilation help for EBCDIC platforms, from Jarkko, | Rafael Garcia-Suarez | 2005-12-22 | 1 | -3/+5
| tested by Rajarshi Das
| p4raw-id: //depot/perl@26452
* Re: XS-assisted SWASHGET (esp. for t/uni/class.t speedup) | SADAHIRO Tomoyuki | 2005-11-30 | 1 | -1/+1
| Message-Id: <20051127170016.A786.BQW10602@nifty.com>
| p4raw-id: //depot/perl@26229
* A more elegant way to deal with utf8n_to_uvchr() and utf8n_to_uvuni(). | Nicholas Clark | 2005-10-31 | 1 | -2/+2
| p4raw-id: //depot/perl@25926
* Replace uvuni_to_utf8() with a macro that passes the extra 0 argument | Nicholas Clark | 2005-10-30 | 1 | -0/+1
| to uvuni_to_utf8_flags(). Move the old body to mathoms.c
| p4raw-id: //depot/perl@25905
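Note on the entry above: the pattern it describes is to keep an old short call form alive as a macro that forwards to the newer function with a zero flags argument. The following is only a minimal sketch of that pattern. The names demo_encode, demo_encode_flags and DEMO_ALLOW_SURROGATES are hypothetical, and the encoder body is an illustrative stand-in, not the code from utf8.c.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical flag: permit encoding of surrogate code points. */
#define DEMO_ALLOW_SURROGATES 0x1u

/* Hypothetical "flags" variant: writes the UTF-8 form of cp to dst
 * (which must hold at least 4 bytes) and returns the byte count,
 * or 0 if cp is rejected. */
static size_t demo_encode_flags(uint8_t *dst, uint32_t cp, unsigned flags)
{
    if (cp > 0x10FFFF)
        return 0;                       /* beyond the Unicode maximum */
    if (cp >= 0xD800 && cp <= 0xDFFF && !(flags & DEMO_ALLOW_SURROGATES))
        return 0;                       /* surrogate, not allowed by flags */

    if (cp < 0x80) {
        dst[0] = (uint8_t)cp;
        return 1;
    }
    if (cp < 0x800) {
        dst[0] = (uint8_t)(0xC0 | (cp >> 6));
        dst[1] = (uint8_t)(0x80 | (cp & 0x3F));
        return 2;
    }
    if (cp < 0x10000) {
        dst[0] = (uint8_t)(0xE0 | (cp >> 12));
        dst[1] = (uint8_t)(0x80 | ((cp >> 6) & 0x3F));
        dst[2] = (uint8_t)(0x80 | (cp & 0x3F));
        return 3;
    }
    dst[0] = (uint8_t)(0xF0 | (cp >> 18));
    dst[1] = (uint8_t)(0x80 | ((cp >> 12) & 0x3F));
    dst[2] = (uint8_t)(0x80 | ((cp >> 6) & 0x3F));
    dst[3] = (uint8_t)(0x80 | (cp & 0x3F));
    return 4;
}

/* The old two-argument form survives as a macro passing the extra 0. */
#define demo_encode(dst, cp) demo_encode_flags((dst), (cp), 0)
```

Existing callers of the two-argument form compile unchanged, while new callers can pass flags directly to the function.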
* Replace is_utf8_string_loc() with a macro that passes the extra 0 | Nicholas Clark | 2005-10-30 | 1 | -1/+3
| argument to is_utf8_string_loc(). Correct the description of its parameters in its POD.
| p4raw-id: //depot/perl@25903
* undef IS_UTF8_CHAR() on EBCDIC | SADAHIRO Tomoyuki | 2005-10-09 | 1 | -0/+4
| Message-Id: <20051008165752.348A.BQW10602@nifty.com>
| p4raw-id: //depot/perl@25716
* one more round of is_utf8_foo tuneup | Jarkko Hietaniemi | 2005-06-07 | 1 | -0/+2
| Message-ID: <42A314E4.8060608@gmail.com>
| p4raw-id: //depot/perl@24730
* speed up is_utf8_char() | Jarkko Hietaniemi | 2005-06-03 | 1 | -0/+70
| Message-ID: <429F557E.3090007@gmail.com>
| p4raw-id: //depot/perl@24687
* Symbian port of Perl | Jarkko Hietaniemi | 2005-04-21 | 1 | -1/+1
| Message-ID: <B356D8F434D20B40A8CEDAEC305A1F2453D653@esebe105.NOE.Nokia.com>
| p4raw-id: //depot/perl@24271
* Third consting batch | Andy Lester | 2005-03-24 | 1 | -5/+5
| Message-Id: <2f14220e7101a03f7659dbe79a03b115@petdance.com>
| p4raw-id: //depot/perl@24074
* Re: uc($long_utf8_string) exhausts memory | Jarkko Hietaniemi | 2005-01-22 | 1 | -8/+20
| Message-Id: <41F1801C.3080201@iki.fi>
| Make buffer size estimates for utf8 case conversion less maximally pessimistic
| p4raw-id: //depot/perl@23857
* UTF8_ALLOW_ANYUV should not allow overlong sequences [PATCH] | Gisle Aas | 2004-12-09 | 1 | -2/+1
| Message-ID: <lrmzwrae0j.fsf_-_@caliper.activestate.com>
| p4raw-id: //depot/perl@23632
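Note on the entry above: an "overlong" UTF-8 sequence is one that spends more bytes than the code point needs, for example C0 80 for U+0000. A rough sketch of how such a sequence can be detected is given here. The helper names demo_min_for_len and demo_is_overlong are hypothetical, and the sketch assumes the caller has already checked that the sequence is structurally well formed with a length of 1 to 4 bytes. It does not reproduce the logic in utf8.c.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Smallest code point that genuinely requires a sequence of len bytes. */
static uint32_t demo_min_for_len(size_t len)
{
    switch (len) {
    case 1:  return 0x0;
    case 2:  return 0x80;
    case 3:  return 0x800;
    default: return 0x10000;
    }
}

/* Assumes s points at a structurally valid sequence of len bytes
 * (1 <= len <= 4): a start byte followed by len-1 continuation bytes. */
static bool demo_is_overlong(const uint8_t *s, size_t len)
{
    uint32_t cp;
    size_t i;

    if (len == 1)
        return false;                   /* one-byte forms are never overlong */

    /* Payload bits of the start byte: 5 for len 2, 4 for len 3, 3 for len 4. */
    cp = s[0] & (0xFFu >> (len + 1));
    for (i = 1; i < len; i++)
        cp = (cp << 6) | (s[i] & 0x3Fu);

    /* Overlong means the value would have fit in a shorter sequence. */
    return cp < demo_min_for_len(len);
}
```

Comparing the decoded value against the minimum for its length catches every overlong form in one test, instead of special-casing individual start bytes.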
* It's UTF-8, not UTF8. (Note: not s/UTF-8/UTF8/, | Jarkko Hietaniemi | 2003-09-12 | 1 | -2/+2
| since that would break a lot of code.) Also few stray UTF16s, UTF32s, and "encoded in Unicode".
| p4raw-id: //depot/perl@21198
* Fix up Larry's copyright statements to my best knowledge. | Jarkko Hietaniemi | 2003-04-16 | 1 | -1/+1
| (Lots of Perl 5 source code archaeology was involved.) Larry didn't make strangled noises when I showed him the patch, either :-)
| p4raw-id: //depot/perl@19242
* Reverse copyright update (#18801) for files not changed in 2003. | Hugo van der Sanden | 2003-03-02 | 1 | -1/+1
| p4raw-id: //depot/perl@18807
* Update all copyrights to 2003, from Jarkko | Hugo van der Sanden | 2003-03-02 | 1 | -1/+1
| p4raw-id: //depot/perl@18801
* As noted by Philip Newton: nothing wrong with BOM, | Jarkko Hietaniemi | 2002-04-06 | 1 | -14/+12
| but 0xFFFE quite wrong.
| p4raw-id: //depot/perl@15762
* Explain the "gaps" in the UTF-8 encoding. | Jarkko Hietaniemi | 2002-04-06 | 1 | -0/+4
| p4raw-id: //depot/perl@15761
* What started as a small nit (the charnames test, nit found | Jarkko Hietaniemi | 2002-04-02 | 1 | -5/+5
| by Hugo), ballooned a bit... the goal is Larry's wish that illegal Unicode (such as U+FFFF) by default doesn't warn, since what if somebody WANTS to create illegal Unicode? Now getting close to this in the regex runtime. (Also, fix more of my fixation that BOM would be U+FFFE.)
| p4raw-id: //depot/perl@15689
* Mysterious characters. | Jarkko Hietaniemi | 2002-03-10 | 1 | -6/+6
| p4raw-id: //depot/perl@15148
* Update the UTF-8 explanation table. | Jarkko Hietaniemi | 2002-02-27 | 1 | -2/+25
| p4raw-id: //depot/perl@14900
* Not extending enough. | Jarkko Hietaniemi | 2002-02-19 | 1 | -2/+4
| p4raw-id: //depot/perl@14758
* EBCDIC: SHARP S is different. | Jarkko Hietaniemi | 2002-02-05 | 1 | -1/+14
| p4raw-id: //depot/perl@14561
* Copyright++. (Not all the toplevel *.h have one, it seems.) | Jarkko Hietaniemi | 2002-01-23 | 1 | -1/+1
| p4raw-id: //depot/perl@14391
* AIX cpp bug: having macro arguments and character constants | Jarkko Hietaniemi | 2002-01-23 | 1 | -7/+7
| "the same" means trouble (here s and 's')
| What broke now was 841 and 842 of t/op/pat.t, because of the ANYOF_UNICODE_FOLD_SHARP_S() in utf8.h, ccversion 5.0.1.0 (note that breakage happened only under cc_r and usethreads+ useithreads)
| p4raw-id: //depot/perl@14379
* Sharp S as a special treat for our German UTF-8 testers :-) | Jarkko Hietaniemi | 2002-01-12 | 1 | -0/+8
| p4raw-id: //depot/perl@14222
* More regex and utf8 debug dumping. | Jarkko Hietaniemi | 2002-01-07 | 1 | -0/+3
| p4raw-id: //depot/perl@14114
* Finish up (ha!) the Unicode case folding; | Jarkko Hietaniemi | 2002-01-05 | 1 | -0/+2
| enhance regex dumping code.
| p4raw-id: //depot/perl@14096
* The funky final sigma casefolding. | Jarkko Hietaniemi | 2001-12-23 | 1 | -0/+5
| p4raw-id: //depot/perl@13866
* Make using U+FDD0..U+FDEF (noncharacters since Unicode 3.1), | Jarkko Hietaniemi | 2001-12-21 | 1 | -0/+11
| U+...FFFE, U+...FFFF, and characters beyond U+10FFFF (the Unicode maximum code point) warnable offenses.
| p4raw-id: //depot/perl@13823
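Note on the entry above: the three ranges it names can be tested with a few integer comparisons. This is a minimal sketch under that reading. The function name demo_is_warnable_code_point is hypothetical and is not a macro or function from utf8.h.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper: true for the code points the entry lists as
 * warnable, i.e. U+FDD0..U+FDEF, any U+nFFFE or U+nFFFF, and anything
 * above U+10FFFF. */
static bool demo_is_warnable_code_point(uint32_t cp)
{
    if (cp > 0x10FFFF)
        return true;                    /* beyond the Unicode maximum */
    if (cp >= 0xFDD0 && cp <= 0xFDEF)
        return true;                    /* noncharacter block */
    if ((cp & 0xFFFE) == 0xFFFE)
        return true;                    /* last two code points of a plane */
    return false;
}
```

The mask test works because the low 16 bits of U+nFFFE and U+nFFFF differ only in the lowest bit, so clearing that bit leaves 0xFFFE in both cases. U+FEFF (the BOM) is not matched.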
* Unadorned numbers evil. | Jarkko Hietaniemi | 2001-12-13 | 1 | -1/+6
| p4raw-id: //depot/perl@13672
* PATCH Resubmission - was Re: [ID 20010902.001] v strings over 2*31 barf | John Peacock | 2001-09-10 | 1 | -1/+1
| Message-ID: <3B9D23D6.90BCCC25@rowman.com>
| p4raw-id: //depot/perl@11986
* If you want you can now add -DUSE_UTF8_SCRIPTS to your cflags | Jarkko Hietaniemi | 2001-08-12 | 1 | -0/+9
| and the Perl will be built to do that by default (adding that will break scripts having non-UTF-8 binary data, such as Latin-1.)
| p4raw-id: //depot/perl@11656
* There is no IN_UTF8. | Jarkko Hietaniemi | 2001-08-12 | 1 | -1/+0
| p4raw-id: //depot/perl@11652
* QNX patch extended for NTO | Norton T. Allen | 2001-07-06 | 1 | -1/+3
| Message-Id: <200107061339.JAA12582@bottesini.harvard.edu>
| p4raw-id: //depot/perl@11184
* Salvage bits and pieces from the experimental 'utf8 everywhere' | Jarkko Hietaniemi | 2001-05-31 | 1 | -4/+4
| patch: rename HINT_BYTE and IN_BYTE to HINT_BYTES and IN_BYTES to match the pragma name; various robustness cleanups.
| p4raw-id: //depot/perl@10339
* Typo in utf8.h | Jesús Quiroga | 2001-04-21 | 1 | -1/+1
| Message-Id: <5.0.2.1.1.20010421192107.01ce5a50@ix.netcorps.com>
| p4raw-id: //depot/perl@9775
* Integrate changes #9493,9494,9495,9496 from maintperl | Jarkko Hietaniemi | 2001-04-01 | 1 | -1/+1
| into mainline.
| fix a broken workaround for Borland compiler in change#4739 (caused weird "short reads" on DATA, which caused op/misc.t to fail)
| nits spotted by Borland compiler
| avoid redefinition warnings under Borland 5.02
| various nits identified by the Borland 5.5 compiler; remove suppression of a few warnings
| p4raw-link: @9496 on //depot/maint-5.6/perl: 9d05ad52b0aa7d1f7d147da0c4dbc14de5fe4a37
| p4raw-link: @9495 on //depot/maint-5.6/perl: 759997f1e719f33541bed70dd7f79bfa26a930b3
| p4raw-link: @9494 on //depot/maint-5.6/perl: 01b59bde1cb7ff62776f3b83c0f2575c79a950a6
| p4raw-link: @9493 on //depot/maint-5.6/perl: eea7051a8d4ef81c032143ab3193bc1240ab2e8f
| p4raw-link: @4739 on //depot/perl: c39cd00800303e8967294e98aa4c427a1872a251
| p4raw-id: //depot/perl@9497
| p4raw-integrated: from //depot/maint-5.6/perl@9492 'merge in' sv.c utf8.h (@9288..) toke.c (@9292..) ext/File/Glob/bsd_glob.c (@9415..) win32/makefile.mk (@9426..) win32/win32.h (@9494..)
* More EBCDIC stuff: | Nick Ing-Simmons | 2001-03-20 | 1 | -0/+4
| - Lose the extra level of function on ASCII.
| - spotted a chr(0) issue in sv.c
| - re-work of UTF-X tr/// ranges to work in Unicode space. Still issues with the "0xff is illegal UTF-8" hack.
| - Yet another ad hoc utf8 'upgrade' in op.c recoded (why do it once when you can do it all over the place :-(
| - Enable HINTS_UTF8 on EBCDIC - then ignore it in toke.c, need utf8.pm for swashes.
| - Simplified and commented scan_const() in toke.c
| Still something wrong regexp and tr (swashes?).
| p4raw-id: //depot/perlio@9267
* More EBCDIC fixes. | Nick Ing-Simmons | 2001-03-19 | 1 | -1/+3
| p4raw-id: //depot/perlio@9246
* Infrastructure to use UTF-EBCDIC rather than UTF-8 as the internal | Nick Ing-Simmons | 2001-03-17 | 1 | -68/+69
| encoding on EBCDIC platforms. This has the property that U+0000..U+009F i.e. a superset of ASCII are invariant under the encoding. This is EBCDIC friendly as an encoded string can be looked at as being EBCDIC by lexer sprintf("%d",...) etc. in same manner that a UTF-8 string be considered ASCII on ASCII machines.
| - re-arrange utf8.h to get ASCII specific vs Unicode generic bits separate.
| - Add some more macros to comprehend different shift amounts and possible swizzle in UTF-EBCDIC vs UTF-8. Change utf8.c to use them.
| - add utfebcdic.h which provides UTF-EBCDIC versions of the macros, and conditionally #include it.
| EBCDIC build as yet untested. ASCII still fails the one test.
| p4raw-id: //depot/perlio@9185
* Minor naming change UTF8_IS_ASCII => UTF8_IS_INVARIANT | Nick Ing-Simmons | 2001-03-17 | 1 | -0/+1
| p4raw-id: //depot/perlio@9184
* EBCDIC Fixes. | Nick Ing-Simmons | 2001-03-16 | 1 | -9/+13
| p4raw-id: //depot/perlio@9180
* #ifdef'ed out code for 'USE_BYTES_DOWNGRADES' case. | Nick Ing-Simmons | 2001-03-12 | 1 | -0/+4
| p4raw-id: //depot/perlio@9110