delta/perl.git - github.com: perl/perl5.git

	Commit message (Collapse)	Author	Age	Files	Lines
*	Add hook for Unicode private use override	Karl Williamson	2019-03-07	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	I am starting to write a Unicode::Private_Use module which will allow one to specify the Unicode properties of private use code points, thus making them actually useful. This commit adds a hook to regcomp.c to accommodate this module. The changes are pretty minimal. This way we don't have to wait another release cycle to get it out there. I don't want to document this interface, until it's proven.
*	perlvars.h: remove #ifdef DEBUGGING	David Mitchell	2019-02-19	1	-2/+0
\| \| \| \| \| \| \| \| \| \|	Under PERL_GLOBAL_STRUCT, all the "global" vars are put in a structure, which means perlvars.h needs a similar constraint to intrpvar.h: its size and alignment shouldn't change between debugging and non-debugging builds. This is because regcomp/exec.c are compiled both with and without DEBUGGING. [Fixed by Karl]
*	fix thread issue with PERL_GLOBAL_STRUCT	David Mitchell	2019-02-19	1	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The MY_CXT subsystem allows per-thread pseudo-static data storage. Part of the implementation for this involves each XS module being assigned a unique index in its my_cxt_index static var when first loaded. Because PERL_GLOBAL_STRUCT bans any static vars, under those builds there is instead a table which maps the MY_CXT_KEY identifying string to index. Unfortunately, this table was allocated per-interpreter rather than globally, meaning if multiple threads tried to load the same XS module, crashes could ensue. This manifested itself in failures in ext/XS-APItest/t/keyword_plugin_threads.t The fix is relatively straightforward: allocate PL_my_cxt_keys globally rather than per-interpreter. Also record the size of this struct in a new var, PL_my_cxt_keys_size, rather than doing double duty on PL_my_cxt_size.
*	PL_InBitmap: fix with -DPERL_GLOBAL_STRUCT_PRIVATE	David Mitchell	2019-02-19	1	-1/+1
\| \| \| \| \|	Valgrind wasn't happy, as this global var wasn't being initialised to NULL.
*	foo_cloexec() under PERL_GLOBAL_STRUCT_PRIVATE	David Mitchell	2019-02-19	1	-0/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fix the various Perl_PerlSock_dup2_cloexec() type functions so that t/porting/liberl.a passes under -DPERL_GLOBAL_STRUCT_PRIVATE builds. In these builds it is forbidden to have any static variables, but each of these functions (via convoluted macros) has a static var called 'strategy' which records, for each function, whether a run-time probe has been done to determine the best way of achieving close-exec functionality, and the result. Replace them all with 'global' vars: PL_strategy_dup2 etc. NB these vars aren't thread-safe but it doesn't really matter, as the worst that can happen is for a redundant probe or two to be done before a suitable "don't probe any more" value is written to the var and seen by all the threads.
*	Add global hash to handle \p{user-defined}	Karl Williamson	2019-02-14	1	-1/+8
\| \| \| \| \| \| \|	A global hash has to be specially handled. The keys can't be shared, and all the SVs stored into it must be in its thread. This commit adds the hash, and initialization, and macros for context change, but doesn't use them. The code to deal with this is entirely confined to regcomp.c.
*	Add mutex for dealing with qr/\p{user-defined}/	Karl Williamson	2019-02-14	1	-0/+4
\| \| \| \|	This will be used in future commits
*	regen/mk_invlists.pl: Create new inversion list	Karl Williamson	2019-02-05	1	-0/+1
\| \| \| \|	This will be used in a future commit.
*	perlvars.h: A variable was being initialized unnecessarily	Karl Williamson	2018-12-30	1	-1/+1
\|
*	Change name of PL_NonL1NonFinalFold	Karl Williamson	2018-12-25	1	-1/+1
\| \| \| \| \|	The inversion list this refers to now includes the Latin 1 range, so the name was misleading.
*	Change name of PL_utf8_foldable variable	Karl Williamson	2018-12-25	1	-3/+2
\| \| \| \| \| \|	This variable's name was out-of-date and misleading. It is the name of an inversion list that contains all the code points in the current version of Unicode that participate in any way in a /i type of fold.
*	regen/mk_invlists.pl: Add new table	Karl Williamson	2018-12-07	1	-0/+1
\| \| \| \| \| \| \|	This table contains all the code points that are in any multi-character fold (not the folded-from character, but what that character folds to). It will be used in a future commit.
*	Make global two interpreter variables	Karl Williamson	2018-07-14	1	-0/+2
\| \| \| \| \|	These variables are constant, once initialized, through the life of a program, so having them be per instance is a waste of time and space
*	regcomp.c: Simplify	Karl Williamson	2018-06-25	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Under /a pattern matching, the matches of the [:posix:] classes are restricted to the ASCII range. Previously, in a time/space trade-off that favored space, we created the list of matching characters at pattern compilation time by ANDing the full-range Posix class with the set of ASCII characters. But now, the tables for just the ASCII-range classes are generated anyway, so there's no need to do that compilation-time intersection. This slightly simplifies the code.
*	Use compiled-in C structure for inverted case folds	Karl Williamson	2018-03-31	1	-0/+4
\| \| \| \| \| \| \| \| \| \|	This commit changes to use the C data structures generated by the previous commit to compute what characters fold to a given one. This is used to find out what things should match under /i. This now avoids the expensive start up cost of switching to perl utf8_heavy.pl, loading a file from disk, and constructing a hash from it.
*	Use charnames inversion lists	Karl Williamson	2018-03-31	1	-0/+2
\| \| \| \| \| \| \| \|	This commit makes the inversion lists for parsing character name global instead of interpreter level, so can be initialized once per process, and no copies are created upon new thread instantiation. More importantly, this is another instance where utf8_heavy.pl no longer needs to be loaded, and the definition files read from disk.
*	Move case change invlists from interpreter to global	Karl Williamson	2018-03-26	1	-0/+5
\| \| \| \| \|	These are now constant through the life of the program, so don't need to be duplicated at each new thread instantiation.
*	Make Unicode data structures global	Karl Williamson	2018-03-14	1	-0/+24
\| \| \| \| \| \| \| \| \| \|	These structures are read-only, use const C strings, and are truly global, so no need to have them be interpreter level. This saves duplicating and freeing them as threads come and go. In doing this, I noticed that not every one was properly being copied/deallocated, so this fixes some potential unreported bugs, and leaks.
*	Work around Microsoft threaded locale bug for localeconv()	Karl Williamson	2018-03-12	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \|	Prior to Visual Studio 2015, the localeconv() function only looks at the global locale, not the per-thread one it should. This works around this by creating critical sections, switching to the global locale to call localeconv(), then switching back. For the most common usage, it avoids the switch by parsing a string it generates that should contain the desired substring. This leaves the switch required for retrieving the floating point grouping separator and the currency string, plus POSIX::localeconv(). The first two could be avoided by extra code as detailed in the pod for switch_to_global_locale(); patches welcome!
*	Don't create locale object unless threaded	Karl Williamson	2018-03-12	1	-5/+1
\| \| \| \| \| \|	PL_C_locale_obj is now only created on threaded builds on systems with POSIX 2008. On unthreaded builds, we really should continue to use the old tried and true library calls.
*	Don't create unneeded mutexes	Karl Williamson	2018-03-12	1	-1/+2
\| \| \| \|	These mutexes are needed only for unsafe threaded-locale operations.
*	Latch LC_NUMERIC during critical sections	Karl Williamson	2018-02-18	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	It is possible for operations on threaded perls which don't 'use locale' to still change the locale. This happens when calling POSIX::localeconv() and I18N::Langinfo(), and in earlier perls, it can happen for other operations when perl has been initialized with the environment causing the various locale categories to not have a uniform locale. This commit causes the areas where the locale for this category should predictably be in one or the other state to be a critical section where another thread can't interrupt and change it. This is a separate mutex, so that only these particular operations will be held up.
*	makedef.pl: Use all #ifdefs to decide	Karl Williamson	2018-02-18	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \|	This uses all the conditions that go in to deciding to define this variable, instead of taking a shortcut. (It's unfortunate that the shortcut defined in perl.h now used elsewhere via commit 7e5377f714ebe84e0fe2599b3bfec50036f0d18f is not available to makedef.pl. It would make sense to have this file look at perl.h and avoid duplication). This fixes some freebsd builds, which failed because their hints file undef'd a different variable than this shortuct looked at.
*	add wrap_keyword_plugin function (RT #132413)	Lukas Mai	2017-11-11	1	-0/+6
\|
*	Initialize locale object even in unthreaded perls	Karl Williamson	2017-08-12	1	-3/+4
\| \| \| \| \| \| \|	This commit will now initialize the thread-safe C locale object if the POSIX 2008 functions are available, regardless of whether the perl is threaded or not. This will allow for a future commit that uses them, and which is a win on unthreaded builds.
*	Restore "Add new hashing and "hash with state" infrastructure"	Yves Orton	2017-06-01	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit e6a172f358c0f48c4b744dbd5e9ef6ff0b4ff289, which was a revert of a3bf60fbb1f05cd2c69d4ff0a2ef99537afdaba7. Add new hashing and "hash with state" infrastructure This adds support for three new hash functions: StadtX, Zaphod32 and SBOX, and reworks some of our hash internals infrastructure to do so. SBOX is special in that it is designed to be used in conjuction with any other hash function for hashing short strings very efficiently and very securely. It features compile time options on how much memory and startup time are traded off to control the length of keys that SBOX hashes. This also adds support for caching the hash values of single byte characters which can be used in conjuction with any other hash, including SBOX, although SBOX itself is as fast as the lookup cache, so typically you wouldnt use both at the same time. This also removes support for Jenkins One-At-A-Time. It has served us well, but it's day is done. This patch adds three new files: zaphod32_hash.h, stadtx_hash.h, sbox32_hash.h
*	Revert "Add new hashing and "hash with state" infrastructure"	Yves Orton	2017-04-23	1	-6/+0
\| \| \| \| \| \|	This reverts commit a3bf60fbb1f05cd2c69d4ff0a2ef99537afdaba7. Accidentally pushed work pending unfreeze.
*	Add new hashing and "hash with state" infrastructure	Yves Orton	2017-04-23	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds support for three new hash functions: StadtX, Zaphod32 and SBOX, and reworks some of our hash internals infrastructure to do so. SBOX is special in that it is designed to be used in conjuction with any other hash function for hashing short strings very efficiently and very securely. It features compile time options on how much memory and startup time are traded off to control the length of keys that SBOX hashes. This also adds support for caching the hash values of single byte characters which can be used in conjuction with any other hash, including SBOX, although SBOX itself is as fast as the lookup cache, so typically you wouldnt use both at the same time. This also removes support for Jenkins One-At-A-Time. It has served us well, but it's day is done. This patch adds three new files: zaphod32_hash.h, stadtx_hash.h, sbox32_hash.h
*	Make PERLLIB_SEP dynamic on VMS.	Craig A. Berry	2016-09-01	1	-0/+7
\| \| \| \| \| \| \|	Because if we're running under a Unix shell, the path separator is likely to meet the expectations of Unix shell scripts better if it's the Unix ':' rather than the VMS '\|'. There is no change when running under DCL.
*	locale.c: Revamp my_strerror() for thread-safeness	Karl Williamson	2016-07-29	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit is the first step in making locale handling thread-safe. [perl #127708] was solved for 5.24 by adding a mutex in this function. That bug was caused by the code changing the locale even if the calling program is not consciously using locales. Posix 2008 introduced thread-safe locale functions. This commit changes this function to use them if the perl is threaded and the platform has them available. This means that the mutex is avoided on modern platforms. It restructures the function to return a mortal copy of the error message. This is a step towards making the function completely thread safe. Right now, as documented, if you do 'use locale', locale handling isn't thread-safe. A global C locale object is created and used here if necessary. It is destroyed at the end of the program. Note that some platforms have a strerror_r(), which is automatically used instead of strerror() if available. It differs form straight strerror() by taking a buffer to place the returned string, so the return does not point to internal static storage. One could test for the existence of this and avoid the mortal copy.
*	Add locale mutex	Karl Williamson	2016-04-09	1	-0/+2
\| \| \| \| \|	This adds a new mutex for use in the next commit for use with locale handling.
*	perlapi: Clarify process of using undocumented globals	Karl Williamson	2015-09-30	1	-2/+7
\| \| \| \| \|	One should send email to p5p first to get the go-ahead for documenting and using an undocumented function or global variable.
*	Static initialization using 1/0 or 0/0 not C89.	Jarkko Hietaniemi	2014-09-01	1	-13/+0
\|
*	Do not use HUGE_VAL/VALL for NV_INF.	Jarkko Hietaniemi	2014-09-01	1	-0/+13
\| \| \| \| \| \| \| \|	Because some platforms (like HP-UX 10.*) have HUGE_VAL as DBL_MAX, which, while large, is not quite the infinity. So have infinity own our very own. Similarly for NV_NAN.
*	initialise PL_watch_pvx	David Mitchell	2014-04-11	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	This global var is only used in debugging builds, but with PERL_GLOBAL_STRUCT_PRIVATE builds, it gives valgrind errors. Just initialise it to NULL. I was originally trying to replicate smoke failures of dist/Thread-Queue/t/07_lock.t under PERL_GLOBAL_STRUCT_PRIVATE, and valgrind complained.
*	Revert "[perl #119801] Stop @DB::dbline modifications from crashing"	Father Chrysostomos	2013-12-25	1	-4/+0
\| \| \| \| \| \|	This reverts commit c1cec775e9019cc8ae244d4db239a7ea5c0b343e. See ticket #120864.
*	[perl #119801] Stop @DB::dbline modifications from crashing	Father Chrysostomos	2013-12-21	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The cop address for each breakable line was being stored in the IVX slot of ${"_<$file"}[$line]. This value itself, writable from Perl space, was being used as the address of the op to be flagged, whenever a breakpoint was set. This meant writing to ${"_<$file"}[$line] and assigning a number (like 42) would cause perl to use 42 as an op address, and crash when trying to flag the op. Furthermore, since the array holding the lines could outlive the ops, setting a breakpoint on the op could write to freed memory or to an unrelated op (even a different type), potentially changing the beha- viour of unrelated code. This commit solves those pitfalls by moving breakpoints into a global breakpoint bitfield. Dbstate ops now have an extra field on the end holding a sequence number, representing which bit holds the breakpoint for that op.
*	Fix SEGVs and test failures for -DPERL_GLOBAL_STRUCT_PRIVATE	Nicholas Clark	2013-07-12	1	-3/+6
\| \| \| \| \| \| \| \| \| \| \| \| \|	With PERL_GLOBAL_STRUCT_PRIVATE "global" variables are in a structure in malloc()ed memory, not in global static variables or a global static structure. Hence no global variables are implicitly initialised to zero. * PL_curinterp and PL_op_sequence need initialising to NULL * The global structure is free()d before handlers registered with atexit() run, so be defensive about this. * Some C code checks SvOK(PL_sv_placeholder) so ensure that its SvFLAGS() are 0. * Zero PL_hash_seed
*	Make it possible to disable and control hash key traversal randomization	Yves Orton	2013-05-07	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Adds support for PERL_PERTURB_KEYS environment variable, which in turn allows one to control the level of randomization applied to keys() and friends. When PERL_PERTURB_KEYS is 0 we will not randomize key order at all. The chance that keys() changes due to an insert will be the same as in previous perls, basically only when the bucket size is changed. When PERL_PERTURB_KEYS is 1 we will randomize keys in a non repeatedable way. The chance that keys() changes due to an insert will be very high. This is the most secure and default mode. When PERL_PERTURB_KEYS is 2 we will randomize keys in a repeatedable way. Repititive runs of the same program should produce the same output every time. The chance that keys changes due to an insert will be very high. This patch also makes PERL_HASH_SEED imply a non-default PERL_PERTURB_KEYS setting. Setting PERL_HASH_SEED=0 (exactly one 0) implies PERL_PERTURB_KEYS=0 (hash key randomization disabled), settng PERL_HASH_SEED to any other value, implies PERL_PERTURB_KEYS=2 (deterministic/repeatable hash key randomization). Specifying PERL_PERTURB_KEYS explicitly to a different level overrides this behavior. Includes changes to allow one to compile out various aspects of the patch. One can compile such that PERL_PERTURB_KEYS is not respected, or can compile without hash key traversal randomization at all. Note that support for these modes is incomplete, and currently a few tests will fail. Also includes a new subroutine in Hash::Util::hash_traversal_mask() which can be used to ensure a given hash produces a predictable key order (assuming the same hash seed is in effect). This sub acts as a getter and a setter. NOTE - this patch lacks tests, but I lack tuits to get them done quickly, so I am pushing this with the hope that others can add them afterwards.
*	The size of PL_hash_seed depends on the chosen hash algorithm.	Nicholas Clark	2012-11-23	1	-1/+1
\| \| \| \|	For most 8 is fine, but SipHash needs 16.
*	Hash Function Change - Murmur hash and true per process hash seed	Yves Orton	2012-11-17	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch does the following: ) Introduces multiple new hash functions to choose from at build time. This includes Murmur-32, SDBM, DJB2, SipHash, SuperFast, and One-at-a-time. Currently this is handled by muning hv.h. Configure support hopefully to follow. ) Changes the default hash to Murmur hash which is faster than the old default One-at-a-time. ) Rips out the old HvREHASH mechanism and replaces it with a per-process random hash seed. ) Changes the old PL_hash_seed from an interpreter value to a global variable. This means it does not have to be copied during interpreter setup or cloning. ) Changes the format of the PERL_HASH_SEED variable to a hex string so that hash seeds longer than fit in an integer are possible. ) Changes the return of Hash::Util::hash_seed() from a number to a string. This is to accomodate hash functions which have more bits than can be fit in an integer. *) Adds new functions to Hash::Util to improve introspection of hashes -) hash_value() - returns an integer hash value for a given string. -) bucket_info() - returns basic hash bucket utilization info -) bucket_stats() - returns more hash bucket utilization info -) bucket_array() - which keys are in which buckets in a hash More details on the new hash functions can be found below: Murmur Hash: (v3) from google, see http://code.google.com/p/smhasher/wiki/MurmurHash3 Superfast Hash: From Paul Hsieh. http://www.azillionmonkeys.com/qed/hash.html DJB2: a hash function from Daniel Bernstein http://www.cse.yorku.ca/~oz/hash.html SDBM: a hash function sdbm. http://www.cse.yorku.ca/~oz/hash.html SipHash: by Jean-Philippe Aumasson and Daniel J. Bernstein. https://www.131002.net/siphash/ They have all be converted into Perl's ugly macro format. I have not done any rigorous testing to make sure this conversion is correct. They seem to function as expected however. All of them use the random hash seed. You can force the use of a given function by defining one of PERL_HASH_FUNC_MURMUR PERL_HASH_FUNC_SUPERFAST PERL_HASH_FUNC_DJB2 PERL_HASH_FUNC_SDBM PERL_HASH_FUNC_ONE_AT_A_TIME Setting the environment variable PERL_HASH_SEED_DEBUG to 1 will make perl output the current seed (changed to hex) and the hash function it has been built with. Setting the environment variable PERL_HASH_SEED to a hex value will cause that value to be used at the seed. Any missing bits of the seed will be set to 0. The bits are filled in from left to right, not the traditional right to left so setting it to FE results in a seed value of "FE000000" not "000000FE". Note that we do the hash seed initialization in perl_construct(). Doing it via perl_alloc() (via init_tls) causes problems under threaded builds as the buffers used for reentrant srand48 functions are not allocated. See also the p5p mail "Hash improvements blocker: portable random code that doesnt depend on a functional interpreter", Message-ID: <CANgJU+X+wNayjsNOpKRqYHnEy_+B9UH_2irRA5O3ZmcYGAAZFQ@mail.gmail.com>
*	add wrap_op_checker() API function	Zefram	2012-02-11	1	-0/+40
\| \| \| \| \|	This function provides a convenient and thread-safe way for modules to hook op checking.
*	Simplify embedvar.h, removing a level of macro indirection for PL_* variables.	Nicholas Clark	2011-08-11	1	-33/+35
\| \| \| \| \| \| \|	For the default (non-multiplicity) configuration, PERLVAR*() macros now directly expand their arguments to tokens such as C<PL_defgv>, instead of expanding to C<PL_Idefgv>. This removes over 350 lines from F<embedvar.h>, which defined macros to map from C<PL_Idefgv> to C<PL_defgv> and so forth.
*	Re-order perlvars.h to group related variables, and spread mutexes out.	Nicholas Clark	2011-06-12	1	-45/+41
\| \| \| \| \| \|	Move variables smaller than pointers into groups, to avoid holes in the structure for PERL_GLOBAL_STRUCT. Spreading the mutexes out tries to keep them off the same cache lines, which may help slightly on multi-processor machines.
*	PL_perlio_mutex is only needed with USE_UIHREADS && USE_PERLIO	Nicholas Clark	2011-06-12	1	-1/+1
\| \| \| \|	(Not that Configure will let you configure with ithreads but without perlio)
*	Change PL_use_safe_putenv from int to bool.	Nicholas Clark	2011-06-12	1	-1/+1
\| \| \| \| \|	It is only ever checked for truth/falsehood, and all assignments to it (in core and on CPAN) are either 0 or 1.
*	Move PL_runops_{std,dbg} to perl.h, and make them const.	Nicholas Clark	2011-06-12	1	-6/+0
\| \| \| \| \|	They exist solely to ensure that Perl_runops_standard and Perl_runops_debug are linked in - nothing assigns to either variable, and nothing reads them.
*	Move PL_global_struct_size, PL_interp_size{,_5_16_0} to perl.h	Nicholas Clark	2011-06-12	1	-18/+0
\| \| \| \|	Make them const U16 - they should have been const from the start.
*	In perlvar.h, move the always-present globals above those conditionally compiled	Nicholas Clark	2011-06-12	1	-2/+2
\| \| \| \| \|	Rename PL_interp_size_5_10_0 to PL_interp_size_5_16_0, as it is only intended to track interpreter size within (forwards) binary compatible maintenance branches.
*	Move PL_{revision,version,subversion} to perl.h, making them const U8.	Nicholas Clark	2011-06-12	1	-9/+0
\| \| \| \| \|	To get the initialisation to work, the location of #include patchlevel.h needs to be moved.