delta/ruby.git - github.com: ruby/ruby.git

	Commit message (Collapse)	Author	Age	Files	Lines
*	This commit implements the Object Shapes technique in CRuby.	Jemma Issroff	2022-09-28	2	-18/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Object Shapes is used for accessing instance variables and representing the "frozenness" of objects. Object instances have a "shape" and the shape represents some attributes of the object (currently which instance variables are set and the "frozenness"). Shapes form a tree data structure, and when a new instance variable is set on an object, that object "transitions" to a new shape in the shape tree. Each shape has an ID that is used for caching. The shape structure is independent of class, so objects of different types can have the same shape. For example: ```ruby class Foo def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end class Bar def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end foo = Foo.new # `foo` has shape id 2 bar = Bar.new # `bar` has shape id 2 ``` Both `foo` and `bar` instances have the same shape because they both set instance variables of the same name in the same order. This technique can help to improve inline cache hits as well as generate more efficient machine code in JIT compilers. This commit also adds some methods for debugging shapes on objects. See `RubyVM::Shape` for more details. For more context on Object Shapes, see [Feature: #18776] Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org> Co-Authored-By: Eileen M. Uchitelle <eileencodes@gmail.com> Co-Authored-By: John Hawthorn <john@hawthorn.email>
*	Revert this until we can figure out WB issues or remove shapes from GC	Aaron Patterson	2022-09-26	2	-4/+18
\| \| \| \| \| \| \| \| \| \|	Revert "* expand tabs. [ci skip]" This reverts commit 830b5b5c351c5c6efa5ad461ae4ec5085e5f0275. Revert "This commit implements the Object Shapes technique in CRuby." This reverts commit 9ddfd2ca004d1952be79cf1b84c52c79a55978f4.
*	This commit implements the Object Shapes technique in CRuby.	Jemma Issroff	2022-09-26	2	-18/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Object Shapes is used for accessing instance variables and representing the "frozenness" of objects. Object instances have a "shape" and the shape represents some attributes of the object (currently which instance variables are set and the "frozenness"). Shapes form a tree data structure, and when a new instance variable is set on an object, that object "transitions" to a new shape in the shape tree. Each shape has an ID that is used for caching. The shape structure is independent of class, so objects of different types can have the same shape. For example: ```ruby class Foo def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end class Bar def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end foo = Foo.new # `foo` has shape id 2 bar = Bar.new # `bar` has shape id 2 ``` Both `foo` and `bar` instances have the same shape because they both set instance variables of the same name in the same order. This technique can help to improve inline cache hits as well as generate more efficient machine code in JIT compilers. This commit also adds some methods for debugging shapes on objects. See `RubyVM::Shape` for more details. For more context on Object Shapes, see [Feature: #18776] Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org> Co-Authored-By: Eileen M. Uchitelle <eileencodes@gmail.com> Co-Authored-By: John Hawthorn <john@hawthorn.email>
*	Fix `io/buffer.h` header guard.	Samuel Williams	2022-09-26	1	-3/+3
\|
*	Just a star [ci skip]	Nobuyoshi Nakada	2022-09-23	1	-1/+1
\|
*	rb_define_method: dedicated overload for rb_f_notimplement	卜部昌平	2022-09-21	1	-7/+8
\| \| \| \| \|	rb_f_notimplement was type-compatible with VALUE(*)(ANYARGS), but not any longer in C23. Provide a dedicated path for it.
*	[Bug #5317] Use `rb_off_t` instead of `off_t`	Nobuyoshi Nakada	2022-09-08	4	-12/+11
\| \| \| \|	Get rid of the conflict with system-provided small `off_t`.
*	Avoid leaving a period alone [ci skip]	Takashi Kokubun	2022-08-27	1	-2/+2
\|
*	typos	spaette	2022-08-27	4	-4/+5
\|
*	Support Encoding::Converter newline: :lf and :lf_newline options	Jeremy Evans	2022-08-19	1	-7/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously, newline: :lf was accepted but ignored. Where it should have been used was commented out code that didn't work, but unlike all other invalid values, using newline: :lf did not raise an error. This adds support for newline: :lf and :lf_newline, for consistency with newline: :cr and :cr_newline. This is basically the same as universal_newline, except that it only affects writing and not reading due to RUBY_ECONV_NEWLINE_DECORATOR_WRITE_MASK. Add tests for the File.open :newline option while here. Fixes [Bug #12436]
*	Stop defining `RUBY_ABI_VERSION` if released versions	Nobuyoshi Nakada	2022-08-12	1	-1/+5
\| \| \| \| \| \|	As commented in include/ruby/internal/abi.h, since teeny versions of Ruby should guarantee ABI compatibility, `RUBY_ABI_VERSION` has no role in released versions of Ruby.
*	Add missing `rb_enc_iscntrl`	Nobuyoshi Nakada	2022-08-12	1	-0/+15
\|
*	[DOC] Use `true`/`false` for `@retval`s which are `bool`	Nobuyoshi Nakada	2022-08-12	1	-43/+43
\|
*	[DOC] Add return values of rb_enc_mbcput	Nobuyoshi Nakada	2022-08-07	1	-4/+6
\|
*	Adjust styles [ci skip]	Nobuyoshi Nakada	2022-07-27	1	-4/+8
\|
*	Rename rb_ary_tmp_new to rb_ary_hidden_new	Peter Zhu	2022-07-26	1	-3/+3
\| \| \| \| \| \|	rb_ary_tmp_new suggests that the array is temporary in some way, but that's not true, it just creates an array that's hidden and not on the transient heap. This commit renames it to rb_ary_hidden_new.
*	Move enum definitions out of struct definition	Yusuke Endoh	2022-07-22	1	-30/+29
\|
*	Expand tabs [ci skip]	Takashi Kokubun	2022-07-21	4	-26/+26
\| \| \| \|	[Misc #18891]
*	Remove unused internal macros in rarray.h	Peter Zhu	2022-07-21	1	-24/+0
\|
*	Implement Objects on VWA	Peter Zhu	2022-07-15	2	-4/+43
\| \| \| \| \| \|	This commit implements Objects on Variable Width Allocation. This allows Objects with more ivars to be embedded (i.e. contents directly follow the object header) which improves performance through better cache locality.
*	Fix some UBSAN false positives (#6115)	Kevin Backhouse	2022-07-12	1	-1/+1
\| \| \| \|	* Fix some UBSAN false positives. * ruby tool/update-deps --fix
*	do not define our own version of memcpy	卜部昌平	2022-07-07	1	-5/+1
\| \| \| \| \| \|	The (sole) use of memcpy in our public header is now replaced to directly call ruby_nonempty_memcpy, and the previous definition of memcpy is now internal-only. [Bug#18893]
*	Copy `IO#wait*` methods from `io-wait` gem to `io.c`.	Samuel Williams	2022-06-25	1	-0/+3
\|
*	Include JIT information in crash reports	Chris Seaton	2022-06-20	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since enabling YJIT or MJIT drastically changes what could go wrong at runtime, it's good to be front and center about whether they are enabled when dumping a crash report. Previously, `RUBY_DESCRIPTION` and the description printed when crashing can be different when a JIT is on. Introduce a new internal data global, `rb_dynamic_description`, and set it to be the same as `RUBY_DESCRIPTION` during initialization; use it when crashing. * version.c: Init_ruby_description(): Initialize and use `rb_dynamic_description`. * error.c: Change crash reports to use `rb_dynamic_description`. * ruby.c: Call `Init_ruby_description()` earlier. Slightly more work for when we exit right after printing the description but that was deemed acceptable. * include/ruby/version.h: Talk about how JIT info is not in `ruby_description`. * test/-ext-/bug_reporter/test_bug_reporter.rb: Remove handling for crash description being different from `RUBY_DESCRIPTION`. * test/ruby/test_rubyoptions.rb: ditto Co-authored-by: Nobuyoshi Nakada <nobu@ruby-lang.org> Co-authored-by: Alan Wu <alanwu@ruby-lang.org>
*	GVL Instrumentation API: add STARTED and EXITED events	Jean Boussier	2022-06-17	1	-4/+6
\| \| \| \| \| \| \| \|	[Feature #18339] After experimenting with the initial version of the API I figured there is a need for an exit event to cleanup instrumentation data. e.g. if you record data in a {thread_id -> data} table, you need to free associated data when a thread goes away.
*	Remove unused and accidentally public rb_str_shared_root_p()	Alan Wu	2022-06-16	1	-3/+0
\| \| \| \| \| \| \|	This function was added to a public header in [1] probably unintentionally since it's not used anywhere, exposes implementation details, and isn't related to the goals of that pull request. [1]: 56cc3e99b6b9ec004255280337f6b8353f5e5b06
*	Restore rb_exec_recursive_outer	John Hawthorn	2022-06-15	1	-2/+1
\| \| \| \|	This was a public method, so we should probably keep it.
*	Move String RVALUES between pools	Matt Valentine-House	2022-06-13	1	-0/+3
\| \| \| \| \|	And re-embed any strings that can now fit inside the slot they've been moved to
*	Make method id explicit in rb_exec_recursive_outer	John Hawthorn	2022-06-10	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously, because opt_aref and opt_aset don't push a frame, when they would call rb_hash to determine the hash value of the key, the initial level of recursion would incorrectly use the method id at the top of the stack instead of "hash". This commit replaces rb_exec_recursive_outer with rb_exec_recursive_outer_mid, which takes an explicit method id, so that we can make the hash calculation behave consistently. rb_exec_recursive_outer was documented as being internal, so I believe this should be okay to change.
*	[Feature #18339] GVL Instrumentation API	Jean Boussier	2022-06-03	1	-0/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Ref: https://bugs.ruby-lang.org/issues/18339 Design: - This tries to minimize the overhead when no hook is registered. It should only incur an extra unsynchronized boolean check. - The hook list is protected with a read-write lock as to cause contention when some hooks are registered. - The hooks MUST be thread safe, and MUST NOT call into Ruby as they are executed outside the GVL. - It's simply a noop on Windows. API: ``` rb_internal_thread_event_hook_t * rb_internal_thread_add_event_hook(rb_internal_thread_event_callback callback, rb_event_flag_t internal_event, void user_data); bool rb_internal_thread_remove_event_hook(rb_internal_thread_event_hook_t hook); ``` You can subscribe to 3 events: - READY: called right before attempting to acquire the GVL - RESUMED: called right after successfully acquiring the GVL - SUSPENDED: called right after releasing the GVL. The hooks MUST be threadsafe, as they are executed outside of the GVL, they also MUST NOT call any Ruby API.
*	Remove trailing comma from FL_USER3 (#5958)	Jemma Issroff	2022-05-26	1	-1/+1
\|
*	Remove unused RMODULE_INCLUDED_INTO_REFINEMENT flag	Jemma Issroff	2022-05-26	1	-37/+0
\|
*	Remove unnecessary module flag, add module assertions to other module flags	Jemma Issroff	2022-05-23	1	-10/+0
\|
*	Undefine RUBY_DLN_CHECK_ABI on cygwin	Daisuke Fujimura (fd0)	2022-05-19	1	-1/+1
\|
*	Increase SIZE_POOL_COUNT to 5	Peter Zhu	2022-05-09	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Having more size pools will allow us to allocate larger objects through Variable Width Allocation. I have attached some benchmark results below. Discourse: On Discourse, we don't see much change in response times. We do see a small reduction in RSS. Branch RSS: 377.8 MB Master RSS: 396.3 MB railsbench: On railsbench, we don't see a big change in RPS or p99 performance. We see a small increase in RSS. Branch RPS: 815.38 Master RPS: 811.73 Branch p99: 1.69 ms Master p99: 1.68 ms Branch RSS: 90.6 MB Master RSS: 89.4 MB liquid: We don't see a significant change in liquid performance. Branch parse & render: 29.041 I/s Master parse & render: 29.211 I/s
*	Expose `rb_hash_new_capa(long)`	Jean Boussier	2022-04-26	1	-0/+11
\| \| \| \| \| \| \| \|	[Feature #18683] This allows parsers and similar libraries to create Hashes of a certain capacity in advance. It's useful when the key and values are streamed, hence `bulk_insert()` can't be used.
*	[Doc] correct my understanding about nonblocking mode	卜部昌平	2022-04-21	1	-4/+17
\| \| \| \| \|	I was wrong. Nonblocking mode nowadays does not interface with IO#read. Document updated. [ci skip]
*	[DOC] add missing size params in fiber scheduler.h (#5441)	Alex Matchneer	2022-04-14	1	-0/+2
\|
*	Finer-grained constant cache invalidation (take 2)	Kevin Newton	2022-04-01	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit reintroduces finer-grained constant cache invalidation. After 8008fb7 got merged, it was causing issues on token-threaded builds (such as on Windows). The issue was that when you're iterating through instruction sequences and using the translator functions to get back the instruction structs, you're either using `rb_vm_insn_null_translator` or `rb_vm_insn_addr2insn2` depending if it's a direct-threading build. `rb_vm_insn_addr2insn2` does some normalization to always return to you the non-trace version of whatever instruction you're looking at. `rb_vm_insn_null_translator` does not do that normalization. This means that when you're looping through the instructions if you're trying to do an opcode comparison, it can change depending on the type of threading that you're using. This can be very confusing. So, this commit creates a new translator function `rb_vm_insn_normalizing_translator` to always return the non-trace version so that opcode comparisons don't have to worry about different configurations. [Feature #18589]
*	re.c: Add Regexp.timeout= and Regexp.timeout	Yusuke Endoh	2022-03-30	1	-0/+7
\| \| \| \|	[Feature #17837]
*	Revert "Finer-grained inline constant cache invalidation"	Nobuyoshi Nakada	2022-03-25	2	-7/+2
\| \| \| \| \| \| \| \| \| \| \| \|	This reverts commits for [Feature #18589]: * 8008fb7352abc6fba433b99bf20763cf0d4adb38 "Update formatting per feedback" * 8f6eaca2e19828e92ecdb28b0fe693d606a03f96 "Delete ID from constant cache table if it becomes empty on ISEQ free" * 629908586b4bead1103267652f8b96b1083573a8 "Finer-grained inline constant cache invalidation" MSWin builds on AppVeyor have been crashing since the merger.
*	Finer-grained inline constant cache invalidation	Kevin Newton	2022-03-24	2	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Current behavior - caches depend on a global counter. All constant mutations cause caches to be invalidated. ```ruby class A B = 1 end def foo A::B # inline cache depends on global counter end foo # populate inline cache foo # hit inline cache C = 1 # global counter increments, all caches are invalidated foo # misses inline cache due to `C = 1` ``` Proposed behavior - caches depend on name components. Only constant mutations with corresponding names will invalidate the cache. ```ruby class A B = 1 end def foo A::B # inline cache depends constants named "A" and "B" end foo # populate inline cache foo # hit inline cache C = 1 # caches that depend on the name "C" are invalidated foo # hits inline cache because IC only depends on "A" and "B" ``` Examples of breaking the new cache: ```ruby module C # Breaks `foo` cache because "A" constant is set and the cache in foo depends # on "A" and "B" class A; end end B = 1 ``` We expect the new cache scheme to be invalidated less often because names aren't frequently reused. With the cache being invalidated less, we can rely on its stability more to keep our constant references fast and reduce the need to throw away generated code in YJIT.
*	[Feature #18634] Implement Arrays on Variable Width Allocation	Peter Zhu	2022-03-22	2	-2/+21
\| \| \| \| \| \|	This commit implements arrays on Variable Width Allocation. This allows longer arrays to be embedded (i.e. contents directly follow the object header) which improves performance through better cache locality.
*	Honor if `_Bool` is available	Nobuyoshi Nakada	2022-03-16	1	-1/+1
\| \| \| \|	`AC_HEADER_STDBOOL` rejects stdbool.h in c2x, which is not conforming to C99.
*	Wrap ruby_abi_version in `extern "C"` for C++	Peter Zhu	2022-03-01	1	-0/+8
\| \| \| \| \|	Make ruby_abi_version have C linkage so that the symbol can be found in the shared object.
*	Only define RUBY_DLN_CHECK_ABI when supported	Peter Zhu	2022-03-01	1	-4/+2
\|
*	[DOC] Fix reference in rb_enc_associate() description	Lars Kanis	2022-03-01	1	-2/+2
\|
*	[DOC] Fix function name in example	Lars Kanis	2022-03-01	1	-1/+1
\|
*	[Feature #18249] Implement ABI checking	Peter Zhu	2022-02-22	2	-0/+46
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Header file include/ruby/internal/abi.h contains RUBY_ABI_VERSION which is the ABI version. This value should be bumped whenever an ABI incompatible change is introduced. When loading dynamic libraries, Ruby will compare its own `ruby_abi_version` and the `ruby_abi_version` of the loaded library. If these two values don't match it will raise a `LoadError`. This feature can also be turned off by setting the environment variable `RUBY_RUBY_ABI_CHECK=0`. This feature will prevent cases where previously installed native gems fail in unexpected ways due to incompatibility of changes in header files. This will force the developer to recompile their gems to use the same header files as the built Ruby. In Ruby, the ABI version is exposed through `RbConfig::CONFIG["ruby_abi_version"]`.
*	Check if `__assume` is supported	Nobuyoshi Nakada	2022-02-19	2	-5/+1
\|