mirror of https://github.com/ruby/ruby.git synced 2026-01-27 12:34:21 +00:00

Jeremy Evans c20e819e8b Fix crash when passing large keyword splat to method accepting keywords and keyword splat

The following code previously caused a crash:

```ruby
h = {}
1000000.times{|i| h[i.to_s.to_sym] = i}
def f(kw: 1, **kws) end
f(**h)
```

Inside a thread or fiber, the size of the keyword splat could be much smaller
and still cause a crash.

I found this issue while optimizing method calling by reducing implicit
allocations.  Given the following code:

```ruby
def f(kw: , **kws) end
kw = {kw: 1}
f(**kw)
```

The `f(**kw)` call previously allocated two hashes callee side instead of a
single hash.  This is because `setup_parameters_complex` would extract the
keywords from the keyword splat hash to the C stack, to attempt to mirror
the case when literal keywords are passed without a keyword splat.  Then,
`make_rest_kw_hash` would build a new hash based on the extracted keywords
that weren't used for literal keywords.
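
One rough way to observe the allocations for this call is to compare `GC.stat(:total_allocated_objects)` before and after it. This is only a sketch: the helper name `allocations_during` is hypothetical, and the number printed depends on the Ruby version and build, so no expected value is given.

```ruby
# Illustrative only: count objects allocated by a single f(**kw) call.
# The helper name `allocations_during` is hypothetical.
def f(kw:, **kws) end

def allocations_during
  GC.disable                      # keep GC from running mid-measurement
  before = GC.stat(:total_allocated_objects)
  yield
  GC.stat(:total_allocated_objects) - before
ensure
  GC.enable
end

kw = {kw: 1}
f(**kw)                           # warm-up call, so only a repeat call is measured
count = allocations_during { f(**kw) }
puts "objects allocated by f(**kw): #{count}"
```

Running the same script under two builds gives a simple before/after comparison.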

Switch the implementation so that if a keyword splat is passed, literal keywords
are deleted from the keyword splat hash (or a copy of the hash if the hash is
not mutable).
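
The idea can be modelled in plain Ruby. This is a hypothetical sketch, not the C code in the VM: `split_keyword_splat`, `MISSING`, and the `mutable:` flag are names invented for illustration, and `mutable: true` is assumed to mean the hash is safe to modify in place.

```ruby
# Hypothetical pure-Ruby model; the real logic lives in C inside the VM.
# `names` stands in for the callee's declared keyword parameters.
MISSING = Object.new.freeze  # sentinel for "keyword not supplied"

def split_keyword_splat(splat, names, mutable: false)
  # Modify the hash in place only when that is safe (mutable);
  # otherwise copy it once so the caller's hash is left untouched.
  rest = mutable ? splat : splat.dup
  values = names.map do |name|
    rest.key?(name) ? rest.delete(name) : MISSING
  end
  [values, rest]  # rest plays the role of **kws; no second hash is built
end

h = {kw: 1, a: 2, b: 3}
values, kws = split_keyword_splat(h, [:kw])
p values                 # => [1]
p kws == {a: 2, b: 3}    # => true
p h.size                 # => 3 (the caller's hash was copied, not modified)
```

In this model the remaining keywords are whatever is left after the deletions, so the work is proportional to the number of declared keywords plus at most one hash copy.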

In addition to avoiding the crash, this new approach is much more
efficient in all cases.  With the included benchmark:

```
                                1
            miniruby:   5247879.9 i/s
     miniruby-before:   2474050.2 i/s - 2.12x  slower

                        1_mutable
            miniruby:   1797036.5 i/s
     miniruby-before:   1239543.3 i/s - 1.45x  slower

                               10
            miniruby:   1094750.1 i/s
     miniruby-before:    365529.6 i/s - 2.99x  slower

                       10_mutable
            miniruby:    407781.7 i/s
     miniruby-before:    225364.0 i/s - 1.81x  slower

                              100
            miniruby:    100992.3 i/s
     miniruby-before:     32703.6 i/s - 3.09x  slower

                      100_mutable
            miniruby:     40092.3 i/s
     miniruby-before:     21266.9 i/s - 1.89x  slower

                             1000
            miniruby:     21694.2 i/s
     miniruby-before:      4949.8 i/s - 4.38x  slower

                     1000_mutable
            miniruby:      5819.5 i/s
     miniruby-before:      2995.0 i/s - 1.94x  slower
```

2024-02-11 22:48:38 -08:00

…

lib

Remove MJIT-specific benchmarks

2023-03-06 22:36:57 -08:00

other-lang

…

app_answer.rb

…

app_aobench.rb

…

app_erb.yml

…

app_factorial.rb

…

app_fib.rb

…

app_lc_fizzbuzz.rb

…

app_mandelbrot.rb

…

app_pentomino.rb

…

app_raise.rb

…

app_strconcat.rb

…

app_tak.rb

…

app_tarai.rb

…

app_uri.rb

…

array_flatten.yml

…

array_intersection.yml

…

array_large_literal.yml

Optimize compilation of large literal arrays

2024-01-27 10:16:52 -08:00

array_max_float.yml

…

array_max_int.yml

…

array_max_str.yml

…

array_min.yml

…

array_sample_100k_10.rb

…

array_sample_100k_11.rb

…

array_sample_100k__1k.rb

…

array_sample_100k__6k.rb

…

array_sample_100k__100.rb

…

array_sample_100k___10k.rb

…

array_sample_100k___50k.rb

…

array_sample.yml

Skip string allocation in benchmark/time_at.yml

2021-11-14 23:25:25 -08:00

array_shift.rb

…

array_small_and.rb

…

array_small_diff.rb

…

array_small_or.rb

…

array_sort_block.rb

…

array_sort_float.rb

…

array_sort_int.yml

Introduce BOP_CMP for optimized comparison

2022-12-06 12:37:23 -08:00

array_values_at_int.rb

…

array_values_at_range.rb

…

attr_accessor.yml

Support tracing of attr_reader and attr_writer

2021-08-29 07:23:39 -07:00

bighash.rb

…

buffer_each.yml

Add several new methods for getting and setting buffer contents. (#6434)

2022-09-26 18:06:12 +13:00

buffer_get.yml

Add several new methods for getting and setting buffer contents. (#6434)

2022-09-26 18:06:12 +13:00

cgi_escape_html.yml

Improve HTML escape benchmarks

2022-11-04 23:54:25 -07:00

complex_float_add.yml

…

complex_float_div.yml

…

complex_float_mul.yml

…

complex_float_new.yml

…

complex_float_power.yml

…

complex_float_sub.yml

…

constant_invalidation.rb

Finer-grained constant cache invalidation (take 2)

2022-04-01 14:48:22 -04:00

dir_empty_p.rb

…

enum_lazy_flat_map.yml

…

enum_lazy_grep_v_20.rb

…

enum_lazy_grep_v_50.rb

…

enum_lazy_grep_v_100.rb

…

enum_lazy_uniq_20.rb

…

enum_lazy_uniq_50.rb

…

enum_lazy_uniq_100.rb

…

enum_lazy_zip.yml

…

enum_minmax.yml

Introduce BOP_CMP for optimized comparison

2022-12-06 12:37:23 -08:00

enum_sort_by.yml

[Feature #19643] Direct primitive compare sort for Array#sort_by

2023-05-20 19:40:27 +09:00

enum_sort.yml

Introduce BOP_CMP for optimized comparison

2022-12-06 12:37:23 -08:00

enum_tally.yml

Improve Enumerable#tally performance

2021-03-16 23:06:41 +09:00

erb_escape_html.yml

Improve HTML escape benchmarks

2022-11-04 23:54:25 -07:00

erb_render.yml

…

fiber_chain.yml

…

fiber_locals.yml

…

file_chmod.rb

…

file_rename.rb

…

float_methods.yml

Improve performance some Float methods [Feature #17498] (#4018)

2021-01-01 18:39:07 -08:00

float_neg_posi.yml

Improve performance Float#positive? and Float#negative? [Feature #17614] (#4160)

2021-02-08 20:29:42 -08:00

float_to_s.yml

Simple benchmark of Float#to_s

2021-02-10 19:42:00 +09:00

hash_aref_array.rb

Use faster any_hash logic in rb_hash

2021-09-30 13:06:53 -07:00

hash_aref_dsym_long.rb

…

hash_aref_dsym.rb

…

hash_aref_fix.rb

…

hash_aref_flo.rb

…

hash_aref_miss.rb

…

hash_aref_str.rb

…

hash_aref_sym_long.rb

…

hash_aref_sym.rb

…

hash_defaults.yml

…

hash_dup.yml

…

hash_first.yml

st.c: skip all deleted entries [Bug #17779]

2021-04-11 19:05:26 +09:00

hash_flatten.rb

…

hash_ident_flo.rb

…

hash_ident_num.rb

…

hash_ident_obj.rb

…

hash_ident_str.rb

…

hash_ident_sym.rb

…

hash_keys.rb

…

hash_literal_small2.rb

…

hash_literal_small4.rb

…

hash_literal_small8.rb

…

hash_long.rb

…

hash_shift_u16.rb

…

hash_shift_u24.rb

…

hash_shift_u32.rb

…

hash_shift.rb

…

hash_small2.rb

…

hash_small4.rb

…

hash_small8.rb

…

hash_to_proc.rb

…

hash_values.rb

…

int_quo.rb

…

io_copy_stream_write_socket.rb

…

io_copy_stream_write.rb

…

io_file_create.rb

…

io_file_read.rb

…

io_file_write.rb

…

io_nonblock_noex2.rb

…

io_nonblock_noex.rb

…

io_pipe_rw.rb

…

io_select2.rb

…

io_select3.rb

…

io_select.rb

…

io_write.rb

Add IO write throughput/locking overhead benchmark.

2022-05-28 15:44:18 +12:00

irb_color.yml

…

irb_exec.yml

…

iseq_load_from_binary.yml

Add a benchmark for RubyVM::InstructionSequence.load_from_binary

2021-03-10 13:44:07 -08:00

ivar_extend.yml

Eagerly allocate instance variable tables along with object

2021-05-03 14:11:48 -07:00

kernel_clone.yml

…

kernel_float.yml

…

kernel_tap.yml

…

kernel_then.yml

…

keyword_arguments.yml

…

loop_each.yml

Rewrite Array#each in Ruby using Primitive (#9533)

2024-01-23 20:09:57 +00:00

loop_for.rb

…

loop_generator.rb

Rewrite Kernel#loop in Ruby (#6983)

2022-12-25 21:46:29 -08:00

loop_times_megamorphic.yml

YJIT: Allow inlining ISEQ calls with a block (#9622)

2024-01-23 19:36:23 +00:00

loop_times.rb

…

loop_whileloop2.rb

…

loop_whileloop.rb

…

marshal_dump_flo.rb

…

marshal_dump_load_geniv.rb

…

marshal_dump_load_integer.yml

Optimize Marshal dump/load for large (> 31-bit) FIXNUM (#6229)

2022-08-15 16:14:12 -07:00

marshal_dump_load_time.rb

…

masgn.yml

Update multiple assignment benchmarks to include non-literal array cases

2022-08-09 22:19:46 -07:00

match_gt4.rb

…

match_small.rb

…

method_bind_call.yml

proc.c: make bind_call use existing callable method entry when possible

2021-03-10 13:43:22 -08:00

module_eqq.yml

Constant time class to class ancestor lookup

2022-02-23 19:57:42 -08:00

nil_p.yml

…

nilclass.yml

Implemented some NilClass method in Ruby code is faster [Feature #17054] (#3366)

2021-06-02 20:04:56 -07:00

num_zero_p.yml

…

numeric_methods.yml

Improve performance some Integer and Float methods [Feature #19085] (#6638)

2022-10-27 09:13:16 -07:00

object_allocate.yml

…

objspace_dump_all.yml

…

pm_array.yml

…

ractor_const.yml

Add a benchmark-driver runner for Ractor (#4172)

2021-02-10 21:24:25 -08:00

ractor_float_to_s.yml

Add a benchmark-driver runner for Ractor (#4172)

2021-02-10 21:24:25 -08:00

range_bsearch_bignum.yml

Add benchmarks for Range#bsearch

2023-09-26 17:31:10 +09:00

range_bsearch_endpointless.yml

Add benchmarks for Range#bsearch

2023-09-26 17:31:10 +09:00

range_bsearch_fixnum.yml

Add benchmarks for Range#bsearch

2023-09-26 17:31:10 +09:00

range_count.yml

Optimize Range#count by using range_size if possible

2023-10-05 00:19:55 +09:00

range_last.yml

…

range_min.yml

Introduce BOP_CMP for optimized comparison

2022-12-06 12:37:23 -08:00

range_overlap.yml

[Feature #19839] Fix Range#overlap? for empty ranges

2023-09-16 17:24:21 +09:00

range_reverse_each.yml

Add benchmarks for Range#reverse_each

2023-10-12 17:34:49 +09:00

README.md

Update the help message on /benchmark

2022-06-07 21:30:28 -07:00

realpath.yml

Fix a benchmark to avoid leaving a garbage file

2024-02-08 17:08:23 -08:00

regexp_dup.yml

Optimize Regexp#dup and Regexp.new(/RE/)

2023-06-09 20:22:30 +09:00

regexp_new.yml

Optimize Regexp#dup and Regexp.new(/RE/)

2023-06-09 20:22:30 +09:00

require_thread.yml

…

require.yml

…

securerandom.rb

…

so_ackermann.rb

…

so_array.rb

…

so_binary_trees.rb

…

so_concatenate.rb

…

so_count_words.yml

Clean up temporary file, wc.input [ci skip]

2023-10-24 12:30:10 +09:00

so_exception.rb

…

so_fannkuch.rb

…

so_fasta.rb

…

so_k_nucleotide.yml

…

so_lists.rb

…

so_mandelbrot.rb

…

so_matrix.rb

…

so_meteor_contest.rb

Fix spelling (#7405)

2023-02-28 10:05:30 -08:00

so_nbody.rb

Make benchmark indentation consistent

2022-08-19 14:44:08 -07:00

so_nested_loop.rb

…

so_nsieve_bits.rb

…

so_nsieve.rb

…

so_object.rb

…

so_partial_sums.rb

…

so_pidigits.rb

…

so_random.rb

…

so_reverse_complement.yml

…

so_sieve.rb

…

so_spectralnorm.rb

…

string_capitalize.yml

…

string_casecmp_p.yml

…

string_casecmp.yml

…

string_concat.yml

Benchmark String interpolation across size pools

2023-01-13 10:31:35 -05:00

string_downcase.yml

…

string_dup.yml

Specialize String#dup

2023-11-20 14:33:20 +01:00

string_index.rb

…

string_rpartition.yml

Make rb_str_rindex return byte index

2023-07-09 16:39:28 +09:00

string_scan_re.rb

…

string_scan_str.rb

…

string_slice.yml

…

string_split.yml

…

string_swapcase.yml

…

string_upcase.yml

…

struct_accessor.yml

Support tracing of struct member accessor methods

2023-12-07 10:29:33 -08:00

time_at.yml

Skip string allocation in benchmark/time_at.yml

2021-11-14 23:25:25 -08:00

time_new.yml

Add benchmarks to create Time instances

2021-09-12 18:44:53 +09:00

time_now.yml

Speed up and avoid kwarg hash alloc in Time.now

2022-01-12 12:55:14 -08:00

time_parse.yml

[Feature #18033] Make Time.new parse time strings

2022-12-16 22:52:59 +09:00

time_strptime.yml

…

time_subsec.rb

…

vm_array.yml

…

vm_attr_ivar_set.yml

…

vm_attr_ivar.yml

…

vm_backtrace.rb

…

vm_bigarray.yml

…

vm_bighash.yml

…

vm_block_handler.yml

…

vm_block.yml

…

vm_blockparam_call.yml

…

vm_blockparam_pass.yml

…

vm_blockparam_yield.yml

…

vm_blockparam.yml

…

vm_call_bmethod.yml

Speed up calling iseq bmethods

2023-04-25 08:06:16 -07:00

vm_call_kw_and_kw_splat.yml

Fix crash when passing large keyword splat to method accepting keywords and keyword splat

2024-02-11 22:48:38 -08:00

vm_call_method_missing.yml

Optimize method_missing calls

2023-04-25 08:06:16 -07:00

vm_call_send_iseq.yml

Optimize send calls

2023-04-25 08:06:16 -07:00

vm_call_symproc.yml

Optimize symproc calls

2023-04-25 08:06:16 -07:00

vm_case_classes.yml

compile.c: Emit send for === calls in when statements

2021-05-28 12:34:03 -04:00

vm_case_lit.yml

…

vm_case.yml

…

vm_clearmethodcache.rb

…

vm_const.yml

New constant caching insn: opt_getconstant_path

2022-09-01 15:20:49 -07:00

vm_cvar.yml

Add a cache for class variables

2021-06-18 10:02:44 -07:00

vm_defined_method.yml

…

vm_dstr_ary.rb

Optimize dynamic string interpolation for symbol/true/false/nil/0-9

2021-11-18 15:10:20 -08:00

vm_dstr_bool.rb

Optimize dynamic string interpolation for symbol/true/false/nil/0-9

2021-11-18 15:10:20 -08:00

vm_dstr_class_module.rb

Optimize dynamic string interpolation for symbol/true/false/nil/0-9

2021-11-18 15:10:20 -08:00

vm_dstr_digit.rb

Optimize dynamic string interpolation for symbol/true/false/nil/0-9

2021-11-18 15:10:20 -08:00

vm_dstr_int.rb

Optimize dynamic string interpolation for symbol/true/false/nil/0-9

2021-11-18 15:10:20 -08:00

vm_dstr_nil.rb

Optimize dynamic string interpolation for symbol/true/false/nil/0-9

2021-11-18 15:10:20 -08:00

vm_dstr_obj_def.rb

Optimize dynamic string interpolation for symbol/true/false/nil/0-9

2021-11-18 15:10:20 -08:00

vm_dstr_obj.rb

Optimize dynamic string interpolation for symbol/true/false/nil/0-9

2021-11-18 15:10:20 -08:00

vm_dstr_str.rb

Optimize dynamic string interpolation for symbol/true/false/nil/0-9

2021-11-18 15:10:20 -08:00

vm_dstr_sym.rb

Optimize dynamic string interpolation for symbol/true/false/nil/0-9

2021-11-18 15:10:20 -08:00

vm_dstr.yml

…

vm_ensure.yml

…

vm_eval.yml

…

vm_fiber_allocate.yml

…

vm_fiber_count.yml

…

vm_fiber_reuse_gc.yml

…

vm_fiber_reuse.yml

…

vm_fiber_switch.yml

…

vm_float_simple.yml

…

vm_freezeobj.yml

Adds a benchmark to measure freezing objects

2022-09-22 10:29:43 -07:00

vm_freezestring.yml

…

vm_gc_old_full.rb

…

vm_gc_old_immediate.rb

…

vm_gc_old_lazy.rb

…

vm_gc_short_lived.yml

…

vm_gc_short_with_complex_long.yml

…

vm_gc_short_with_long.yml

…

vm_gc_short_with_symbol.yml

…

vm_gc_wb_ary_promoted.yml

…

vm_gc_wb_ary.yml

…

vm_gc_wb_obj_promoted.yml

…

vm_gc_wb_obj.yml

…

vm_gc.rb

…

vm_iclass_super.yml

…

vm_ivar_embedded_obj_init.yml

Fixes ivar benchmarks to not depend on object allocation

2022-07-15 10:29:42 -04:00

vm_ivar_extended_obj_init.yml

Fixes ivar benchmarks to not depend on object allocation

2022-07-15 10:29:42 -04:00

vm_ivar_generic_get.yml

Add benchmarks for setting / getting ivars on generics

2022-07-15 13:39:02 -07:00

vm_ivar_generic_set.yml

Add benchmarks for setting / getting ivars on generics

2022-07-15 13:39:02 -07:00

vm_ivar_get_unintialized.yml

Fix style on vm_ivar benchmarks (#6379)

2022-09-15 09:39:39 +09:00

vm_ivar_get.yml

Fix style on vm_ivar benchmarks (#6379)

2022-09-15 09:39:39 +09:00

vm_ivar_ic_miss.yml

Update benchmark/vm_ivar_ic_miss.yml

2023-10-24 10:52:06 -07:00

vm_ivar_lazy_set.yml

Fix style on vm_ivar benchmarks (#6379)

2022-09-15 09:39:39 +09:00

vm_ivar_memoize.yml

vm_getivar: assume the cached shape_id like have a common ancestor

2023-11-03 12:47:43 +01:00

vm_ivar_of_class_set.yml

add vm_ivar_of_class_set

2021-10-23 01:32:55 +09:00

vm_ivar_of_class.yml

allow to access ivars of classes/modules

2021-10-23 01:32:55 +09:00

vm_ivar_set_on_instance.yml

Make benchmark indentation consistent

2022-08-19 14:44:08 -07:00

vm_ivar_set_subclass.yml

Fixes ivar benchmarks to not depend on object allocation

2022-07-15 10:29:42 -04:00

vm_ivar_set.yml

…

vm_ivar.yml

…

vm_length.yml

…

vm_lvar_cond_set.yml

avoid extra dup and pop in compile_op_asgn2

2022-09-22 09:47:13 -07:00

vm_lvar_init.yml

…

vm_lvar_set.yml

…

vm_method_missing.yml

…

vm_method_splat_calls2.yml

Add benchmark for implicit array/hash allocation reduction changes

2024-01-24 18:25:55 -08:00

vm_method_splat_calls.yml

Add benchmark for recent optimization to avoid implicit allocations

2023-12-07 11:27:55 -08:00

vm_method_with_block.yml

…

vm_method.yml

…

vm_module_ann_const_set.yml

…

vm_module_const_set.yml

…

vm_mutex.yml

…

vm_neq.yml

…

vm_newlambda.yml

…

vm_not.yml

…

vm_poly_method_ov.yml

…

vm_poly_method.yml

…

vm_poly_same_method.yml

…

vm_poly_singleton.yml

…

vm_proc.yml

…

vm_raise1.yml

…

vm_raise2.yml

…

vm_regexp.yml

…

vm_rescue.yml

…

vm_send_cfunc.yml

Optimize cfunc calls for f(*a) and f(*a, **kw) if kw is empty

2023-04-25 08:06:16 -07:00

vm_send.yml

…

vm_simplereturn.yml

…

vm_string_literal.yml

…

vm_struct_big_aref_hi.yml

…

vm_struct_big_aref_lo.yml

…

vm_struct_big_aset.yml

…

vm_struct_big_href_hi.yml

…

vm_struct_big_href_lo.yml

…

vm_struct_big_hset.yml

…

vm_struct_small_aref.yml

…

vm_struct_small_aset.yml

…

vm_struct_small_href.yml

…

vm_struct_small_hset.yml

…

vm_super.yml

…

vm_swap.yml

…

vm_symbol_block_pass.rb

…

vm_thread_alive_check.yml

…

vm_thread_close.rb

…

vm_thread_condvar1.rb

Prefer qualified names under Thread

2021-06-29 11:41:10 +09:00

vm_thread_condvar2.rb

Prefer qualified names under Thread

2021-06-29 11:41:10 +09:00

vm_thread_create_join.rb

…

vm_thread_mutex1.rb

…

vm_thread_mutex2.rb

…

vm_thread_mutex3.rb

…

vm_thread_pass_flood.rb

…

vm_thread_pass.rb

…

vm_thread_pipe.rb

…

vm_thread_queue.rb

…

vm_thread_sized_queue2.rb

…

vm_thread_sized_queue3.rb

…

vm_thread_sized_queue4.rb

…

vm_thread_sized_queue.rb

…

vm_thread_sleep.yml

…

vm_unif1.yml

…

vm_yield.yml

…

vm_zsuper.yml

…

README.md

ruby/benchmark

This directory has benchmark definitions to be run with benchmark_driver.gem.

Normal usage

Execute `gem install benchmark_driver` and run a command like:

```bash
# Run a benchmark script with the ruby in the $PATH
benchmark-driver benchmark/app_fib.rb

# Run benchmark scripts with multiple Ruby executables or options
benchmark-driver benchmark/*.rb -e /path/to/ruby -e '/path/to/ruby --jit'

# Or compare Ruby versions managed by rbenv
benchmark-driver benchmark/*.rb --rbenv '2.5.1;2.6.0-preview2 --jit'

# You can collect many metrics in many ways
benchmark-driver benchmark/*.rb --runner memory --output markdown

# Some are defined with YAML for complex setup or accurate measurement
benchmark-driver benchmark/*.yml
```

make benchmark

When you run `make benchmark`, the `make update-benchmark-driver` target automatically downloads the supported version of benchmark_driver, and the benchmarks are run with that downloaded copy.

```bash
# Run all benchmarks with the ruby in the $PATH and the built ruby
make benchmark

# Or compare with specific ruby binary
make benchmark COMPARE_RUBY="/path/to/ruby --jit"

# Run vm benchmarks
make benchmark ITEM=vm

# Run some limited benchmarks in ITEM-matched files
make benchmark ITEM=vm OPTS=--filter=block

# You can specify the benchmark by an exact filename instead of using the default argument:
# ARGS = $$(find $(srcdir)/benchmark -maxdepth 1 -name '*$(ITEM)*.yml' -o -name '*$(ITEM)*.rb')
make benchmark ARGS=benchmark/erb_render.yml

# You can specify any option via $OPTS
make benchmark OPTS="--help"

# With `make benchmark`, some special runner plugins are available:
#   -r peak, -r size, -r total, -r utime, -r stime, -r cutime, -r cstime
make benchmark ITEM=vm_bigarray OPTS="-r peak"
```
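
The default `ARGS` expression quoted in the comments above selects every `.yml` or `.rb` file directly under `benchmark/` whose name contains `ITEM`. As a rough illustration of that matching, here is a Ruby sketch; the method name `benchmark_files` is hypothetical and not part of the Makefile, and edge cases such as dotfiles may differ from `find`.

```ruby
# Rough equivalent of the default ARGS file selection; the method name
# `benchmark_files` is hypothetical and not part of the Makefile.
def benchmark_files(dir, item)
  # Non-recursive, like `find -maxdepth 1`; {yml,rb} covers both -name tests.
  Dir.glob(File.join(dir, "*#{item}*.{yml,rb}")).sort
end

# Lists the files that ITEM=vm would match (prints nothing if none exist).
puts benchmark_files("benchmark", "vm")
```

This can help to preview which files an `ITEM` value would pick up before starting a long benchmark run.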