mirror/ruby - ruby - Maple Linux Source

mirror of https://github.com/ruby/ruby.git synced 2026-01-26 04:07:58 +00:00

Author	SHA1	Message	Date
Chris Hasiński	5de4cc5608	Fix regexp performance regression for patterns starting with s/k Commit 981ee02c7c ("Fix performance problem with /k/i and /s/i") was merged for Ruby 4.0 to enable partial Boyer-Moore optimization for patterns containing 's' or 'k' by using the prefix before those characters. However, when 's' or 'k' appears at the start of a pattern (no usable prefix), set_bm_skip() returns 0 and the code returned early without setting any optimization mode, leaving reg->optimize at ONIG_OPTIMIZE_NONE. This caused up to 30x slowdown for patterns like /slackware/i when matched against strings with non-ASCII characters. This patch keeps the improvement from 981ee02c7c for patterns with 3+ char prefix, while fixing the regression by falling back to ONIG_OPTIMIZE_EXACT_IC with the full pattern when the usable prefix is less than 3 characters. Before: /\bslackware\b/i with non-ASCII string: 2.24 us/op After: /\bslackware\b/i with non-ASCII string: 0.70 us/op (3.2x faster) [Bug #21824]	2026-01-13 10:19:44 +09:00
K.Takata	16086128cc	[k-takata/Onigmo] Suppress warning on 64-bit builds https://github.com/k-takata/Onigmo/commit/ced209d5e9	2026-01-12 20:01:43 +09:00
K.Takata	496e74d0cc	[k-takata/Onigmo] Fix that onig_new() may crash When onig_reg_init() returns an error, onig_free_body() which is called via onig_new() may crash because some members are not properly initialized. Fix it. https://github.com/k-takata/Onigmo/commit/d2a090a57e	2026-01-12 20:01:42 +09:00
K.Takata	916fbf1063	[k-takata/Onigmo] Update copyright information * Update our copyright information. * Import the latest information from oniguruma. Related: #95 https://github.com/k-takata/Onigmo/commit/0d8662b500	2026-01-12 20:01:42 +09:00
K.Takata	f9131412f8	[k-takata/Onigmo] Revise set_bm_skip() https://github.com/k-takata/Onigmo/commit/6875da50f7	2026-01-12 20:01:42 +09:00
K.Takata	bbf9bf3fc5	[k-takata/Onigmo] Remove code for backward BM search The code has not been used for long. (Oniguruma also removed this code.) https://github.com/k-takata/Onigmo/commit/8796781fdd	2026-01-12 20:01:42 +09:00
K.Takata	fb7f344b09	[k-takata/Onigmo] Remove code for reg->int_map https://github.com/k-takata/Onigmo/commit/6c58de82d2	2026-01-12 20:01:42 +09:00
K.Takata	ba74fcd6e1	Remove old code for BMH search Remove the code for Boyer-Moore-Horspool search. Now we are using Sunday's quick search. `3d9072419a`	2025-10-31 20:49:59 +09:00
K.Takata	a89246834d	Fix initialization of the table for quick search This fixes k-takata/Onigmo#120. The commit k-takata/Onigmo@9c13de8d06 was insufficient. `1de602ddff`	2025-10-31 20:49:59 +09:00
K.Takata	daf0d6c686	Fix lgtm.com warnings * Multiplication result may overflow 'int' before it is converted to 'OnigDistance'. * Comparison is always true because code <= 122. * This statement makes ExprStmt unreachable. * Empty block without comment `387ad616c3`	2025-10-31 20:49:59 +09:00
K.Takata	bfbbcf3455	[Bug #13671 ] Fix that "ss" in look-behind causes syntax error Fixes k-takata/Onigmo#92. This fix was ported from oniguruma: `257082dac8` `b1a5445fbe`	2025-10-31 20:49:59 +09:00
K.Takata	bcea1129f4	Revert "[tune] implicit-anchor optimization" This reverts commit 282338f88a8bf0807a7a1d21b06f78abe9de8fac. It seems that the commit didn't improve the performance. Revert it to fix k-takata/Onigmo#100. `cef834cb3a`	2025-10-31 20:49:59 +09:00
K.Takata	981ee02c7c	Fix performance problem with /k/i and /s/i (Close k-takata/Onigmo#97 ) E.g. For the pattern `/----k/i`, optimization was totally turned off. Make it possible to use the characters before `k` (i.e. `----`) for optimization. `9c13de8d06`	2025-10-31 20:49:59 +09:00
Hiroya Fujinami	18f500e347	Fix indents in Onigmo files to use spaces instead of tabs (#14047 ) [no ci]	2025-07-31 13:08:54 +09:00
Hiroya Fujinami	4a6384ed93	Fix [Bug #20246 ]: Don't set next_head_exact if a capture is called (#9897 )	2024-02-13 09:53:03 +09:00
Peter Zhu	d0b774cfb8	Remove null checks for xfree xfree can handle null values, so we don't need to check it.	2024-01-19 10:25:02 -05:00
Nobuyoshi Nakada	5cff4c5aa3	Fix onigmo name table without st Co-authored-by: Adam Hess <HParker@github.com>	2023-11-03 10:41:48 +09:00
Hiroya Fujinami	34cb174800	Optimize regexp matching for look-around and atomic groups (#7931 )	2023-10-30 13:10:42 +09:00
Peter Zhu	f72f3ab15b	Use onig_new_without_alloc for onig_new	2023-07-27 15:39:05 -04:00
Peter Zhu	58386814a7	Don't check for null pointer in calls to free According to the C99 specification section 7.20.3.2 paragraph 2: > If ptr is a null pointer, no action occurs. So we do not need to check that the pointer is a null pointer.	2023-06-30 09:13:31 -04:00
Nobuyoshi Nakada	ab6eb3786c	Optimize `Regexp#dup` and `Regexp.new(/RE/)` When copying from another regexp, copy already built `regex_t` instead of re-compiling its source.	2023-06-09 20:22:30 +09:00
Nobuyoshi Nakada	5ccb625fbb	Use `roomof` macro for rounding up divisions	2022-10-14 19:23:25 +09:00
Nobuyoshi Nakada	cf2bbcfff2	Just free compiled pattern if no space is used https://hackerone.com/reports/1220911	2022-04-12 20:24:14 +09:00
Yusuke Endoh	ffc3b37f96	re.c: Add Regexp.timeout= and Regexp.timeout [Feature #17837]	2022-03-30 16:50:46 +09:00
Nobuyoshi Nakada	1d6a490c2c	Cast up to get rid of the potential overflow posibility	2021-10-14 18:43:32 +09:00
Nobuyoshi Nakada	40612d55c6	Fixed shorten-64-to-32 errors when USE_COMBINATION_EXPLOSION_CHECK	2021-05-07 20:15:05 +09:00
xtkoba (Tee KOBAYASHI)	0846c2da45	Check backref number buffer overrun [Bug #16376 ]	2021-03-15 10:17:50 +09:00
Lourens Naudé	9a17437558	Right size literal regular expression buffers on compile	2020-12-10 18:15:26 +09:00
nobu	893949167b	Fix issues detected by code analysis tool (mainly Coverity). * Fix leaked storage in addr2line.c. * Fix for "top_root" leaking the resource. [Fix GH-1956] From: Jun Aruga <jaruga@redhat.com> git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@64750 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-09-15 09:59:14 +00:00
hsbt	ef9bc609db	Fix typos. * rememberd -> remembered * refered -> referred git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@61933 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-01-18 11:44:10 +00:00
naruse	6b1c6e0e55	Merge Onigmo 6.1.1 * Support absent operator https://github.com/k-takata/Onigmo/issues/82 * https://github.com/k-takata/Onigmo/blob/Onigmo-6.1.1/HISTORY git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@57603 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-02-11 15:08:33 +00:00
nobu	7db9fbdfe6	regcomp.c: debug function [ci skip] * regcomp.c (print_indent_tree): make a function for debug static. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@57420 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-01-25 06:08:16 +00:00
ngoto	9022401809	regcomp.c: Fix compile error with old version of fcc on Solaris * regcomp.c (get_min_match_length): Fix compile error with old version of fcc (Fujitsu C Compiler) on Solaris 10. [Bug #13059][ruby-dev:49909] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@57143 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2016-12-21 15:21:19 +00:00
naruse	2873edeafb	Merge Onigmo 6.0.0 * https://github.com/k-takata/Onigmo/blob/Onigmo-6.0.0/HISTORY * fix for ruby 2.4: https://github.com/k-takata/Onigmo/pull/78 * suppress warning: https://github.com/k-takata/Onigmo/pull/79 * include/ruby/oniguruma.h: include onigmo.h. * template/encdb.h.tmpl: ignore duplicated definition of EUC-CN in enc/euc_kr.c. It is defined in enc/gb2313.c with CRuby macro. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@57045 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2016-12-10 17:47:04 +00:00
naruse	c11e648799	Regexp supports Unicoe 9.0.0's \X * meta character \X matches Unicode 9.0.0 characters with some workarounds for UTR #51 Unicode Emoji, Version 4.0 emoji zwj sequences. [Feature #12831] [ruby-core:77586] The term "character" can have many meanings bytes, codepoints, combined characters, and so on. "grapheme cluster" is highest one of such words, which means user-perceived characters. Unicode Standard Annex #29 UNICODE TEXT SEGMENTATION specifies how to handle grapheme clusters (extended grapheme cluster). But some specs aren't updated to current situation because Unicode Emoji is rapidly extended without well definition. It breaks the precondition of UTR#29 "Grapheme cluster boundaries can be easily tested by looking at immediately adjacent characters". (the sentence will be removed in the next version) Though some of its detail are described in Unicode Technical Report #51 UNICODE EMOJI but it is not merged into UTR#29 yet. http://unicode.org/reports/tr29/ http://unicode.org/reports/tr51/ http://unicode.org/Public/emoji/4.0/ git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@56949 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2016-11-30 17:29:19 +00:00
naruse	5d9b95a5c7	* regcomp.c (onig_print_compiled_byte_code): make the shown address look relative. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@56333 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2016-10-03 21:06:50 +00:00
naruse	1e3fc7dcfa	* regcomp.c (noname_disable_map): don't optimize out group 0 Ruby's Regexp doesn't allow normal numbered groups if the regexp has named groups. On such case it optimizes out related NT_ENCLOSE. But even on the case it can use \g<0>. This fix not to remove NT_ENCLOSE whose regnum is 0. [ruby-core:75828] [Bug #12454] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@55562 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2016-07-01 19:37:43 +00:00
nobu	4c68173079	regcomp.c: fix debug conditionals git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@55157 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2016-05-25 06:05:48 +00:00
naruse	599f054fe6	* regcomp.c: remove condition for debug output because prelude doesn't use regexp now. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@55155 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2016-05-25 04:21:32 +00:00
naruse	8c2ddab57a	* regcomp.c (compile_length_tree): return error code immediately if compile_length_tree raised error [Bug #12418] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@55154 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2016-05-25 04:21:31 +00:00
nobu	b3377eaa13	Revert r52995 revert slow atomic operations. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@52997 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2015-12-09 07:30:44 +00:00
nobu	0599d7de58	use atomic operations * regcomp.c (onig_chain_link_add): use atomic operation instead of mutex. * regint.h (ONIG_STATE_{INC,DEC}_THREAD): ditto. * regparse.c (PopFreeNode, node_recycle): ditto. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@52995 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2015-12-09 06:48:15 +00:00
naruse	9ed1d63f41	* regcomp.c, regenc.c, regexec.c, regint.h, enc/unicode.c: Merge Onigmo 58fa099ed1a34367de67fb3d06dd48d076839692 + https://github.com/k-takata/Onigmo/pull/52 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@52756 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2015-11-26 08:31:27 +00:00
naruse	d2a5354255	* reg*.c: Merge Onigmo 5.15.0 38a870960aa7370051a3544 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@47598 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2014-09-15 16:18:41 +00:00
naruse	64c81e40d4	* regcomp.c: Merge Onigmo 5.14.1 25a8a69fc05ae3b56a09. this includes Support for Unicode 7.0 [Bug #9092]. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@46831 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2014-07-16 03:27:25 +00:00
naruse	d39f196401	* regcomp.c (): Merge Onigmo 5.13.5 23b523076d6f1161. https://bugs.ruby-lang.org/issues/8583 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@41764 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2013-07-04 01:16:22 +00:00
naruse	536a3274c9	* Merge Onigmo 5.13.4 f22cf2e566712cace60d17f84d63119d7c5764ee. [bug] fix problem with optimization of \z (Issue #16) [Bug #8210] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@40276 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2013-04-13 11:30:35 +00:00
naruse	78dbaa1648	* Merge Onigmo 0fe387da2fee089254f6b04990541c731a26757f v5.13.3 [Bug#7972] [Bug#7974] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@39547 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2013-03-01 16:36:37 +00:00
naruse	bfba60f8fe	Remove unused variables. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@34905 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2012-03-05 07:40:51 +00:00
nobu	0ffc74dece	* regcomp.c, regexec.c: fix-up warnings. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@34684 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2012-02-18 15:41:59 +00:00

1 2

98 Commits