The aim of this PR is twofold:
- Reduce the number of highly similar TMP|VAR handlers
- Avoid ZVAL_DEREF in most of these cases
This is achieved by guaranteeing that all zend_compile_expr() calls, as well as
all other compile calls with BP_VAR_{R,IS}, will result in a TMP variable. This
implies that the result will not contain an IS_INDIRECT or IS_REFERENCE value,
which was mostly already the case, with two exceptions:
- Calls to return-by-reference functions. Because return-by-reference functions
are quite rare, this is solved by delegating the DEREF to the RETURN_BY_REF
handler, which examines the stack to determine whether the caller expects a
VAR or a TMP, and hence whether the DEREF is needed. Internal functions also
need to adjust, by calling the zend_return_unwrap_ref() function.
- By-reference assignments, both $a = &$b and $a = [&$b]. When the result of
such an expression is used in a BP_VAR_R context, the reference is unwrapped
beforehand via a ZEND_QM_ASSIGN opcode. This is exceptionally rare (see the
sketch after this list).
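For illustration, here is a minimal PHP sketch of the two exceptional cases above; the function name is made up and the comments describe the intended behaviour rather than the exact opcodes emitted:
```php
<?php
// 1) Return-by-reference function whose result is consumed as a plain value
//    (BP_VAR_R): the reference has to be unwrapped for the caller.
function &counter(): int {
    static $n = 0;
    $n++;
    return $n;
}
$copy = counter();       // reads the value, not the reference

// 2) A by-reference assignment whose result is itself read:
//    the reference is unwrapped beforehand (ZEND_QM_ASSIGN).
$b = 1;
$c = ($a = &$b);         // $c holds int(1); it is not a reference to $b

var_dump($copy, $c);     // int(1), int(1)
```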
Closes GH-20628
Thanks to the GitHub user vi3tL0u1s (Viet Hoang Luu) for reporting this issue.
The MacJapanese legacy text encoding has a very unusual property: it is possible for a string
to encode more codepoints than it has bytes. In some corner cases, this could cause the
implementation of mb_substr() to allocate a buffer of size -1. As you can probably imagine,
that doesn't end well.
Fixes GH-20832.
Over the last few years, I refactored mbstring to perform encoding conversion
a buffer at a time, rather than a single byte at a time. This resulted in a
huge performance increase.
After the refactoring, the old "byte-at-a-time" code was retained for two
reasons:
1) It was used by the mailparse PECL extension.
2) It was used to implement mb_strcut for some text encodings.
However, after reviewing mailparse's use of mbstring, it is clear that
mailparse only relies on mbstring for decoding of QPrint, and possibly
Base64. It does not use the byte-at-a-time conversion code for any
other encoding.
Further, mb_strcut only relies on the byte-at-a-time conversion code
for a limited number of legacy text encodings, such as ISO-2022-JP,
HZ, UTF-7, etc.
Hence, we can remove over 5000 lines of unused code without breaking
anything. This will help to reduce binary size, and make the mbstring
codebase easier to navigate for new contributors.
The legacy mbfl_strcut function is only used to implement mb_strcut
for legacy text encodings which 1) do not use a fixed number of bytes
per codepoint, 2) do not have an 'mblen_table' which can be used to
quickly determine the byte length of a character from its leading byte,
and 3) do not have a specialized 'mb_cut' function which implements
mb_strcut for that text encoding.
Remove unused code from mbfl_strcut, and leave only what is currently
needed for the implementation of mb_strcut.
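For context, this is the byte-oriented contract that mb_strcut exposes to userland. UTF-8 is used here only for readability; UTF-8 has an mblen_table and therefore does not itself go through mbfl_strcut:
```php
<?php
// "日本語" is 9 bytes in UTF-8 (3 bytes per character). Asking for 7 bytes
// would split the third character, so mb_strcut() shortens the cut to 6 bytes.
var_dump(mb_strcut("日本語", 0, 7, "UTF-8"));  // string(6) "日本"
```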
mbstring's Unicode case conversion is table-driven, using Minimal Perfect Hash tables.
However, for small codepoint values, we bypass the hashtable lookup and just use
hard-coded conversion logic (i.e. adding or subtracting 0x20 from the appropriate
ASCII range).
For uppercasing and lowercasing, we had already tuned the conditional which sends
execution down this fast path so that it covers as many codepoint values as
possible. However, for case folding, this had not yet been done.
This will give a small performance boost for case-folding Unicode text which
includes non-breaking spaces, symbols like ¥ or ™, or accented Latin
characters (used in many European languages).
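As a userland illustration only (the fast path itself lives in the C implementation), case folding is reached via mb_convert_case() with MB_CASE_FOLD:
```php
<?php
// The input mixes ASCII with the kinds of characters mentioned above:
// accented Latin letters, '¥' (U+00A5) and '™' (U+2122).
echo mb_convert_case("Crème Brûlée costs ¥500™", MB_CASE_FOLD, "UTF-8");
// -> crème brûlée costs ¥500™
```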
GB18030-2022 is the current official standard, superseding the previous 2005 and 2000 versions. It is essential for modern Chinese text processing for the following reasons:
1. Superset Relationship: GB18030 is a strict superset of CP936 (GBK) and EUC-CN (GB2312). Using GB18030 as the detection target covers all characters in these older encodings while enabling support for a much wider range of characters.
2. Extended Character Coverage: The 2022 standard includes significant updates, covering over 87,000 characters. It adds support for CJK Extensions (C, D, E, F, G) and updates mappings for rare characters that were previously mapped to the Private Use Area (PUA) in the 2005 version. This is critical for correctly handling names containing rare characters (e.g., in banking or government data).
3. Backward Compatibility: It is safe to promote GB18030-2022 as the preferred encoding. Files encoded in EUC-CN or CP936 are valid GB18030 streams.
This PR adds GB18030-2022 to the default encoding list for CN.
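A small sketch of the superset point from userland; the encoding names are those exposed by mbstring, and the detection list is passed explicitly here rather than relying on the defaults this PR changes:
```php
<?php
// Bytes produced for the older CP936 (GBK) encoding are also a valid
// GB18030 stream, so strict detection against GB18030 still matches them.
$gbk = mb_convert_encoding("汉字", "CP936", "UTF-8");
var_dump(mb_detect_encoding($gbk, ["ASCII", "GB18030"], true));
// -> string(7) "GB18030"
```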
The issue is specific to SLES15.
Arguably this should be reported to them, as it seems they meddled with the
oniguruma source code.
The definition in oniguruma.h on that platform looks like this (same as upstream):
```c
ONIG_EXTERN
int onig_error_code_to_str PV_((OnigUChar* s, int err_code, ...));
```
Where `PV_` is defined as follows (this part differs from upstream):
```c
#ifndef PV_
#ifdef HAVE_STDARG_PROTOTYPES
# define PV_(args) args
#else
# define PV_(args) ()
#endif
#endif
```
So that means that `HAVE_STDARG_PROTOTYPES` is unset.
It gets defined if we define `HAVE_STDARG_H`, which we can safely do because
PHP requires at least C99, in which <stdarg.h> is always available.
We could also add an autoconf check for the header, but that isn't really
necessary, as it would always succeed.
Moves the usage of `mb_internal_encoding()` into the INI section for tests that are not testing the encoding handling itself, but other mbstring/iconv functions.
We prevent signed overflow by making the count unsigned. The actual
interpretation of the count doesn't matter as it's just used to denote a
limit.
The test output for some limit values looks strange though, so that may
need extra investigation. However, that's orthogonal to this fix.
Closes GH-18906.
This API can't handle references, yet everyone keeps forgetting that it can't
and that the caller must DEREF upfront. Fix this whole class of issue once and
for all by moving the reference handling into the Zend API itself.
Closes GH-18761.