archived-php-src

mirror of https://github.com/php/php-src.git synced 2026-03-24 08:12:21 +01:00

Author	SHA1	Message	Date
Alex Dowad	115ea486ac	Merge branch 'PHP-8.5'	2026-02-17 06:51:21 +09:00
Alex Dowad	e106d688c2	Merge branch 'PHP-8.4' into PHP-8.5	2026-02-17 06:48:37 +09:00
Jordi Kroon	37c5a13d67	replace alloca with do_alloca in mb_guess_encoding_for_strings This avoids a crash in cases where the list of candidate encodings is so huge that alloca would fail. Such crashes have been observed when the list of encodings was larger than around 208,000 entries.	2026-02-17 06:46:42 +09:00
Ilija Tovilo	cb51737f41	Merge branch 'PHP-8.5' * PHP-8.5: Tweak zend.max_allowed_stack_size for gh20836_stack_limit.phpt	2026-02-03 00:55:05 +01:00
Ilija Tovilo	9e96c5ff39	Merge branch 'PHP-8.4' into PHP-8.5 * PHP-8.4: Tweak zend.max_allowed_stack_size for gh20836_stack_limit.phpt	2026-02-03 00:54:56 +01:00
Ilija Tovilo	1f57d04648	Tweak zend.max_allowed_stack_size for gh20836_stack_limit.phpt Fixes GH-21086	2026-02-03 00:54:25 +01:00
Ilija Tovilo	6173a9a109	VAR|TMP overhaul (GH-20628) The aim of this PR is twofold: - Reduce the number of highly similar TMP|VAR handlers - Avoid ZVAL_DEREF in most of these cases This is achieved by guaranteeing that all zend_compile_expr() calls, as well as all other compile calls with BP_VAR_{R,IS}, will result in a TMP variable. This implies that the result will not contain an IS_INDIRECT or IS_REFERENCE value, which was mostly already the case, with two exceptions: - Calls to return-by-reference functions. Because return-by-reference functions are quite rare, this is solved by delegating the DEREF to the RETURN_BY_REF handler, which will examine the stack to check whether the caller expects a VAR or TMP to understand whether the DEREF is needed. Internal functions will also need to adjust by calling the zend_return_unwrap_ref() function. - By-reference assignments, including both $a = &$b, as well as $a = [&$b]. When the result of these expressions is used in a BP_VAR_R context, the reference is unwrapped via a ZEND_QM_ASSIGN opcode beforehand. This is exceptionally rare. Closes GH-20628	2026-01-31 19:44:56 +01:00
Alex Dowad	7ad406a4b9	Fix crash in mb_substr with MacJapanese encoding Thanks to the GitHub user vi3tL0u1s (Viet Hoang Luu) for reporting this issue. The MacJapanese legacy text encoding has a very unusual property; it is possible for a string to encode more codepoints than it has bytes. In some corner cases, this resulted in a situation where the implementation code for mb_substr() would allocate a buffer of size -1. As you can probably imagine, that doesn't end well. Fixes GH-20832.	2026-01-18 20:07:12 +09:00
Alexandre Daubois	b391c28f90	Merge branch 'PHP-8.5' * PHP-8.5: Fix GH-20836: Stack overflow in mb_convert_variables with recursive array references (#20839)	2026-01-14 20:11:31 +01:00
Alexandre Daubois	32803687fe	Merge branch 'PHP-8.4' into PHP-8.5 * PHP-8.4: Fix GH-20836: Stack overflow in mb_convert_variables with recursive array references (#20839)	2026-01-14 20:10:30 +01:00
Alexandre Daubois	2c112e3696	Fix GH-20836: Stack overflow in mb_convert_variables with recursive array references (#20839)	2026-01-14 20:07:11 +01:00
Niels Dossche	e4098da58a	Merge branch 'PHP-8.5' * PHP-8.5: Fix GH-20833: mb_str_pad() divide by zero if padding string is invalid in the encoding	2026-01-05 20:01:59 +01:00
Niels Dossche	171b52c98f	Merge branch 'PHP-8.4' into PHP-8.5 * PHP-8.4: Fix GH-20833: mb_str_pad() divide by zero if padding string is invalid in the encoding	2026-01-05 20:01:54 +01:00
Niels Dossche	03113b09ce	Fix GH-20833: mb_str_pad() divide by zero if padding string is invalid in the encoding If the padding string is not valid in the given encoding, mb_get_strlen() can return 0. Closes GH-20834.	2026-01-05 20:01:25 +01:00
Yuya Hamada	64dd933a06	Merge branch 'PHP-8.4' into PHP-8.5	2025-12-15 10:58:49 +09:00
Yuya Hamada	355a4b5e61	Merge branch 'PHP-8.3' into PHP-8.4	2025-12-15 10:57:21 +09:00
Yuya Hamada	0056d013bf	Fix GH-20674 mb_decode_mimeheader does not handle separator `?= =?` is skipped if long term, so skip space character. Add test case from RFC2047 and fix last pattern See: https://www.ietf.org/rfc/rfc2047#section-8	2025-12-15 10:55:17 +09:00
Yuya Hamada	85913fc61b	Fix GH-20674 mb_decode_mimeheader does not handle separator `?= =?` is skipped if long term, so skip space character. Add test case from RFC2047 and fix last pattern See: https://www.ietf.org/rfc/rfc2047#section-8	2025-12-15 10:52:03 +09:00
Heran Yang	1f3fe93eff	Add GB18030-2022 to default encoding list for zh-CN (#20604) GB18030-2022 is the current official standard, superseding the previous 2005 and 2000 versions. It is essential for modern Chinese text processing for the following reasons: 1. Superset Relationship: GB18030 is a strict superset of CP936 (GBK) and EUC-CN (GB2312). Using GB18030 as the detection target covers all characters in these older encodings while enabling support for a much wider range of characters. 2. Extended Character Coverage: The 2022 standard includes significant updates, covering over 87,000 characters. It adds support for CJK Extensions (C, D, E, F, G) and updates mappings for rare characters that were previously mapped to the Private Use Area (PUA) in the 2005 version. This is critical for correctly handling names containing rare characters (e.g., in banking or government data). 3. Backward Compatibility: It is safe to promote GB18030-2022 as the preferred encoding. Files encoded in EUC-CN or CP936 are valid GB18030 streams. This PR adds GB18030-2022 to the default encoding list for CN.	2025-12-12 11:58:37 +09:00
Tobias Vorwachs	6b197ee4ed	mbstring: fix missing copying of detect_order_list to current_detect_order_list on ini_set('mbstring.detect_order', string) Closes GH-20523.	2025-12-01 20:47:57 +09:00
Gina Peter Banyard	93676a0425	ext/standard: Deprecate passing string which are not one byte long to ord() (#19440) RFC: https://wiki.php.net/rfc/deprecations_php_8_5#deprecate_passing_string_which_are_not_one_byte_long_to_ord Co-authored-by: Niels Dossche <7771979+nielsdos@users.noreply.github.com>	2025-09-14 11:42:59 +01:00
tekimen	edc2671227	ext/mbstring: Update to Unicode 17.0 (#19796) Updates UCD to Unicode 17.0 (released 2025 Sep).	2025-09-13 08:07:51 +09:00
Jorg Adam Sowa	1e02099e6a	ext/mbstring: Use `internal_encoding` INI setting instead of `mb_internal_encoding()` in tests (#19663) Moves the usage of `mb_internal_encoding()` to INI section for the tests not testing the encoding/function itself, but the other mbstring/iconv functions.	2025-09-03 11:34:12 +01:00
Niels Dossche	be2889411a	Merge branch 'PHP-8.4' * PHP-8.4: Fix GH-19397: mb_list_encodings() can cause crashes on shutdown	2025-08-08 20:33:00 +02:00
Niels Dossche	db3f6d0bf0	Merge branch 'PHP-8.3' into PHP-8.4 * PHP-8.3: Fix GH-19397: mb_list_encodings() can cause crashes on shutdown	2025-08-08 20:32:55 +02:00
Niels Dossche	cc93bbb765	Fix GH-19397: mb_list_encodings() can cause crashes on shutdown The request shutdown does not necessarily hold the last reference, if there is still a CV that refers to the array. Closes GH-19405.	2025-08-08 20:32:29 +02:00
Gina Peter Banyard	c7778641dd	ext/mbstring: Remove ZPP tests	2025-06-23 13:58:31 +02:00
Niels Dossche	2577e3a703	Merge branch 'PHP-8.3' into PHP-8.4 * PHP-8.3: Fix GH-18901: integer overflow mb_split	2025-06-22 13:08:05 +02:00
Niels Dossche	a5f21ca700	Fix GH-18901: integer overflow mb_split We prevent signed overflow by making the count unsigned. The actual interpretation of the count doesn't matter as it's just used to denote a limit. The test output for some limit values looks strange though, so that may need extra investigation. However, that's orthogonal to this fix. Closes GH-18906.	2025-06-22 13:07:43 +02:00
Niels Dossche	2b383848a7	Fix handling of references in zval_try_get_long() This API can't handle references, yet everyone keeps forgetting that it can't and that you should DEREF upfront. Fix every type of this issue once and for all by moving the reference handling to this Zend API. Closes GH-18761.	2025-06-04 21:00:05 +02:00
Niels Dossche	aa6e58f82a	Merge branch 'PHP-8.3' into PHP-8.4 * PHP-8.3: Fix weird unpack behaviour in DOM Fix GH-17989: mb_output_handler crash with unset http_output_conv_mimetypes	2025-03-09 11:21:27 +01:00
Niels Dossche	c7d3dc6fab	Fix GH-17989: mb_output_handler crash with unset http_output_conv_mimetypes The INI option can be NULL or invalid, resulting in a NULL global. So we have to add a NULL check. Closes GH-17996.	2025-03-09 11:16:33 +01:00
Christoph M. Becker	47a0922dee	Merge branch 'PHP-8.3' into PHP-8.4 * PHP-8.3: Fix GH-17503: Undefined float conversion in mb_convert_variables	2025-02-04 15:53:24 +01:00
Christoph M. Becker	55e676e181	Fix GH-17503: Undefined float conversion in mb_convert_variables Conversion of floating point to integer values is undefined if the integral part of the float value cannot be represented by the integer type. We need to cater to that explicitly (in a manner similar to `zend_dval_to_lval_cap()`). Closes GH-17689.	2025-02-04 15:51:48 +01:00
David Carlier	f47a45ecff	Merge branch 'PHP-8.3' into PHP-8.4	2024-10-11 08:49:00 +01:00
David Carlier	89b4f94024	Merge branch 'PHP-8.2' into PHP-8.3	2024-10-11 08:48:49 +01:00
David Carlier	c34d4fbbf4	Fix GH-16360 mb_substr overflow on start and length arguments. The overflow occurs when the arguments are set to ZEND_LONG_MIN and are then negated in order to count from the end of the string.	2024-10-11 08:46:48 +01:00
Niels Dossche	07e418abfb	Merge branch 'PHP-8.3' into PHP-8.4 * PHP-8.3: Fix GH-16261: Reference invariant broken in mb_convert_variables()	2024-10-07 17:49:56 +02:00
Niels Dossche	2fe8c4a4fc	Merge branch 'PHP-8.2' into PHP-8.3 * PHP-8.2: Fix GH-16261: Reference invariant broken in mb_convert_variables()	2024-10-07 17:49:24 +02:00
Niels Dossche	bf70d9ba0d	Fix GH-16261: Reference invariant broken in mb_convert_variables() The behaviour is weird in the sense that the reference must get unwrapped. What ended up happening is that when destroying the old reference the sources list was not cleaned properly. We add handling for that. Normally we would use ZEND_TRY_ASSIGN_STRINGL but that doesn't work here as it would keep the reference and change values through references (see bug #26639). Closes GH-16272.	2024-10-07 17:46:06 +02:00
Yuya Hamada	f815310c98	Merge branch 'PHP-8.3' into PHP-8.4	2024-10-05 18:28:43 +09:00
Yuya Hamada	4e23d3945a	Merge branch 'PHP-8.2' into PHP-8.3	2024-10-05 18:26:25 +09:00
Yuya Hamada	d840200cea	Fix GH-16229: Address overflowed in mb_send_mail when empty string	2024-10-05 18:24:09 +09:00
Ayesh Karunaratne	3afb96184e	ext/mbstring: Update to Unicode 16 Updates UCD to Unicode 16.0 (released 2024 Sept). Previously: `0fdffc18`, #7502, #14680 Unicode 16 adds several new character sets and case folding rules. However, the existing ucgendat script can still parse them. This also adds a couple test cases to make sure the new rules for East Asian Wide characters and case folding work correctly. These tests fail on Unicode 15.1 and older because those versions do not contain those rules.	2024-09-17 10:40:00 +09:00
tekimen	dc5f3b9562	Fix GH-15824 mb_detect_encoding() invalid "UTF8" (#15829) Switch from strcasecmp to strncasecmp. Because strncasecmp takes the comparison length as its third parameter, also add a length check for MIME names and aliases. Co-authored-by: Niels Dossche <7771979+nielsdos@users.noreply.github.com>	2024-09-11 09:40:35 +09:00
Gina Peter Banyard	5853cdb73d	Use "must not" instead of "cannot" wording	2024-08-21 21:12:17 +01:00
Gina Peter Banyard	9a2fdbec48	ext/mbstring: Use standard wording for ValueError	2024-08-21 21:12:17 +01:00
Ayesh Karunaratne	421ac9ac28	ext/mbstring: update to Unicode 15 Updates UCD to Unicode 15.1 (released 2023 Sept). The upcoming Unicode 16 version will be released roughly on 2024 Sept. Previously: `0fdffc18`, #7502 UCD 15.1 `DerivedNormalizationProps` contains multiple properties in the same line, which breaks the parser. This also updates the `ucgendat.php` script to allow two or three fields in each line, and to look for the `Cased` and `Case_Ignorable` properties in either of the fields to mimic the previous behavior.	2024-06-29 17:24:52 +02:00
Niels Dossche	f81370847c	Fix GH-13815: mb_trim() inaccurate $characters default value (#13820) Because the default characters are defined in the stub file, and the stub file is UTF-8 (typically), the characters are encoded in the string as UTF-8. When using a different character encoding, there is a mismatch between what mb_trim expects and the UTF-8 encoded string it gets. One way of solving this is by making the characters argument nullable, which would mean that it always uses the internal code path that has the unicode codepoints that are defaulted actually stored as codepoint numbers instead of in a string. Co-authored-by: @ranvis	2024-04-24 09:07:55 +02:00
Ben Ramsey	7ca4300db8	Merge branch 'PHP-8.3'	2024-04-09 23:55:11 -05:00

967 Commits