archived-php-src

mirror of https://github.com/php/php-src.git synced 2026-04-13 11:02:55 +02:00

Author	SHA1	Message	Date
Nikita Popov	ad6738e886	Merge branch 'PHP-7.3'	2018-10-17 12:51:17 +02:00
Nikita Popov	1151554668	Remove the "auto" encoding "auto" is only meaningful in functions which accept an encoding list and support encoding detection. These functions have explicit checks for "auto". It cannot be used as a standalone encoding in any meaningful capacity, so I'm dropping it entirely.	2018-10-17 12:50:24 +02:00
Nikita Popov	e4ccc1a29f	Merge branch 'PHP-7.3'	2018-10-17 12:40:39 +02:00
Nikita Popov	56665a1b17	Fixed bug #77025 Implements 8bit conversions equivalently to iso-8859-1 conversions. This seems quite dubious to me, but seems to match the previous behavior. It might make more sense to map the characters into a private area instead, so that the 8bit encoding is treated as binary data with no case conversions (including no case conversions in the ascii range).	2018-10-17 12:38:31 +02:00
Nikita Popov	2cc6462edc	Add vtbls for EUC-TW encoding	2018-10-17 12:10:16 +02:00
Peter Kokot	1ad08256f3	Sync leading and final newlines in source code files This patch adds missing newlines, trims multiple redundant final newlines into a single one, and trims redundant leading newlines. According to POSIX, a line is a sequence of zero or more non-' <newline>' characters plus a terminating '<newline>' character. [1] Files should normally have at least one final newline character. C89 [2] and later standards [3] mention a final newline: "A source file that is not empty shall end in a new-line character, which shall not be immediately preceded by a backslash character." Although it is not mandatory for all files to have a final newline fixed, a more consistent and homogeneous approach brings less of commit differences issues and a better development experience in certain text editors and IDEs. [1] http://pubs.opengroup.org/onlinepubs/9699919799/basedefs/V1_chap03.html#tag_03_206 [2] https://port70.net/~nsz/c/c89/c89-draft.html#2.1.1.2 [3] https://port70.net/~nsz/c/c99/n1256.html#5.1.1.2	2018-10-14 12:56:38 +02:00
Peter Kokot	1c850bfcca	Sync leading and final newlines in source code files This patch adds missing newlines, trims multiple redundant final newlines into a single one, and trims redundant leading newlines. According to POSIX, a line is a sequence of zero or more non-' <newline>' characters plus a terminating '<newline>' character. [1] Files should normally have at least one final newline character. C89 [2] and later standards [3] mention a final newline: "A source file that is not empty shall end in a new-line character, which shall not be immediately preceded by a backslash character." Although it is not mandatory for all files to have a final newline fixed, a more consistent and homogeneous approach brings less of commit differences issues and a better development experience in certain text editors and IDEs. [1] http://pubs.opengroup.org/onlinepubs/9699919799/basedefs/V1_chap03.html#tag_03_206 [2] https://port70.net/~nsz/c/c89/c89-draft.html#2.1.1.2 [3] https://port70.net/~nsz/c/c99/n1256.html#5.1.1.2	2018-10-14 12:55:24 +02:00
Peter Kokot	37c329d715	Trim trailing whitespace in source code files	2018-10-13 14:17:28 +02:00
Peter Kokot	3362620b5f	Trim trailing whitespace in source code files	2018-10-13 14:16:33 +02:00
Nikita Popov	aec6421409	Merge branch 'PHP-7.3'	2018-10-02 16:14:36 +02:00
Nikita Popov	26f82a7706	Fixed bug #76958	2018-10-02 16:13:51 +02:00
Nikita Popov	9cfd8f43c2	Don't fall back to vtbl_pass if no matching vtbl found If we don't know how to convert between two encodings, make sure we error instead of ignoring the issue. Explicitly use vtbl_pass if we are round-tripping wchar->wchar or 8bit->8bit. Fingers crossed that nothing else relies on the vtbl_pass fallback...	2018-10-02 16:07:22 +02:00
Peter Kokot	a4102e1bc3	Remove unused and untouched ext/libmbfl/tests files Test files of the forked and bundled libmbfl library aren't utilized in php-src. Instead, the current approach is to use the phpt tests.	2018-09-26 23:08:04 +02:00
Peter Kokot	d3ca28f569	Remove HAVE_STRING_H The C89 standard and later defines the `<string.h>` header as part of the standard headers [1] and on current systems it is always present. Code included also `<strings.h>` header as an alterinative in some files. This kind of check was relevant on some older systems where the `<strings.h>` file included definitions for the C89 compliant `<string.h>`. Today such alternative check is not required anymore. The `<strings.h>` file is part of the POSIX definition these days. Also Autoconf suggests doing this and relying on C89 or above [2] and [3]. This patch also cleans few unused `<strings.h>` inclusions in the libmbfl. [1]: https://port70.net/~nsz/c/c89/c89-draft.html#4.1.2 [2]: http://git.savannah.gnu.org/cgit/autoconf.git/tree/lib/autoconf/headers.m4 [3]: https://www.gnu.org/software/autoconf/manual/autoconf-2.69/autoconf.html	2018-09-18 05:32:08 +02:00
Peter Kokot	ba3bd4ae06	Remove HAVE_STDIO_H The `<stdio.h>` header file is part of the standard C89 headers [1] and on current systems can be included unconditionally. Since PHP requires at least C89 or greater, the `HAVE_STDIO_H` symbol defined by Autoconf [2] can be ommitted and simplifed. Refs: [1] https://port70.net/~nsz/c/c89/c89-draft.html#4.1.2 [2] https://git.savannah.gnu.org/cgit/autoconf.git/tree/lib/autoconf/headers.m4	2018-09-17 02:00:51 +02:00
Peter Kokot	7dd62811ce	Remove HAVE_STDLIB_H The C89 and later standard defines the `<stdlib.h>` header as part of the standard headers [1] and on current systems it is always present and the `HAVE_STDLIB_H` symbol can be removed. Also Autoconf suggests doing this and relying on C89 or above [2] and [3]. [1] https://port70.net/~nsz/c/c89/c89-draft.html#4.1.2 [2] http://git.savannah.gnu.org/cgit/autoconf.git/tree/lib/autoconf/headers.m4 [3] https://www.gnu.org/software/autoconf/manual/autoconf-2.69/autoconf.html	2018-09-16 20:53:53 +02:00
Peter Kokot	77118fc925	Remove HAVE_ASSERT_H The `<assert.h>` header file is part of the standard C89 headers [1] and on older systems there needed to be also a manual check if header is present. Since PHP requires at least C89 manual check and the `HAVE_ASSERT_H` symbol defined by Autoconf in configure.ac can be both removed [2]. This patch also removes unused <assert.h> includes where c files don't use the `assert()` macro. Refs: [1] https://port70.net/~nsz/c/c89/c89-draft.html#4.2 [2] https://git.savannah.gnu.org/cgit/autoconf.git/tree/lib/autoconf/headers.m4	2018-09-09 09:43:03 +02:00
Peter Kokot	6c1ff61a36	Remove HAVE_STDDEF_H The `<stddef.h>` header file is part of the standard C89 headers [1] and on current systems there is no need for a manual check if header is present. Since PHP requires at least C89 the `HAVE_STDDEF_H` symbol isn't defined by Autoconf anywhere else anymore [2] and accross the PHP source code the header is included unconditionally already. This patch syncs this also for the bundled libmbfl which is maintaned as a fork in php-src. Refs: [1] https://port70.net/~nsz/c/c89/c89-draft.html#4.1.2 [2] https://git.savannah.gnu.org/cgit/autoconf.git/tree/lib/autoconf/headers.m4	2018-09-05 11:51:19 +02:00
Peter Kokot	8e230d364d	Remove AC_C_CONST Autoconf 2.59d (released in 2006) [1] started promoting several macros as not relevant for newer systems, including the `AC_C_CONST`. The `const` keyword is used in C since C89. On old systems some compilers lacked the `const` and this macro defined it to be empty. This check was relevant on systems with compilers before C89 and on current systems it can be omitted. [2] PHP also requires at least C89 so `const` is always available. Refs: [1] http://git.savannah.gnu.org/cgit/autoconf.git/tree/NEWS [2] https://www.gnu.org/software/autoconf/manual/autoconf-2.69/autoconf.html	2018-09-02 18:55:03 +02:00
Peter Kokot	255e29f3bc	Remove unused libmbfl build system related files PHP build system already builds necessary files also from libmbfl directory using the mbstring config.m4 file.	2018-07-29 10:07:32 +02:00
Peter Kokot	8d3f8ca12a	Remove unused Git attributes ident The $Id$ keywords were used in Subversion where they can be substituted with filename, last revision number change, last changed date, and last user who changed it. In Git this functionality is different and can be done with Git attribute ident. These need to be defined manually for each file in the .gitattributes file and are afterwards replaced with 40-character hexadecimal blob object name which is based only on the particular file contents. This patch simplifies handling of $Id$ keywords by removing them since they are not used anymore.	2018-07-25 00:53:25 +02:00
Nikita Popov	a7101415cb	Merge branch 'PHP-7.2'	2018-06-28 23:06:08 +02:00
Nikita Popov	00c0d7702c	Merge branch 'PHP-7.1' into PHP-7.2	2018-06-28 23:05:09 +02:00
Marcus Schwarz	bf5a802f5a	Fixed bug #76532 (excessive memory usage in mb_strimwidth)	2018-06-28 23:02:28 +02:00
Christoph M. Becker	9004985273	Merge branch 'PHP-7.2' * PHP-7.2: Fix #75944: Wrong cp1251 detection	2018-03-19 14:48:10 +01:00
Christoph M. Becker	cd2912af5e	Merge branch 'PHP-7.1' into PHP-7.2 * PHP-7.1: Fix #75944: Wrong cp1251 detection	2018-03-19 14:34:09 +01:00
Christoph M. Becker	47461368ca	Fix #75944 : Wrong cp1251 detection `\xFF` is a valid character of CP-1251.	2018-03-19 14:24:27 +01:00
Christoph M. Becker	ef01ec08f0	Merge branch 'PHP-7.2' * PHP-7.2: Fix #62545: wrong unicode mapping in some charsets	2018-03-11 18:05:08 +01:00
Christoph M. Becker	2b02e6dff3	Merge branch 'PHP-7.1' into PHP-7.2 * PHP-7.1: Fix #62545: wrong unicode mapping in some charsets	2018-03-11 17:54:45 +01:00
Christoph M. Becker	01ea314e8c	Fix #62545 : wrong unicode mapping in some charsets Undefined characters are best mapped to Unicode REPLACEMENT characters.	2018-03-11 17:38:28 +01:00
Gabriel Caruso	2d48d734a2	Fix some misspellings	2018-02-06 16:59:00 +01:00
Gabriel Caruso	6400264856	Trailing whitespaces Signed-off-by: Gabriel Caruso <carusogabriel34@gmail.com>	2018-01-03 14:38:00 +01:00
Nikita Popov	d21c902841	Fix cp950 pua check One set of parenthesis was missing, causing a legitimate compiler warnings. In the end it doesn't actually matter, because it just ends up doing an unnecessary check in the w > 0 case. This fixes the logic and moves it out into a separate functions, to be a bit more readable.	2017-11-22 23:47:18 +01:00
Peter Kokot	3ed3bc3a0c	Update README information for the libmbfl library The libmbfl library is bundled with PHP and has its own repository for development and bug fixes. To avoid confusion and faster development the README has been updated to include the information of the original library and to use the bundled library as a fork of the upstream repository instead.	2017-10-08 17:51:02 +02:00
Joe Watkins	c898349e16	fixes PR #2722 , no clue how it broke ...	2017-09-06 11:13:27 +01:00
shinemotec@gmail.com	9b77615608	fixed mbstring extension compiled broken with archlinux	2017-09-06 09:50:08 +01:00
Nikita Popov	f24db7686e	Optimize mb_ord() Don't perform a full encoding conversion into UCS4-BE, instead only perform an input conversion into a wchar device.	2017-08-04 22:22:58 +02:00
Nikita Popov	633a471ba0	Store input and output filters in mbfl encodings For functions like mb_chr() and mb_ord() just looking up the input/output filter for the encoding dominates the runtime. This commit stores the input/output filter for an encoding in the mbfl encoding structure, so it can be looked up directly, rather than scanning through filter function lists.	2017-08-04 22:22:58 +02:00
Nikita Popov	e20fbd43ba	Separate mbfl filters into three categories Input filters, output filters and special filters.	2017-08-04 22:22:58 +02:00
Nikita Popov	c98714f19e	Merge branch 'PHP-7.2'	2017-08-03 21:57:35 +02:00
Nikita Popov	fb9bf5b64b	Revert/fix substitution character fallback The introduced checks were not correct in two respects: * It was checked whether the source encoding of the string matches the internal encoding, while the actually relevant encoding is the target encoding. * Even if the correct encoding is used, the checks are still too conservative. Just because something is not a "Unicode-encoding" does not mean that it does not map any non-ASCII characters. I've reverted the added checks and instead adjusted mbfl_convert to first try to use the provided substitution character and if that fails, perform the fallback to '?' at that point. This means that any codepoint mapped in the target encoding should now be correctly supported and anything else should fall back to '?'.	2017-08-03 21:53:59 +02:00
Nikita Popov	25b6e68432	Merge branch 'PHP-7.2'	2017-07-28 13:03:35 +02:00
Nikita Popov	5d777e56e2	Merge branch 'PHP-7.1' into PHP-7.2	2017-07-28 13:03:26 +02:00
Nikita Popov	e3d25e78eb	Fixed bug #62934	2017-07-28 13:02:25 +02:00
Nikita Popov	445e13b149	Add MBFL_SUBSTR_TO_END mode to mbfl_substr This takes the substr from the offset to the end of the string. This avoids pointless searching for the end position and also saves us a length calculation in the strstr family of functions.	2017-07-23 23:17:12 +02:00
Anatol Belski	7496bad2ac	adjust datatype, used for position handling	2017-07-23 16:37:31 +02:00
Nikita Popov	698132d6f9	Merge branch 'PHP-7.2'	2017-07-23 12:22:09 +02:00
Nikita Popov	88f752a947	Merge branch 'PHP-7.1' into PHP-7.2	2017-07-23 12:21:51 +02:00
Christoph M. Becker	418da85f15	Fix #71606 : Segmentation fault mb_strcut with HTML-ENTITIES The HTML decoding filter uses the `opaque` member of mbfl_convert_filter as buffer, but there was no copy constructor defined, what caused double frees when the filter is copied (what happens multiple times in mb_strcut(), for instance).	2017-07-23 12:19:27 +02:00
Nikita Popov	42ff1aa86c	Fix overflow checks in mbfl_memory_device Also prune out some duplicate code and use strlen() and memcpy() instead of ad-hoc reimplementations. Remove multiplications by sizeof(unsigned char), which wrongly imply that this can be anything but 1.	2017-07-23 11:55:43 +02:00

1 2 3 4 5

236 Commits