mirror of https://github.com/php/php-src.git synced 2026-04-02 21:52:36 +02:00
archived-php-src/ext/mbstring/tests
Alex Dowad ff85ed8adc Fix conversion of EUC-TW text (and add test suite)
- Treat text which ends abruptly in the middle of a multi-byte
  character as erroneous.
- Don't allow ASCII control characters to appear in the middle of a
  multi-byte character.
- If an illegal byte appears in the middle of a multi-byte character,
  go back to the initial state rather than trying to finish the
  multi-byte character.
- The file containing the conversion tables computed the 'maximum
  codepoint which can be converted using table A2' from the size of
  table A1 rather than table A2. As a result, several hundred Unicode
  codepoints which should have been convertible to EUC-TW were flagged
  as erroneous instead.
- When a sequence which cannot possibly be a prefix of a valid
  multi-byte character is found, immediately flag it as an error, rather
  than waiting to read more bytes first.
- Allow characters in CNS-11643 plane 1 to be encoded as 4-byte
  sequences (although they can also be encoded as 2-byte sequences).
  This is allowed by the standard for EUC-TW text.
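
The byte-level rules listed above can be illustrated with a small
stand-alone scanner. This is only a sketch in Python, not the mbstring
implementation (which is written in C). The function name `scan_euc_tw`
and the tuple format are hypothetical. It assumes the usual EUC-TW
layout (ASCII below 0x80, 2-byte characters with both bytes in
0xA1-0xFE, 4-byte characters of the form 0x8E, plane byte 0xA1-0xB0,
then two bytes in 0xA1-0xFE). It also assumes the offending byte is
consumed together with the error, which may differ from what mbstring
does, and it does not map anything to Unicode.

```python
def scan_euc_tw(data: bytes):
    """Classify EUC-TW input into ('ascii', byte), ('cns', plane, row, col)
    or ('error',) items. Illustrative only; no Unicode mapping is done."""
    out = []
    pending = []  # bytes of the multi-byte character in progress
    need = 0      # total length of that character (0 = initial state)
    for b in data:
        if need == 0:
            if b < 0x80:
                out.append(("ascii", b))
            elif 0xA1 <= b <= 0xFE:
                pending, need = [b], 2   # 2-byte form, CNS 11643 plane 1
            elif b == 0x8E:
                pending, need = [b], 4   # 4-byte form, plane byte comes next
            else:
                out.append(("error",))   # can never start a valid character
            continue
        # Inside a multi-byte character: the plane byte has a narrower range.
        if need == 4 and len(pending) == 1:
            ok = 0xA1 <= b <= 0xB0
        else:
            ok = 0xA1 <= b <= 0xFE
        if not ok:
            # Control characters and other illegal bytes end up here. Flag the
            # error at once and return to the initial state instead of trying
            # to finish the character. Assumption: the bad byte is consumed.
            out.append(("error",))
            pending, need = [], 0
            continue
        pending.append(b)
        if len(pending) == need:
            if need == 2:
                out.append(("cns", 1, pending[0] - 0xA0, pending[1] - 0xA0))
            else:
                out.append(("cns", pending[1] - 0xA0,
                            pending[2] - 0xA0, pending[3] - 0xA0))
            pending, need = [], 0
    if need:
        out.append(("error",))  # input ended in the middle of a character
    return out
```

With this sketch, the 2-byte sequence A4 A1 and the 4-byte sequence
8E A1 A4 A1 classify as the same plane 1 position, a lone A4 at the end
of input yields an error, and 8E B1 is rejected as soon as B1 is read.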
2021-06-29 12:25:21 +02:00