summaryrefslogtreecommitdiffstats
path: root/Lib/test/test_unicode.py
Commit message (Collapse)AuthorAgeFilesLines
...
| * Issue #13333: The UTF-7 decoder now accepts lone surrogatesAntoine Pitrou2011-11-151-4/+12
| | | | | | | | (the encoder already accepts them).
* | Fix range in test.Ezio Melotti2011-11-111-1/+1
| |
* | Make test more inclusiveAntoine Pitrou2011-11-111-6/+4
| |
* | Enable commented out testAntoine Pitrou2011-11-111-1/+1
| |
* | Issue #13134: optimize finding single-character strings using memchrAntoine Pitrou2011-10-111-0/+17
| |
* | test_unicode was forgetting to run the common string tests for str.find()Antoine Pitrou2011-10-081-0/+1
|\ \ | |/
| * test_unicode was forgetting to run the common string tests for str.find()Antoine Pitrou2011-10-081-0/+1
| |
* | Mark 'abc'.expandtab() optimization as specific to CPythonVictor Stinner2011-10-071-3/+4
| | | | | | | | Improve also str.replace(a, a) test
* | str.replace(a, a) is now returning str unchanged if a is aVictor Stinner2011-10-071-0/+6
| |
* | #13054: fix usage of sys.maxunicode after PEP-393.Ezio Melotti2011-10-041-16/+4
| |
* | When expandtabs() would be a no-op, don't create a duplicate stringAntoine Pitrou2011-10-041-0/+4
| |
* | Optimize unicode_subtype_new(): don't encode to wchar_t and decode from wchar_tVictor Stinner2011-10-011-4/+7
| | | | | | | | Rewrite unicode_subtype_new(): allocate directly the right type.
* | remove "fast-path" for (i)adding stringsBenjamin Peterson2011-10-011-0/+12
| | | | | | | | | | | | These were just an artifact of the old unicode concatenation hack and likely just penalized other kinds of adding. Also, this fixes __(i)add__ on string subclasses.
* | Fix struct sizes. Drop -1, since the resulting string was actually the ↵Martin v. Löwis2011-09-281-6/+7
| | | | | | | | | | | | largest one that could be allocated.
* | Implement PEP 393.Martin v. Löwis2011-09-281-14/+27
| |
* | Merge indentation fix and skip decorator with 3.2.Ezio Melotti2011-08-221-24/+28
|\ \ | |/
| * Fix indentation and add a skip decorator.Ezio Melotti2011-08-221-24/+28
| |
* | #9200: merge with 3.2.Ezio Melotti2011-08-221-1/+157
|\ \ | |/
| * #9200: The str.is* methods now work with strings that contain non-BMP ↵Ezio Melotti2011-08-221-1/+157
| | | | | | | | characters even in narrow Unicode builds.
* | merge 3.2 (#12732)Benjamin Peterson2011-08-131-0/+1
|\ \ | |/
| * in narrow builds, make sure to test codepoints as identifier characters ↵Benjamin Peterson2011-08-131-0/+1
| | | | | | | | | | | | (closes #12732) This fixes the use of Unicode identifiers outside the BMP in narrow builds.
* | Merge from 3.2.Eric V. Smith2011-07-181-0/+5
|\ \ | |/
| * Closes #12579. Positional fields with str.format_map() now raise a ↵Eric V. Smith2011-07-181-0/+5
| | | | | | | | ValueError instead of SystemError.
* | merge from 3.2Senthil Kumaran2011-07-041-0/+1
|\ \ | |/
| * Fix closes issue12471 - wrong TypeError message when '%i' format spec was used.Senthil Kumaran2011-07-041-0/+1
| |
* | #6780: merge with 3.2.Ezio Melotti2011-04-261-0/+8
|\ \ | |/
| * #6780: merge with 3.1.Ezio Melotti2011-04-261-0/+8
| |\
| | * #6780: fix starts/endswith error message to mention that tuples are accepted ↵Ezio Melotti2011-04-261-0/+11
| | | | | | | | | | | | too.
| | * Merged revisions 86596 via svnmerge fromEzio Melotti2010-11-211-3/+3
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r86596 | ezio.melotti | 2010-11-20 21:04:17 +0200 (Sat, 20 Nov 2010) | 1 line #9424: Replace deprecated assert* methods in the Python test suite. ........
| | * Merged revisions 85861 via svnmerge fromAntoine Pitrou2010-10-271-3/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r85861 | antoine.pitrou | 2010-10-27 20:52:48 +0200 (mer., 27 oct. 2010) | 3 lines Recode modules from latin-1 to utf-8 ........
| | * Merged revisions 83966 via svnmerge fromEric Smith2010-08-131-5/+0
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r83966 | eric.smith | 2010-08-12 17:55:30 -0400 (Thu, 12 Aug 2010) | 1 line Remove unused test class. ........
| | * Merged revisions 82978 via svnmerge fromStefan Krah2010-07-191-0/+1
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r82978 | stefan.krah | 2010-07-19 19:58:26 +0200 (Mon, 19 Jul 2010) | 3 lines Sub-issue of #9036: Fix incorrect use of Py_CHARMASK. ........
| | * Merged revisions 82413,82468 via svnmerge fromEzio Melotti2010-07-031-0/+153
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r82413 | ezio.melotti | 2010-07-01 10:32:02 +0300 (Thu, 01 Jul 2010) | 13 lines Update PyUnicode_DecodeUTF8 from RFC 2279 to RFC 3629. 1) #8271: when a byte sequence is invalid, only the start byte and all the valid continuation bytes are now replaced by U+FFFD, instead of replacing the number of bytes specified by the start byte. See http://www.unicode.org/versions/Unicode5.2.0/ch03.pdf (pages 94-95); 2) 5- and 6-bytes-long UTF-8 sequences are now considered invalid (no changes in behavior); 3) Change the error messages "unexpected code byte" to "invalid start byte" and "invalid data" to "invalid continuation byte"; 4) Add an extensive set of tests in test_unicode; 5) Fix test_codeccallbacks because it was failing after this change. ........ r82468 | ezio.melotti | 2010-07-03 07:52:19 +0300 (Sat, 03 Jul 2010) | 1 line Update comment about surrogates. ........
| | * Merged revisions 81823,81835 via svnmerge fromBenjamin Peterson2010-06-081-0/+3
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ................ r81823 | benjamin.peterson | 2010-06-07 17:31:26 -0500 (Mon, 07 Jun 2010) | 9 lines Merged revisions 81820 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r81820 | benjamin.peterson | 2010-06-07 17:23:23 -0500 (Mon, 07 Jun 2010) | 1 line correctly overflow when indexes are too large ........ ................ r81835 | benjamin.peterson | 2010-06-08 09:57:22 -0500 (Tue, 08 Jun 2010) | 9 lines Merged revisions 81834 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r81834 | benjamin.peterson | 2010-06-08 09:53:29 -0500 (Tue, 08 Jun 2010) | 1 line kill extra word ........ ................
| | * Merged revisions 79281 via svnmerge fromVictor Stinner2010-03-221-0/+8
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ................ r79281 | victor.stinner | 2010-03-22 13:50:40 +0100 (lun., 22 mars 2010) | 16 lines Merged revisions 79278,79280 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r79278 | victor.stinner | 2010-03-22 13:24:37 +0100 (lun., 22 mars 2010) | 2 lines Issue #1583863: An unicode subclass can now override the __str__ method ........ r79280 | victor.stinner | 2010-03-22 13:36:28 +0100 (lun., 22 mars 2010) | 5 lines Fix the NEWS about my last commit: an unicode subclass can now override the __unicode__ method (and not the __str__ method). Simplify also the testcase. ........ ................
| | * Merged revisions 73715 via svnmerge fromGeorg Brandl2009-08-131-38/+38
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://svn.python.org/python/branches/py3k ........ r73715 | benjamin.peterson | 2009-07-01 01:06:06 +0200 (Mi, 01 Jul 2009) | 1 line convert old fail* assertions to assert* ........
| * | Merged revisions 88697 via svnmerge fromVictor Stinner2011-03-011-0/+13
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r88697 | victor.stinner | 2011-03-01 23:46:52 +0100 (mar., 01 mars 2011) | 4 lines Issue #11246: Fix PyUnicode_FromFormat("%V") Decode the byte string from UTF-8 (with replace error handler) instead of ISO-8859-1 (in strict mode). Patch written by Ray Allen. ........
| * | Merged revisions 88481 via svnmerge fromVictor Stinner2011-02-231-1/+4
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r88481 | victor.stinner | 2011-02-21 22:13:44 +0100 (lun., 21 févr. 2011) | 4 lines Fix PyUnicode_FromFormatV("%c") for non-BMP char Issue #10830: Fix PyUnicode_FromFormatV("%c") for non-BMP characters on narrow build. ........
* | | Issue 9856: Change object.__format__ with a non-empty format string from a ↵Eric V. Smith2011-03-121-1/+1
| | | | | | | | | | | | PendingDeprecationWarning to a DeprecationWarning.
* | | Issue #10831: PyUnicode_FromFormat() supports %li, %lli and %zi formatsVictor Stinner2011-03-021-11/+28
| | |
* | | Issue #10829: Refactor PyUnicode_FromFormat()Victor Stinner2011-03-011-0/+19
| | | | | | | | | | | | | | | * Use the same function to parse the format string in the 3 steps * Fix crashs on invalid format strings
* | | Issue #11246: Fix PyUnicode_FromFormat("%V")Victor Stinner2011-03-011-0/+13
| | | | | | | | | | | | | | | Decode the byte string from UTF-8 (with replace error handler) instead of ISO-8859-1 (in strict mode). Patch written by Ray Allen.
* | | Normalize the encoding names for Latin-1 and UTF-8 toMarc-André Lemburg2011-02-251-1/+4
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 'latin-1' and 'utf-8'. These are optimized in the Python Unicode implementation to result in more direct processing, bypassing the codec registry. Also see issue11303.
* | | Fix PyUnicode_FromFormatV("%c") for non-BMP charVictor Stinner2011-02-211-1/+4
|/ / | | | | | | | | Issue #10830: Fix PyUnicode_FromFormatV("%c") for non-BMP characters on narrow build.
* | Issue #11302: missing type check on _string.formatter_field_name_split and ↵Eric Smith2011-01-291-0/+52
| | | | | | | | | | | | _string.formatter_parser caused crash. Originial patch by haypo, reviewed by me, okayed by Georg.
* | test_unicode: use ctypes to test PyUnicode_FromFormat()Victor Stinner2011-01-051-6/+20
| | | | | | | | | | Instead of _testcapi.format_unicode() because it has a limited API: it requires exactly one argument of type unicode.
* | Issue #10557: Fixed error messages from float() and other numericAlexander Belopolsky2010-12-041-2/+7
| | | | | | | | | | | | types. Added a new API function, PyUnicode_TransformDecimalToASCII(), which transforms non-ASCII decimal digits in a Unicode string to their ASCII equivalents.
* | #10273: Rename assertRegexpMatches and assertRaisesRegexp to assertRegex and ↵Ezio Melotti2010-12-011-1/+1
| | | | | | | | assertRaisesRegex.
* | Fix tests when ctypes isn't availableAntoine Pitrou2010-11-221-0/+2
| |
* | #9424: Replace deprecated assert* methods in the Python test suite.Ezio Melotti2010-11-201-18/+18
| |