summaryrefslogtreecommitdiffstats
path: root/Lib/test/test_unicode.py
Commit message (Collapse)AuthorAgeFilesLines
* Close #14648: Compute correctly maxchar in str.format() for substrinVictor Stinner2012-04-231-2/+8
|
* inherit maxchar of field value where needed (closes #14648)Benjamin Peterson2012-04-231-0/+2
|
* str.format_map tests don't do what they say: fix to actually implement the ↵Eric V. Smith2012-03-121-6/+9
|\ | | | | | | intent of the test. Closes #13450. Patch by Akira Li.
| * str.format_map tests don't do what they say: fix to actually implement the ↵Eric V. Smith2012-03-121-6/+9
| | | | | | | | intent of the test. Closes #13450.
| * Issue #13093: Fix error handling on PyUnicode_EncodeDecimal()Victor Stinner2011-11-221-0/+36
| | | | | | | | | | * Add tests for PyUnicode_EncodeDecimal() and PyUnicode_TransformDecimalToASCII() * Remove the unused "e" variable in replace()
* | add str.casefold() (closes #13752)Benjamin Peterson2012-01-141-0/+8
| |
* | use full unicode mappings for upper/lower/title case (#12736)Benjamin Peterson2012-01-111-0/+44
| | | | | | | | Also broaden the category of characters that count as lowercase/uppercase.
* | Close #13093: PyUnicode_EncodeDecimal() doesn't support error handlersVictor Stinner2011-11-251-14/+4
| | | | | | | | | | different than "strict" anymore. The caller was unable to compute the size of the output buffer: it depends on the error handler.
* | (Merge 3.2) Issue #13093: Fix error handling on PyUnicode_EncodeDecimal()Victor Stinner2011-11-221-0/+6
| |
* | Add a test on str.__getnewargs__()Victor Stinner2011-11-221-0/+7
| | | | | | | | It tests indirectly PyUnicode_Copy(): ensure that the string is a copy.
* | Rewrite PyUnicode_EncodeDecimal() to use the new Unicode APIVictor Stinner2011-11-211-0/+30
| | | | | | | | | | Add tests for PyUnicode_EncodeDecimal() and PyUnicode_TransformDecimalToASCII().
* | "unicode_internal" codec has been deprecated: fix related testsVictor Stinner2011-11-151-7/+15
| |
* | Issue #13333: The UTF-7 decoder now accepts lone surrogatesAntoine Pitrou2011-11-151-4/+12
|\ \ | |/ | | | | (the encoder already accepts them).
| * Issue #13333: The UTF-7 decoder now accepts lone surrogatesAntoine Pitrou2011-11-151-4/+12
| | | | | | | | (the encoder already accepts them).
* | Fix range in test.Ezio Melotti2011-11-111-1/+1
| |
* | Make test more inclusiveAntoine Pitrou2011-11-111-6/+4
| |
* | Enable commented out testAntoine Pitrou2011-11-111-1/+1
| |
* | Issue #13134: optimize finding single-character strings using memchrAntoine Pitrou2011-10-111-0/+17
| |
* | test_unicode was forgetting to run the common string tests for str.find()Antoine Pitrou2011-10-081-0/+1
|\ \ | |/
| * test_unicode was forgetting to run the common string tests for str.find()Antoine Pitrou2011-10-081-0/+1
| |
* | Mark 'abc'.expandtab() optimization as specific to CPythonVictor Stinner2011-10-071-3/+4
| | | | | | | | Improve also str.replace(a, a) test
* | str.replace(a, a) is now returning str unchanged if a is aVictor Stinner2011-10-071-0/+6
| |
* | #13054: fix usage of sys.maxunicode after PEP-393.Ezio Melotti2011-10-041-16/+4
| |
* | When expandtabs() would be a no-op, don't create a duplicate stringAntoine Pitrou2011-10-041-0/+4
| |
* | Optimize unicode_subtype_new(): don't encode to wchar_t and decode from wchar_tVictor Stinner2011-10-011-4/+7
| | | | | | | | Rewrite unicode_subtype_new(): allocate directly the right type.
* | remove "fast-path" for (i)adding stringsBenjamin Peterson2011-10-011-0/+12
| | | | | | | | | | | | These were just an artifact of the old unicode concatenation hack and likely just penalized other kinds of adding. Also, this fixes __(i)add__ on string subclasses.
* | Fix struct sizes. Drop -1, since the resulting string was actually the ↵Martin v. Löwis2011-09-281-6/+7
| | | | | | | | | | | | largest one that could be allocated.
* | Implement PEP 393.Martin v. Löwis2011-09-281-14/+27
| |
* | Merge indentation fix and skip decorator with 3.2.Ezio Melotti2011-08-221-24/+28
|\ \ | |/
| * Fix indentation and add a skip decorator.Ezio Melotti2011-08-221-24/+28
| |
* | #9200: merge with 3.2.Ezio Melotti2011-08-221-1/+157
|\ \ | |/
| * #9200: The str.is* methods now work with strings that contain non-BMP ↵Ezio Melotti2011-08-221-1/+157
| | | | | | | | characters even in narrow Unicode builds.
* | merge 3.2 (#12732)Benjamin Peterson2011-08-131-0/+1
|\ \ | |/
| * in narrow builds, make sure to test codepoints as identifier characters ↵Benjamin Peterson2011-08-131-0/+1
| | | | | | | | | | | | (closes #12732) This fixes the use of Unicode identifiers outside the BMP in narrow builds.
* | Merge from 3.2.Eric V. Smith2011-07-181-0/+5
|\ \ | |/
| * Closes #12579. Positional fields with str.format_map() now raise a ↵Eric V. Smith2011-07-181-0/+5
| | | | | | | | ValueError instead of SystemError.
* | merge from 3.2Senthil Kumaran2011-07-041-0/+1
|\ \ | |/
| * Fix closes issue12471 - wrong TypeError message when '%i' format spec was used.Senthil Kumaran2011-07-041-0/+1
| |
* | #6780: merge with 3.2.Ezio Melotti2011-04-261-0/+8
|\ \ | |/
| * #6780: merge with 3.1.Ezio Melotti2011-04-261-0/+8
| |\
| | * #6780: fix starts/endswith error message to mention that tuples are accepted ↵Ezio Melotti2011-04-261-0/+11
| | | | | | | | | | | | too.
| | * Merged revisions 86596 via svnmerge fromEzio Melotti2010-11-211-3/+3
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r86596 | ezio.melotti | 2010-11-20 21:04:17 +0200 (Sat, 20 Nov 2010) | 1 line #9424: Replace deprecated assert* methods in the Python test suite. ........
| | * Merged revisions 85861 via svnmerge fromAntoine Pitrou2010-10-271-3/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r85861 | antoine.pitrou | 2010-10-27 20:52:48 +0200 (mer., 27 oct. 2010) | 3 lines Recode modules from latin-1 to utf-8 ........
| | * Merged revisions 83966 via svnmerge fromEric Smith2010-08-131-5/+0
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r83966 | eric.smith | 2010-08-12 17:55:30 -0400 (Thu, 12 Aug 2010) | 1 line Remove unused test class. ........
| | * Merged revisions 82978 via svnmerge fromStefan Krah2010-07-191-0/+1
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r82978 | stefan.krah | 2010-07-19 19:58:26 +0200 (Mon, 19 Jul 2010) | 3 lines Sub-issue of #9036: Fix incorrect use of Py_CHARMASK. ........
| | * Merged revisions 82413,82468 via svnmerge fromEzio Melotti2010-07-031-0/+153
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r82413 | ezio.melotti | 2010-07-01 10:32:02 +0300 (Thu, 01 Jul 2010) | 13 lines Update PyUnicode_DecodeUTF8 from RFC 2279 to RFC 3629. 1) #8271: when a byte sequence is invalid, only the start byte and all the valid continuation bytes are now replaced by U+FFFD, instead of replacing the number of bytes specified by the start byte. See http://www.unicode.org/versions/Unicode5.2.0/ch03.pdf (pages 94-95); 2) 5- and 6-bytes-long UTF-8 sequences are now considered invalid (no changes in behavior); 3) Change the error messages "unexpected code byte" to "invalid start byte" and "invalid data" to "invalid continuation byte"; 4) Add an extensive set of tests in test_unicode; 5) Fix test_codeccallbacks because it was failing after this change. ........ r82468 | ezio.melotti | 2010-07-03 07:52:19 +0300 (Sat, 03 Jul 2010) | 1 line Update comment about surrogates. ........
| | * Merged revisions 81823,81835 via svnmerge fromBenjamin Peterson2010-06-081-0/+3
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ................ r81823 | benjamin.peterson | 2010-06-07 17:31:26 -0500 (Mon, 07 Jun 2010) | 9 lines Merged revisions 81820 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r81820 | benjamin.peterson | 2010-06-07 17:23:23 -0500 (Mon, 07 Jun 2010) | 1 line correctly overflow when indexes are too large ........ ................ r81835 | benjamin.peterson | 2010-06-08 09:57:22 -0500 (Tue, 08 Jun 2010) | 9 lines Merged revisions 81834 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r81834 | benjamin.peterson | 2010-06-08 09:53:29 -0500 (Tue, 08 Jun 2010) | 1 line kill extra word ........ ................
| | * Merged revisions 79281 via svnmerge fromVictor Stinner2010-03-221-0/+8
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ................ r79281 | victor.stinner | 2010-03-22 13:50:40 +0100 (lun., 22 mars 2010) | 16 lines Merged revisions 79278,79280 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r79278 | victor.stinner | 2010-03-22 13:24:37 +0100 (lun., 22 mars 2010) | 2 lines Issue #1583863: An unicode subclass can now override the __str__ method ........ r79280 | victor.stinner | 2010-03-22 13:36:28 +0100 (lun., 22 mars 2010) | 5 lines Fix the NEWS about my last commit: an unicode subclass can now override the __unicode__ method (and not the __str__ method). Simplify also the testcase. ........ ................
| | * Merged revisions 73715 via svnmerge fromGeorg Brandl2009-08-131-38/+38
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://svn.python.org/python/branches/py3k ........ r73715 | benjamin.peterson | 2009-07-01 01:06:06 +0200 (Mi, 01 Jul 2009) | 1 line convert old fail* assertions to assert* ........
| * | Merged revisions 88697 via svnmerge fromVictor Stinner2011-03-011-0/+13
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r88697 | victor.stinner | 2011-03-01 23:46:52 +0100 (mar., 01 mars 2011) | 4 lines Issue #11246: Fix PyUnicode_FromFormat("%V") Decode the byte string from UTF-8 (with replace error handler) instead of ISO-8859-1 (in strict mode). Patch written by Ray Allen. ........