path: root/Lib/test/test_unicode.py
Commit message | Author | Age | Files | Lines
* Fix spelling in code comments | Martin Panter | 2016-12-18 | 1 | -1/+1
|
* Moved Unicode C API related tests to separate test class. | Serhiy Storchaka | 2016-10-02 | 1 | -5/+7
|
* Issue #26494: Fixed crash on iterating exhausting iterators. | Serhiy Storchaka | 2016-03-30 | 1 | -0/+5
|     Affected classes are generic sequence iterators, iterators of bytearray, list, tuple, set, frozenset, dict, OrderedDict and corresponding views.
* Issue #19543: Added Py3k warning for decoding unicode. | Serhiy Storchaka | 2015-12-03 | 1 | -4/+6
|
* Issue #24731: Fixed crash on converting objects with special methods | Serhiy Storchaka | 2015-11-25 | 1 | -3/+6
|     __str__, __trunc__, and __float__ returning instances of subclasses of str, long, and float to subclasses of str, long, and float correspondingly.
* Issue #24848: Fixed bugs in UTF-7 decoding of misformed data: | Serhiy Storchaka | 2015-10-02 | 1 | -0/+1
|     1. Non-ASCII bytes were accepted after shift sequence.
|     2. A low surrogate could be emitted in case of error in high surrogate.
* Issue #23055: Fixed a buffer overflow in PyUnicode_FromFormatV. Analysis | Serhiy Storchaka | 2015-01-27 | 1 | -0/+40
|     and fix by Guido Vranken.
* Issue #23181: More "codepoint" -> "code point". | Serhiy Storchaka | 2015-01-18 | 1 | -2/+2
|
* Issue #16056: Rename test methods to avoid conflict. | Berker Peksag | 2014-11-24 | 1 | -1/+1
|
* Issue #22023: Fix %S, %R and %V formats of PyUnicode_FromFormat(). | Victor Stinner | 2014-07-29 | 1 | -0/+112
|
* Issue #12546: Allow \x00 as a fill character for builtin type __format__ methods. | Eric V. Smith | 2014-04-14 | 1 | -0/+21
|
* Issue #20532: Tests which use _testcapi now are marked as CPython only. | Serhiy Storchaka | 2014-02-07 | 1 | -1/+8
|
* Issue #19572: More silently skipped tests explicitly skipped. | Zachary Ware | 2013-12-10 | 1 | -5/+5
|
* Issue #19457: Fixed xmlcharrefreplace tests on wide build when tests are | Serhiy Storchaka | 2013-10-31 | 1 | -2/+2
|     loaded from .py[co] files.
* Issue #15866: The xmlcharrefreplace error handler no more produces two XML | Serhiy Storchaka | 2013-08-06 | 1 | -0/+12
|     entities for a non-BMP character on narrow build.
* Issue #14700: Fix buggy overflow checks for large precision and width in new-style and old-style formatting. | Mark Dickinson | 2012-10-28 | 1 | -0/+27
|
* Issue #13093: Fix error handling on PyUnicode_EncodeDecimal() | Victor Stinner | 2011-11-22 | 1 | -0/+25
|     Add tests for PyUnicode_EncodeDecimal()
* Issue #13333: The UTF-7 decoder now accepts lone surrogates | Antoine Pitrou | 2011-11-15 | 1 | -4/+12
|     (the encoder already accepts them).
* #9200: backport tests but run them on wide builds only. | Ezio Melotti | 2011-08-22 | 1 | -2/+185
|
* #12266: move the tests in test_unicode. | Ezio Melotti | 2011-08-15 | 1 | -0/+19
|
* #6780: fix starts/endswith error message to mention that tuples are accepted too. | Ezio Melotti | 2011-04-26 | 1 | -0/+11
|
* Python 2 can encode/decode surrogates to utf-8. Add a test for this. | Ezio Melotti | 2011-02-28 | 1 | -1/+7
|
* Merged revisions 85861 via svnmerge from | Antoine Pitrou | 2010-10-27 | 1 | -1/+0
|     svn+ssh://pythondev@svn.python.org/python/branches/py3k
|     ........
|     r85861 | antoine.pitrou | 2010-10-27 20:52:48 +0200 (mer., 27 oct. 2010) | 3 lines
|       Recode modules from latin-1 to utf-8
|     ........
* Strengthen test_unicode with explicit type checking for assertEqual tests. | Florent Xicluna | 2010-09-13 | 1 | -8/+22
|
* Check PendingDeprecationWarning after issue #7994. | Florent Xicluna | 2010-09-13 | 1 | -4/+7
|
* Merged revisions 84470-84471,84566-84567,84759 via svnmerge from | Florent Xicluna | 2010-09-13 | 1 | -5/+5
|     svn+ssh://pythondev@svn.python.org/python/branches/py3k
|     ........
|     r84470 | florent.xicluna | 2010-09-03 22:00:37 +0200 (ven., 03 sept. 2010) | 1 line
|       Strengthen BytesWarning tests.
|     ........
|     r84471 | florent.xicluna | 2010-09-03 22:23:40 +0200 (ven., 03 sept. 2010) | 1 line
|       Typo
|     ........
|     r84566 | florent.xicluna | 2010-09-06 22:27:15 +0200 (lun., 06 sept. 2010) | 1 line
|       typo
|     ........
|     r84567 | florent.xicluna | 2010-09-06 22:27:55 +0200 (lun., 06 sept. 2010) | 1 line
|       typo
|     ........
|     r84759 | florent.xicluna | 2010-09-13 04:28:18 +0200 (lun., 13 sept. 2010) | 1 line
|       Reenable test_ucs4 and remove some duplicated lines.
|     ........
* Sub-issue of #9036: Fix incorrect use of Py_CHARMASK. | Stefan Krah | 2010-07-19 | 1 | -0/+1
|
* use unicode literals | Benjamin Peterson | 2010-06-07 | 1 | -3/+3
|
* correctly overflow when indexes are too large | Benjamin Peterson | 2010-06-07 | 1 | -0/+3
|
* Add a NEWS entry for r81758 and clarify a comment. | Ezio Melotti | 2010-06-05 | 1 | -3/+3
|
* Update PyUnicode_DecodeUTF8 from RFC 2279 to RFC 3629. | Ezio Melotti | 2010-06-05 | 1 | -0/+158
|     1) #8271: when a byte sequence is invalid, only the start byte and all the valid continuation bytes are now replaced by U+FFFD, instead of replacing the number of bytes specified by the start byte. See http://www.unicode.org/versions/Unicode5.2.0/ch03.pdf (pages 94-95);
|     2) 5- and 6-bytes-long UTF-8 sequences are now considered invalid (no changes in behavior);
|     3) Add code and tests to reject surrogates (U+D800-U+DFFF) as defined in RFC 3629, but leave it commented out since it's not backward compatible;
|     4) Change the error messages "unexpected code byte" to "invalid start byte" and "invalid data" to "invalid continuation byte";
|     5) Add an extensive set of tests in test_unicode;
|     6) Fix test_codeccallbacks because it was failing after this change.
* #8016: add the CP858 codec (approved by Benjamin). (Also add CP720 to the tests, it was missing there.) | Georg Brandl | 2010-05-24 | 1 | -4/+4
|
* Fix the NEWS about my last commit: an unicode subclass can now override the | Victor Stinner | 2010-03-22 | 1 | -2/+0
|     __unicode__ method (and not the __str__ method). Simplify also the testcase.
* Issue #1583863: An unicode subclass can now override the __str__ method | Victor Stinner | 2010-03-22 | 1 | -0/+11
|
* Issue #7849: Now the utility ``check_warnings`` verifies if the warnings are | Florent Xicluna | 2010-03-07 | 1 | -2/+1
|     effectively raised. A new utility ``check_py3k_warnings`` deals with py3k warnings.
* Issue #7649: Fix u'%c' % char for character in range 0x80..0xFF | Victor Stinner | 2010-02-23 | 1 | -0/+13
|     => raise an UnicodeDecodeError. Patch written by Ezio Melotti.
* use assert[Not]In where appropriate | Ezio Melotti | 2010-01-23 | 1 | -51/+51
|
* Issue #7462: Implement the stringlib fast search algorithm for the `rfind`, | Antoine Pitrou | 2010-01-02 | 1 | -1/+4
|     `rindex`, `rsplit` and `rpartition` methods. Patch by Florent Xicluna.
* Issue #1680159: unicode coercion during an 'in' operation was masking | R. David Murray | 2009-12-14 | 1 | -1/+3
|     any errors that might occur during coercion of the left operand and turning them into a TypeError with a message text that was confusing in the given context. This patch lets any errors through, as was already done during coercion of the right hand side.
* add keyword arguments support to str/unicode encode and decode #6300 | Benjamin Peterson | 2009-09-18 | 1 | -0/+8
|
* convert usage of fail* to assert* | Benjamin Peterson | 2009-06-30 | 1 | -63/+63
|
* Issue 6089: str.format raises SystemError. | Eric Smith | 2009-05-23 | 1 | -0/+4
|
* Issue #4426: The UTF-7 decoder was too strict and didn't accept some legal sequences. | Antoine Pitrou | 2009-05-04 | 1 | -6/+15
|     Patch by Nick Barnes and Victor Stinner.
* Unicode format tests weren't actually testing unicode. This was probably due to the original backport from py3k. | Eric Smith | 2009-03-14 | 1 | -51/+51
|
* Issue 5237, Allow auto-numbered replacement fields in str.format() strings. | Eric Smith | 2009-03-14 | 1 | -3/+33
|     For simple uses for str.format(), this makes the typing easier. Hopefully this will help in the adoption of str.format().
|
|     For example:
|         'The {} is {}'.format('sky', 'blue')
|
|     You can mix and match auto-numbering and named replacement fields:
|         'The {} is {color}'.format('sky', color='blue')
|
|     But you can't mix and match auto-numbering and specified numbering:
|         'The {0} is {}'.format('sky', 'blue')
|         ValueError: cannot switch from manual field specification to automatic field numbering
|
|     Will port to 3.1.
* #3601: test_unicode.test_raiseMemError fails in UCS4 | Antoine Pitrou | 2008-09-05 | 1 | -1/+4
|     Reviewed by Benjamin Peterson on IRC.
* #3556: test_raiseMemError consumes an insane amount of memory | Antoine Pitrou | 2008-08-17 | 1 | -8/+3
|
* Correct a crash when two successive unicode allocations fail with a MemoryError: | Amaury Forgeot d'Arc | 2008-07-31 | 1 | -0/+14
|     the freelist contained half-initialized objects with freed pointers.
|
|     The comment /* XXX UNREF/NEWREF interface should be more symmetrical */ was copied from tupleobject.c, and appears in some other places. I sign the petition.
* #2242: utf7 decoding crashes on bogus input on some Windows/MSVC versions | Antoine Pitrou | 2008-07-25 | 1 | -0/+3
|
* #1477: ur'\U0010FFFF' raised in narrow unicode builds. | Amaury Forgeot d'Arc | 2008-03-23 | 1 | -2/+15
|     Corrected the raw-unicode-escape codec to use UTF-16 surrogates in this case, just like the unicode-escape codec.