path: root/Objects/unicodeobject.c
Commit message [Author, Age, Files, Lines]
...
* | | Issue #21118: Optimize also str.translate() for ASCII => ASCII deletion [Victor Stinner, 2014-04-05, 1, -19/+29]
| | |
* | | Issue #21118: Add unit test for invalid character replacement (code point higher than U+10ffff) [Victor Stinner, 2014-04-05, 1, -10/+10]
| | |
* | | Issue #21118: Optimize str.translate() for ASCII => ASCII translation [Victor Stinner, 2014-04-05, 1, -1/+120]
| | |
* | | Issue #21118: Remove unused variable [Victor Stinner, 2014-04-04, 1, -1/+0]
| | |
* | | Issue #21118: Use _PyUnicodeWriter API in str.translate() to simplify and [Victor Stinner, 2014-04-04, 1, -171/+95]
| | |     factorize the code
* | | Issue19995: more informative error message; spelling corrections; use operator.mod instead of __mod__ [Ethan Furman, 2014-03-21, 1, -5/+20]
| | |
* | | Issue19995: passing a non-int to %o, %c, %x, or %X now raises an exception [Ethan Furman, 2014-03-19, 1, -26/+2]
| | |
* | | Issue #20574: Implement incremental decoder for cp65001 code [Victor Stinner, 2014-03-17, 1, -32/+9]
|/ /      (Windows code page 65001, Microsoft UTF-8).
* | Make the various iterators' "setstate" silently and consistently clip the [Kristján Valur Jónsson, 2014-03-05, 1, -3/+7]
|\ \      index. This avoids the possibility of setting an iterator to an invalid state.
| |/
| * Make the various iterators' "setstate" silently and consistently clip the [Kristján Valur Jónsson, 2014-03-05, 1, -3/+7]
| |     index. This avoids the possibility of setting an iterator to an invalid state.
| * Issue #19619: Blacklist non-text codecs in method API [Serhiy Storchaka, 2014-02-24, 1, -2/+2]
| |     str.encode, bytes.decode and bytearray.decode now use an internal API to throw LookupError for known non-text encodings, rather than attempting the encoding or decoding operation and then throwing a TypeError for an unexpected output type.
| |     The latter mechanism remains in place for third party non-text encodings.
| |     Backported changeset d68df99d7a57.
* | merge 3.3 (#20507) [Benjamin Peterson, 2014-02-15, 1, -1/+1]
|\ \
| |/
| * give non-iterable TypeError a message (closes #20507) [Benjamin Peterson, 2014-02-15, 1, -1/+1]
| |
* | Issue #20437: Fixed 22 potential bugs when deleting objects references. [Serhiy Storchaka, 2014-02-09, 1, -10/+5]
|\ \
| |/
| * Issue #20437: Fixed 21 potential bugs when deleting objects references. [Serhiy Storchaka, 2014-02-09, 1, -8/+4]
| |
* | Issue #20530: Argument Clinic's signature format has been revised again. [Larry Hastings, 2014-02-09, 1, -2/+4]
| |     The new syntax is highly human readable while still preventing false positives. The syntax also extends Python syntax to denote "self" and positional-only parameters, allowing inspect.Signature objects to be totally accurate for all supported builtins in Python 3.4.
* | Issue #20538: UTF-7 incremental decoder produced inconsistent string when [Serhiy Storchaka, 2014-02-08, 1, -1/+9]
|\ \      input was truncated in BASE64 section.
| |/
| * Issue #20538: UTF-7 incremental decoder produced inconsistent string when [Serhiy Storchaka, 2014-02-08, 1, -1/+9]
| |     input was truncated in BASE64 section.
* | Issue #20326: Argument Clinic now uses a simple, unique signature to [Larry Hastings, 2014-01-28, 1, -3/+3]
| |     annotate text signatures in docstrings, resulting in fewer false positives. "self" parameters are also explicitly marked, allowing inspect.Signature() to authoritatively detect (and skip) said parameters.
| |     Issue #20326: Argument Clinic now generates separate checksums for the input and output sections of the block, allowing external tools to verify that the input has not changed (and thus the output is not out-of-date).
* | Issue #20390: Small fixes and improvements for Argument Clinic. [Larry Hastings, 2014-01-26, 1, -1/+1]
| |
* | Issue #20189: Four additional builtin types (PyTypeObject, [Larry Hastings, 2014-01-24, 1, -4/+4]
| |     PyMethodDescr_Type, _PyMethodWrapper_Type, and PyWrapperDescr_Type) have been modified to provide introspection information for builtins.
| |     Also: many additional Lib, test suite, and Argument Clinic fixes.
* | Issue19995: fixed typo; switched from test.support.check_warnings to assertWarns [Ethan Furman, 2014-01-12, 1, -2/+2]
| |
* | Issue19995: issue deprecation warning for non-integer values to %c, %o, %x, %X [Ethan Furman, 2014-01-12, 1, -2/+27]
| |
* | Issue #19273: The marker comments Argument Clinic uses have been changed [Larry Hastings, 2014-01-07, 1, -6/+6]
| |     to improve readability.
* | Issue19995: %o, %x, %X now only accept ints [Ethan Furman, 2014-01-05, 1, -8/+27]
| |
* | Reverted changeset b72c5573c5e7 (issue #15027). [Serhiy Storchaka, 2014-01-04, 1, -41/+61]
| |
* | Issue #15027: Rewrite the UTF-32 encoder. It is now 1.6x to 3.5x faster. [Serhiy Storchaka, 2014-01-04, 1, -61/+41]
| |
* | Remove deadcode (HASH macro is no more defined) [Victor Stinner, 2014-01-03, 1, -1/+0]
| |
* | Remove now unused variables [Victor Stinner, 2014-01-03, 1, -5/+0]
| |
* | unicode_char() uses get_latin1_char() to get latin1 singleton characters [Victor Stinner, 2014-01-03, 1, -0/+3]
| |
* | add unicode_char() in unicodeobject.c to factorize code [Victor Stinner, 2014-01-03, 1, -55/+31]
| |
* | Issue #19674: inspect.signature() now produces a correct signature [Larry Hastings, 2013-11-23, 1, -4/+7]
| |     for some builtins.
* | Issue #19730: Argument Clinic now supports all the existing PyArg [Larry Hastings, 2013-11-23, 1, -5/+5]
| |     "format units" as legacy converters, as well as two new features: "self converters" and the "version" directive.
* | Issue #19619: Blacklist non-text codecs in method API [Nick Coghlan, 2013-11-22, 1, -2/+2]
| |     str.encode, bytes.decode and bytearray.decode now use an internal API to throw LookupError for known non-text encodings, rather than attempting the encoding or decoding operation and then throwing a TypeError for an unexpected output type.
| |     The latter mechanism remains in place for third party non-text encodings.
* | Issue #19183: Implement PEP 456 'secure and interchangeable hash algorithm'. [Christian Heimes, 2013-11-20, 1, -33/+2]
| |     Python now uses SipHash24 on all major platforms.
* | Add _PyUnicodeWriter_WriteASCIIString() function [Victor Stinner, 2013-11-19, 1, -18/+72]
| |
* | Issue #12892: The utf-16* and utf-32* codecs now reject (lone) surrogates. [Serhiy Storchaka, 2013-11-19, 1, -24/+221]
| |     The utf-16* and utf-32* encoders no longer allow surrogate code points (U+D800-U+DFFF) to be encoded. The utf-32* decoders no longer decode byte sequences that correspond to surrogate code points. The surrogatepass error handler now works with the utf-16* and utf-32* codecs.
| |     Based on patches by Victor Stinner and Kang-Hao (Kenny) Lu.
* | Issue #19581: Change the overallocation factor of _PyUnicodeWriter on Windows [Victor Stinner, 2013-11-18, 1, -6/+17]
| |     On Windows, a factor of 50% gives best performances.
* | Argument Clinic: rename "self" to "module" for module-level functions. [Larry Hastings, 2013-11-18, 1, -1/+1]
| |
* | #17806: Added keyword-argument support for "tabsize" to str/bytes.expandtabs(). [Ezio Melotti, 2013-11-16, 1, -5/+9]
| |
* | Close #17828: better handling of codec errors [Nick Coghlan, 2013-11-13, 1, -9/+18]
| |     - output type errors now redirect users to the type-neutral convenience functions in the codecs module
| |     - stateless errors that occur during encoding and decoding will now be automatically wrapped in exceptions that give the name of the codec involved
* | _Py_normalize_encoding(): explain how the value 6 was computed [Victor Stinner, 2013-11-07, 1, -0/+1]
| |
* | Fix _Py_normalize_encoding(): ensure that buffer is big enough to store "utf-8" [Victor Stinner, 2013-11-07, 1, -0/+2]
| |     if the input string is NULL
* | Issue #19512: add _PyUnicode_CompareWithId() function [Victor Stinner, 2013-11-06, 1, -0/+9]
| |     _PyUnicode_CompareWithId() is faster than PyUnicode_CompareWithASCIIString() when both strings are equal and interned.
| |     Add also _PyId_builtins identifier for "builtins" common string.
* | Issue #19424: PyUnicode_CompareWithASCIIString() normalizes memcmp() result [Victor Stinner, 2013-11-04, 1, -2/+6]
| |     to -1, 0, 1
* | Issue #16286: remove duplicated identity check from unicode_compare() [Victor Stinner, 2013-11-04, 1, -4/+5]
| |     Move the test to PyUnicode_Compare()
* | Issue #16286: optimize PyUnicode_RichCompare() for identical strings (same [Victor Stinner, 2013-11-04, 1, -5/+19]
| |     pointer) for any operator, not only Py_EQ and Py_NE. Code of bytes_richcompare() and PyUnicode_RichCompare() is now closer.
* | Issue #16286: write a new subfunction bytes_compare_eq() [Victor Stinner, 2013-11-04, 1, -5/+3]
| |     * cleanup bytes_richcompare()
| |     * PyUnicode_RichCompare(): replace a test with a XOR
* | Issue #19424: Fix a compiler warning on comparing signed/unsigned size_t [Victor Stinner, 2013-11-03, 1, -1/+1]
| |     Patch written by Zachary Ware.
* | Issue #19424: Fix a compiler warning [Victor Stinner, 2013-10-30, 1, -1/+1]
| |     memcmp() just takes raw pointers