summaryrefslogtreecommitdiffstats
path: root/Objects/unicodeobject.c
Commit message (Collapse)AuthorAgeFilesLines
...
| | * cleanup overflowing handling in unicode_decode_call_errorhandler and ↵Benjamin Peterson2014-09-291-18/+56
| | | | | | | | | | | | unicode_encode_ucs1 (closes #22518)
* | | Removed redundant casts to `char *`.Serhiy Storchaka2014-09-281-2/+2
| | | | | | | | | | | | Corresponding functions now accept `const char *` (issue #1772673).
* | | Issue #22215: Now ValueError is raised instead of TypeError when str or bytesSerhiy Storchaka2014-09-061-5/+5
| | | | | | | | | | | | argument contains not permitted null character or byte.
* | | Issue #22156: Fix "comparison between signed and unsigned integers" compilerVictor Stinner2014-08-151-5/+5
| | | | | | | | | | | | | | | | | | | | | warnings in the Objects/ subdirectory. PyType_FromSpecWithBases() and PyType_FromSpec() now reject explicitly negative slot identifiers.
* | | Issue #18395: Rename ``_Py_char2wchar()`` to :c:func:`Py_DecodeLocale`, renameVictor Stinner2014-08-011-4/+4
| | | | | | | | | | | | | | | ``_Py_wchar2char()`` to :c:func:`Py_EncodeLocale`, and document these functions.
* | | unicodeobject.c: fix a compiler warning on Windows 64 bitsVictor Stinner2014-07-251-1/+2
| | |
* | | (Merge 3.4) Issue #21892, #21893: Partial revert of changeset 4f55e802baf0,Victor Stinner2014-07-041-18/+8
|\ \ \ | |/ / | | | | | | PyErr_Format() uses "%zd" for Py_ssize_t, not PY_FORMAT_SIZE_T
| * | Issue #21892, #21893: Partial revert of changeset 4f55e802baf0, PyErr_Format()Victor Stinner2014-07-041-18/+8
| | | | | | | | | | | | uses "%zd" for Py_ssize_t, not PY_FORMAT_SIZE_T
* | | (Merge 3.4) Closes #21892, #21893: Use PY_FORMAT_SIZE_T instead of %zi or %zuVictor Stinner2014-07-011-11/+23
|\ \ \ | |/ / | | | | | | to format C size_t, because %zi/%u is not supported on all platforms.
| * | Closes #21892, #21893: Use PY_FORMAT_SIZE_T instead of %zi or %zu to format CVictor Stinner2014-07-011-11/+23
| | | | | | | | | | | | size_t, because %zi/%u is not supported on all platforms.
* | | Issue #21118: PyLong_AS_LONG() result type is longVictor Stinner2014-04-081-1/+1
| | | | | | | | | | | | Even if PyLong_AS_LONG() cannot fail, I prefer to use the right type.
* | | fix reference leaks in the translate fast path (closes #21175)Benjamin Peterson2014-04-081-14/+8
| | | | | | | | | | | | Patch by Josh Rosenberg.
* | | Issue #21118: Optimize also str.translate() for ASCII => ASCII deletionVictor Stinner2014-04-051-19/+29
| | |
* | | Issue #21118: Add unit test for invalid character replacement (code point ↵Victor Stinner2014-04-051-10/+10
| | | | | | | | | | | | higher than U+10ffff)
* | | Issue #21118: Optimize str.translate() for ASCII => ASCII translationVictor Stinner2014-04-051-1/+120
| | |
* | | Issue #21118: Remove unused variableVictor Stinner2014-04-041-1/+0
| | |
* | | Issue #21118: Use _PyUnicodeWriter API in str.translate() to simplify andVictor Stinner2014-04-041-171/+95
| | | | | | | | | | | | factorize the code
* | | Issue19995: more informative error message; spelling corrections; use ↵Ethan Furman2014-03-211-5/+20
| | | | | | | | | | | | operator.mod instead of __mod__
* | | Issue19995: passing a non-int to %o, %c, %x, or %X now raises an exceptionEthan Furman2014-03-191-26/+2
| | |
* | | Issue #20574: Implement incremental decoder for cp65001 codeVictor Stinner2014-03-171-32/+9
|/ / | | | | | | (Windows code page 65001, Microsoft UTF-8).
* | Make the various iterators' "setstate" sliently and consistently clip theKristján Valur Jónsson2014-03-051-3/+7
|\ \ | |/ | | | | | | index. This avoids the possibility of setting an iterator to an invalid state.
| * Make the various iterators' "setstate" sliently and consistently clip theKristján Valur Jónsson2014-03-051-3/+7
| | | | | | | | | | index. This avoids the possibility of setting an iterator to an invalid state.
| * Issue #19619: Blacklist non-text codecs in method APISerhiy Storchaka2014-02-241-2/+2
| | | | | | | | | | | | | | | | | | | | | | | | str.encode, bytes.decode and bytearray.decode now use an internal API to throw LookupError for known non-text encodings, rather than attempting the encoding or decoding operation and then throwing a TypeError for an unexpected output type. The latter mechanism remains in place for third party non-text encodings. Backported changeset d68df99d7a57.
* | merge 3.3 (#20507)Benjamin Peterson2014-02-151-1/+1
|\ \ | |/
| * give non-iterable TypeError a message (closes #20507)Benjamin Peterson2014-02-151-1/+1
| |
* | Issue #20437: Fixed 22 potential bugs when deleting objects references.Serhiy Storchaka2014-02-091-10/+5
|\ \ | |/
| * Issue #20437: Fixed 21 potential bugs when deleting objects references.Serhiy Storchaka2014-02-091-8/+4
| |
* | Issue #20530: Argument Clinic's signature format has been revised again.Larry Hastings2014-02-091-2/+4
| | | | | | | | | | | | | | The new syntax is highly human readable while still preventing false positives. The syntax also extends Python syntax to denote "self" and positional-only parameters, allowing inspect.Signature objects to be totally accurate for all supported builtins in Python 3.4.
* | Issue #20538: UTF-7 incremental decoder produced inconsistant string whenSerhiy Storchaka2014-02-081-1/+9
|\ \ | |/ | | | | input was truncated in BASE64 section.
| * Issue #20538: UTF-7 incremental decoder produced inconsistant string whenSerhiy Storchaka2014-02-081-1/+9
| | | | | | | | input was truncated in BASE64 section.
* | Issue #20326: Argument Clinic now uses a simple, unique signature toLarry Hastings2014-01-281-3/+3
| | | | | | | | | | | | | | | | | | | | annotate text signatures in docstrings, resulting in fewer false positives. "self" parameters are also explicitly marked, allowing inspect.Signature() to authoritatively detect (and skip) said parameters. Issue #20326: Argument Clinic now generates separate checksums for the input and output sections of the block, allowing external tools to verify that the input has not changed (and thus the output is not out-of-date).
* | Issue #20390: Small fixes and improvements for Argument Clinic.Larry Hastings2014-01-261-1/+1
| |
* | Issue #20189: Four additional builtin types (PyTypeObject,Larry Hastings2014-01-241-4/+4
| | | | | | | | | | | | PyMethodDescr_Type, _PyMethodWrapper_Type, and PyWrapperDescr_Type) have been modified to provide introspection information for builtins. Also: many additional Lib, test suite, and Argument Clinic fixes.
* | Issue19995: fixed typo; switched from test.support.check_warnings to assertWarnsEthan Furman2014-01-121-2/+2
| |
* | Issue19995: issue deprecation warning for non-integer values to %c, %o, %x, %XEthan Furman2014-01-121-2/+27
| |
* | Issue #19273: The marker comments Argument Clinic uses have been changedLarry Hastings2014-01-071-6/+6
| | | | | | | | to improve readability.
* | Issue19995: %o, %x, %X now only accept intsEthan Furman2014-01-051-8/+27
| |
* | Reverted changeset b72c5573c5e7 (issue #15027).Serhiy Storchaka2014-01-041-41/+61
| |
* | Issue #15027: Rewrite the UTF-32 encoder. It is now 1.6x to 3.5x faster.Serhiy Storchaka2014-01-041-61/+41
| |
* | Remove deadcode (HASH macro is no more defined)Victor Stinner2014-01-031-1/+0
| |
* | Remove now unused variablesVictor Stinner2014-01-031-5/+0
| |
* | unicode_char() uses get_latin1_char() to get latin1 singleton charactersVictor Stinner2014-01-031-0/+3
| |
* | add unicode_char() in unicodeobject.c to factorize codeVictor Stinner2014-01-031-55/+31
| |
* | Issue #19674: inspect.signature() now produces a correct signatureLarry Hastings2013-11-231-4/+7
| | | | | | | | for some builtins.
* | Issue #19730: Argument Clinic now supports all the existing PyArgLarry Hastings2013-11-231-5/+5
| | | | | | | | | | "format units" as legacy converters, as well as two new features: "self converters" and the "version" directive.
* | Issue #19619: Blacklist non-text codecs in method APINick Coghlan2013-11-221-2/+2
| | | | | | | | | | | | | | | | | | | | str.encode, bytes.decode and bytearray.decode now use an internal API to throw LookupError for known non-text encodings, rather than attempting the encoding or decoding operation and then throwing a TypeError for an unexpected output type. The latter mechanism remains in place for third party non-text encodings.
* | ssue #19183: Implement PEP 456 'secure and interchangeable hash algorithm'.Christian Heimes2013-11-201-33/+2
| | | | | | | | Python now uses SipHash24 on all major platforms.
* | Add _PyUnicodeWriter_WriteASCIIString() functionVictor Stinner2013-11-191-18/+72
| |
* | Issue #12892: The utf-16* and utf-32* codecs now reject (lone) surrogates.Serhiy Storchaka2013-11-191-24/+221
| | | | | | | | | | | | | | | | | | | | The utf-16* and utf-32* encoders no longer allow surrogate code points (U+D800-U+DFFF) to be encoded. The utf-32* decoders no longer decode byte sequences that correspond to surrogate code points. The surrogatepass error handler now works with the utf-16* and utf-32* codecs. Based on patches by Victor Stinner and Kang-Hao (Kenny) Lu.
* | Issue #19581: Change the overallocation factor of _PyUnicodeWriter on WindowsVictor Stinner2013-11-181-6/+17
| | | | | | | | On Windows, a factor of 50% gives best performances.