Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | bpo-34523: Support surrogatepass in locale codecs (GH-8995) | Victor Stinner | 2018-08-29 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | Add support for the "surrogatepass" error handler in PyUnicode_DecodeFSDefault() and PyUnicode_EncodeFSDefault() for the UTF-8 encoding. Changes: * _Py_DecodeUTF8Ex() and _Py_EncodeUTF8Ex() now support the surrogatepass error handler (_Py_ERROR_SURROGATEPASS). * _Py_DecodeLocaleEx() and _Py_EncodeLocaleEx() now use the _Py_error_handler enum instead of "int surrogateescape" to pass the error handler. These functions now return -3 if the error handler is unknown. * Add unit tests on _Py_DecodeLocaleEx() and _Py_EncodeLocaleEx() in test_codecs. * Rename get_error_handler() to _Py_GetErrorHandler() and expose it as a private function. * _freeze_importlib doesn't need config.filesystem_errors="strict" workaround anymore. | ||||
* | bpo-20180: complete AC conversion of Objects/stringlib/transmogrify.h (GH-8039) | Tal Einat | 2018-07-06 | 2 | -33/+233 |
| | | | | * converted bytes methods: expandtabs, ljust, rjust, center, zfill * updated char_convertor to properly set the C default value | ||||
* | bpo-33012: Fix invalid function cast warnings with gcc 8 for METH_NOARGS. ↵ | Siddhesh Poyarekar | 2018-04-29 | 1 | -13/+13 |
| | | | | | | | | | (GH-6030) METH_NOARGS functions need only a single argument but they are cast into a PyCFunction, which takes two arguments. This triggers an invalid function cast warning in gcc8 due to the argument mismatch. Fix this by adding a dummy unused argument. | ||||
* | bpo-32677: Add .isascii() to str, bytes and bytearray (GH-5342) | INADA Naoki | 2018-01-27 | 1 | -0/+6 |
| | |||||
* | bpo-31338 (#3374) | Barry Warsaw | 2017-09-15 | 1 | -2/+1 |
| | | | | | | | * Add Py_UNREACHABLE() as an alias to abort(). * Use Py_UNREACHABLE() instead of assert(0) * Convert more unreachable code to use Py_UNREACHABLE() * Document Py_UNREACHABLE() and a few other macros. | ||||
* | bpo-30923: Silence fall-through warnings included in -Wextra since gcc-7.0. ↵ | Stefan Krah | 2017-08-21 | 1 | -2/+2 |
| | | | | (#3157) | ||||
* | bpo-30978: str.format_map() now passes key lookup exceptions through. (#2790) | Serhiy Storchaka | 2017-08-03 | 1 | -6/+10 |
| | | | Previously any exception was replaced with a KeyError exception. | ||||
* | bpo-24821: Fixed the slowing down to 25 times in the searching of some (#505) | Serhiy Storchaka | 2017-03-30 | 1 | -6/+40 |
| | | | | unlucky Unicode characters. | ||||
* | Issue #28999: Use Py_RETURN_NONE, Py_RETURN_TRUE and Py_RETURN_FALSE wherever | Serhiy Storchaka | 2017-01-23 | 1 | -4/+2 |
| | | | | possible but Coccinelle couldn't find opportunity. | ||||
* | Issue #29145: Merge 3.6. | Xiang Zhang | 2017-01-10 | 1 | -1/+1 |
| | |||||
* | Issue #28561: Clean up UTF-8 encoder: remove dead code, update comments, etc. | Serhiy Storchaka | 2016-10-30 | 1 | -10/+4 |
| | | | | Patch by Xiang Zhang. | ||||
* | Issue #28126: Replace Py_MEMCPY with memcpy(). Visual Studio can properly ↵ | Christian Heimes | 2016-09-13 | 2 | -23/+23 |
| | | | | optimize memcpy(). | ||||
* | remove all usage of Py_LOCAL | Benjamin Peterson | 2016-09-09 | 1 | -11/+11 |
| | |||||
* | PEP 7 style for if/else in C | Victor Stinner | 2016-09-02 | 1 | -1/+2 |
| | | | | Add also a newline for readability in normalize_encoding(). | ||||
* | Issue #27895: Spelling fixes (Contributed by Ville Skyttä). | Raymond Hettinger | 2016-08-30 | 1 | -3/+3 |
| | |||||
* | Backed out changeset b0087e17cd5e (issue #26765) | Serhiy Storchaka | 2016-07-03 | 1 | -54/+0 |
| | | | | For unknown reasons it perhaps caused a crash on 32-bit Windows (issue #). | ||||
* | Issue #26765: Moved wrappers for bytes and bytearray methods to common header | Serhiy Storchaka | 2016-07-01 | 1 | -0/+54 |
| | | | | file. | ||||
* | Issue #26765: Ensure that bytes- and unicode-specific stringlib files are used | Serhiy Storchaka | 2016-05-16 | 6 | -12/+15 |
| | | | | with correct type. | ||||
* | Issue #26765: Moved common code for the replace() method of bytes and bytearray | Serhiy Storchaka | 2016-05-05 | 1 | -57/+521 |
| | | | | to a template file. | ||||
* | Issue #26765: Moved common code and docstrings for bytes and bytearray methods | Serhiy Storchaka | 2016-05-04 | 2 | -103/+0 |
| | | | | to bytes_methods.c. | ||||
* | Issue #26778: Fixed "a/an/and" typos in code comment, documentation and error | Serhiy Storchaka | 2016-04-17 | 1 | -1/+1 |
|\ | | | | | | | messages. | ||||
| * | Issue #26778: Fixed "a/an/and" typos in code comment and documentation. | Serhiy Storchaka | 2016-04-17 | 1 | -1/+1 |
| | | |||||
* | | Issue #26057: Got rid of nonneeded use of PyUnicode_FromObject(). | Serhiy Storchaka | 2016-04-13 | 1 | -11/+2 |
| | | |||||
* | | Issue #24821: Refactor STRINGLIB(fastsearch_memchr_1char) and split it on | Serhiy Storchaka | 2015-11-14 | 1 | -63/+87 |
| | | | | | | | | | | STRINGLIB(find_char) and STRINGLIB(rfind_char) that can be used independedly without special preconditions. | ||||
* | | Optimize error handlers of ASCII and Latin1 encoders when the replacement | Victor Stinner | 2015-10-09 | 1 | -11/+7 |
| | | | | | | | | | | | | | | | | | | | | | | string is pure ASCII: use _PyBytesWriter_WriteBytes(), don't check individual character. Cleanup unicode_encode_ucs1(): * Rename repunicode to rep * Clear rep object on error * Factorize code between bytes and unicode path | ||||
* | | Add _PyBytesWriter_WriteBytes() to factorize the code | Victor Stinner | 2015-10-09 | 1 | -11/+11 |
| | | |||||
* | | _PyBytesWriter: simplify code to avoid "prealloc" parameters | Victor Stinner | 2015-10-09 | 1 | -8/+12 |
| | | | | | | | | | | Substract preallocate bytes from min_size before calling _PyBytesWriter_Prepare(). | ||||
* | | Optimize backslashreplace error handler | Victor Stinner | 2015-10-08 | 1 | -2/+16 |
| | | | | | | | | | | | | | | | | | | | | Issue #25318: Optimize backslashreplace and xmlcharrefreplace error handlers in UTF-8 encoder. Optimize also backslashreplace error handler for ASCII and Latin1 encoders. Use the new _PyBytesWriter API to optimize these error handlers for the encoders. It avoids to create an exception and call the slow implementation of the error handler. | ||||
* | | Issue #25318: Add _PyBytesWriter API | Victor Stinner | 2015-10-08 | 1 | -63/+21 |
| | | | | | | | | | | | | | | | | | | | | | | Add a new private API to optimize Unicode encoders. It uses a small buffer allocated on the stack and supports overallocation. Use _PyBytesWriter API for UCS1 (ASCII and Latin1) and UTF-8 encoders. Enable overallocation for the UTF-8 encoder with error handlers. unicode_encode_ucs1(): initialize collend to collstart+1 to not check the current character twice, we already know that it is not ASCII. | ||||
* | | Issue #25267: The UTF-8 encoder is now up to 75 times as fast for error | Victor Stinner | 2015-10-01 | 1 | -51/+96 |
| | | | | | | | | | | handlers: ``ignore``, ``replace``, ``surrogateescape``, ``surrogatepass``. Patch co-written with Serhiy Storchaka. | ||||
* | | Fixed an incorrect comment. | Eric V. Smith | 2015-08-26 | 1 | -1/+1 |
|/ | |||||
* | Fixed typos in comments. | Serhiy Storchaka | 2015-05-18 | 1 | -4/+4 |
|\ | |||||
| * | Fixed typos in comments. | Serhiy Storchaka | 2015-05-18 | 1 | -2/+2 |
| | | |||||
* | | Issue #15027: The UTF-32 encoder is now 3x to 7x faster. | Serhiy Storchaka | 2015-05-12 | 1 | -0/+87 |
| | | |||||
* | | Issue #23573: Increased performance of string search operations (str.find, | Serhiy Storchaka | 2015-03-24 | 2 | -23/+4 |
| | | | | | | | | | | str.index, str.count, the in operator, str.split, str.partition) with arguments of different kinds (UCS1, UCS2, UCS4). | ||||
* | | Removed unintentional trailing spaces in non-external and non-generated C files. | Serhiy Storchaka | 2015-03-18 | 1 | -1/+1 |
|/ | |||||
* | Issue #22896: Avoid to use PyObject_AsCharBuffer(), PyObject_AsReadBuffer() | Serhiy Storchaka | 2015-02-02 | 1 | -1/+8 |
| | | | | and PyObject_AsWriteBuffer(). | ||||
* | Issue #22581: Use more "bytes-like object" throughout the docs and comments. | Serhiy Storchaka | 2014-12-05 | 1 | -3/+3 |
| | |||||
* | s/stringobject/bytesobject/ (closes #22036) | Benjamin Peterson | 2014-07-24 | 1 | -1/+1 |
| | | | | Patch by Martin Matusiak. | ||||
* | merge 3.3 | Benjamin Peterson | 2014-03-30 | 1 | -19/+19 |
|\ | |||||
| * | merge 3.2 | Benjamin Peterson | 2014-03-30 | 1 | -19/+19 |
| |\ | |||||
| | * | fix expandtabs overflow detection to be consistent and not rely on signed ↵ | Benjamin Peterson | 2014-03-30 | 1 | -19/+19 |
| | | | | | | | | | | | | overflow | ||||
| | * | Issue #17173: Remove uses of locale-dependent C functions (isalpha() etc.) ↵ | Antoine Pitrou | 2013-02-09 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | in the interpreter. I've left a couple of them in: zlib (third-party lib), getaddrinfo.c (doesn't include Python.h, and probably obsolete), _sre.c (legitimate use for the re.LOCALE flag). | ||||
| | * | Issue #14700: Fix buggy overflow checks for large precision and width in ↵ | Mark Dickinson | 2012-10-28 | 2 | -18/+13 |
| | | | | | | | | | | | | new-style and old-style formatting. | ||||
| * | | fix format spec recursive expansion (closes #19729) | Benjamin Peterson | 2013-11-27 | 1 | -2/+4 |
| | | | |||||
* | | | Reverted changeset b72c5573c5e7 (issue #15027). | Serhiy Storchaka | 2014-01-04 | 1 | -87/+0 |
| | | | |||||
* | | | Issue #15027: Rewrite the UTF-32 encoder. It is now 1.6x to 3.5x faster. | Serhiy Storchaka | 2014-01-04 | 1 | -0/+87 |
| | | | |||||
* | | | Remove dead code committed in issue #12892. | Serhiy Storchaka | 2013-11-19 | 1 | -104/+0 |
| | | | |||||
* | | | Issue #12892: The utf-16* and utf-32* codecs now reject (lone) surrogates. | Serhiy Storchaka | 2013-11-19 | 1 | -16/+182 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The utf-16* and utf-32* encoders no longer allow surrogate code points (U+D800-U+DFFF) to be encoded. The utf-32* decoders no longer decode byte sequences that correspond to surrogate code points. The surrogatepass error handler now works with the utf-16* and utf-32* codecs. Based on patches by Victor Stinner and Kang-Hao (Kenny) Lu. | ||||
* | | | #17806: Added keyword-argument support for "tabsize" to str/bytes.expandtabs(). | Ezio Melotti | 2013-11-16 | 1 | -3/+5 |
| | | |