Commit message (Collapse) | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
* | | Ensure that Py_UCS4 is 32 bits and Py_UCS2 is 16 bits | Victor Stinner | 2011-11-20 | 1 | -2/+7 | |
| | | ||||||
* | | Fix misused of "PyUnicodeObject" structure name in unicodeobject.h | Victor Stinner | 2011-11-16 | 1 | -2/+2 | |
| | | ||||||
* | | Port encoders from Py_UNICODE API to unicode object API. | Martin v. Löwis | 2011-11-10 | 1 | -0/+16 | |
| | | ||||||
* | | Make _PyUnicode_FromId return borrowed references. | Martin v. Löwis | 2011-11-07 | 1 | -1/+1 | |
| | | | | | | | | http://mail.python.org/pipermail/python-dev/2011-November/114347.html | |||||
* | | Fix gdb/libpython.py for not ready Unicode strings | Victor Stinner | 2011-11-04 | 1 | -3/+5 | |
| | | | | | | | | | | _PyUnicode_CheckConsistency() checks also hash and length value for not ready Unicode strings. | |||||
* | | Replace PyUnicodeObject type by PyObject | Victor Stinner | 2011-11-03 | 1 | -2/+1 | |
| | | | | | | | | | | * _PyUnicode_CheckConsistency() now takes a PyObject* instead of void* * Remove now useless casts to PyObject* | |||||
* | | Port UCS1 and charmap codecs to new API. | Martin v. Löwis | 2011-11-02 | 1 | -0/+6 | |
| | | ||||||
* | | Drop Py_UCS4_ functions. Closes #13246. | Martin v. Löwis | 2011-10-31 | 1 | -37/+0 | |
| | | ||||||
* | | Replace PyUnicodeObject* by PyObject* where it was irrevelant | Victor Stinner | 2011-10-23 | 1 | -1/+1 | |
| | | | | | | | | | | | | A Unicode string can now be a PyASCIIObject, PyCompactUnicodeObject or PyUnicodeObject. Aliasing a PyASCIIObject* or PyCompactUnicodeObject* to PyUnicodeObject* is wrong | |||||
* | | Simplify _PyUnicode_COMPACT_DATA() macro | Victor Stinner | 2011-10-18 | 1 | -1/+1 | |
| | | ||||||
* | | Issue #12281: Rewrite the MBCS codec to handle correctly replace and ignore | Victor Stinner | 2011-10-18 | 1 | -1/+15 | |
| | | | | | | | | | | error handlers on all Windows versions. The MBCS codec is now supporting all error handlers, instead of only replace to encode and ignore to decode. | |||||
* | | Rename _Py_identifier to _Py_IDENTIFIER. | Martin v. Löwis | 2011-10-14 | 1 | -2/+2 | |
| | | ||||||
* | | Simplify PyUnicode_MAX_CHAR_VALUE | Victor Stinner | 2011-10-12 | 1 | -6/+5 | |
| | | | | | | | | | | | | | | Use PyUnicode_IS_ASCII instead of PyUnicode_IS_COMPACT_ASCII, so the following test can be removed: PyUnicode_DATA(op) == (((PyCompactUnicodeObject *)(op))->utf8) | |||||
* | | Drop extra semicolon. | Martin v. Löwis | 2011-10-09 | 1 | -1/+1 | |
| | | ||||||
* | | Add API for static strings, primarily good for identifiers. | Martin v. Löwis | 2011-10-09 | 1 | -0/+34 | |
| | | | | | | | | Thanks to Konrad Schöbel and Jasper Schulz for helping with the mass-editing. | |||||
* | | Change PyUnicode_KIND to 1,2,4. Drop _KIND_SIZE and _CHARACTER_SIZE. | Martin v. Löwis | 2011-10-07 | 1 | -25/+10 | |
| | | ||||||
* | | Update C API docs for PEP 393. | Georg Brandl | 2011-10-07 | 1 | -1/+1 | |
| | | ||||||
* | | Fix PyUnicode_CHARACTER_SIZE and PyUnicode_KIND_SIZE | Victor Stinner | 2011-10-06 | 1 | -2/+2 | |
| | | ||||||
* | | Fix compilation warnings under 64-bit Windows | Antoine Pitrou | 2011-10-06 | 1 | -2/+3 | |
| | | ||||||
* | | Branch merge | Éric Araujo | 2011-10-06 | 1 | -1/+1 | |
|\ \ | ||||||
| * | | Fix typo | Éric Araujo | 2011-10-04 | 1 | -1/+1 | |
| | | | ||||||
* | | | rephrase PyUnicode_1BYTE_KIND documentation | Victor Stinner | 2011-10-05 | 1 | -6/+7 | |
| | | | ||||||
* | | | Don't check for the maximum character when copying from unicodeobject.c | Victor Stinner | 2011-10-05 | 1 | -0/+7 | |
| | | | | | | | | | | | | | | | | | | | | | | | | * Create copy_characters() function which doesn't check for the maximum character in release mode * _PyUnicode_CheckConsistency() is no more static to be able to use it in _PyUnicode_FormatAdvanced() (in formatter_unicode.c) * _PyUnicode_CheckConsistency() checks the string hash | |||||
* | | | Add asciilib: similar to ucs1, ucs2 and ucs4 library, but specialized to ASCII | Victor Stinner | 2011-10-05 | 1 | -0/+1 | |
| | | | | | | | | | | | | | | | | | | ucs1, ucs2 and ucs4 libraries have to scan created substring to find the maximum character, whereas it is not need to ASCII strings. Because ASCII strings are common, it is useful to optimize ASCII. | |||||
* | | | Document requierements of Unicode kinds | Victor Stinner | 2011-10-04 | 1 | -4/+20 | |
| | | | ||||||
* | | | More fixes. | Georg Brandl | 2011-10-05 | 1 | -2/+2 | |
| | | | ||||||
* | | | Fix a few typos in the unicode header. | Georg Brandl | 2011-10-05 | 1 | -11/+11 | |
| | | | ||||||
* | | | Fix grammar. | Georg Brandl | 2011-10-05 | 1 | -1/+1 | |
| | | | ||||||
* | | | Speedup str[a:b] and PyUnicode_FromKindAndData | Victor Stinner | 2011-10-05 | 1 | -0/+2 | |
|/ / | | | | | | | | | | | | | | | * str[a:b] doesn't scan the string for the maximum character if the string is ascii only * PyUnicode_FromKindAndData() stops if we are sure that we cannot use a shorter character type. For example, _PyUnicode_FromUCS1() stops if we have at least one character in range U+0080-U+00FF | |||||
* | | Complete documentation of compact ASCII strings | Victor Stinner | 2011-10-03 | 1 | -3/+6 | |
| | | ||||||
* | | Document utf8_length and wstr_length states | Victor Stinner | 2011-10-03 | 1 | -7/+12 | |
| | | | | | | | | Ensure these states with assertions in _PyUnicode_CheckConsistency(). | |||||
* | | Unicode: document when the wstr pointer is shared with data | Victor Stinner | 2011-10-03 | 1 | -1/+7 | |
| | | | | | | | | Add also related assertions to _PyUnicode_CheckConsistency(). | |||||
* | | Improve string forms and PyUnicode_Resize() documentation | Victor Stinner | 2011-10-03 | 1 | -13/+22 | |
| | | | | | | | | | | Remove also the FIXME for resize_copy(): as discussed with Martin, copy the string on resize if the string is not resizable is just fine. | |||||
* | | _PyUnicode_CheckConsistency() checks utf8 field consistency | Victor Stinner | 2011-10-03 | 1 | -0/+2 | |
| | | ||||||
* | | PyUnicode_Ready() now sets ascii=1 if maxchar < 128 | Victor Stinner | 2011-10-03 | 1 | -16/+25 | |
| | | | | | | | | | | ascii=1 is no more reserved to PyASCIIObject. Use PyUnicode_IS_COMPACT_ASCII(obj) to check if obj is a PyASCIIObject (as before). | |||||
* | | Add _PyUnicode_CheckConsistency() macro to help debugging | Victor Stinner | 2011-10-03 | 1 | -0/+46 | |
| | | | | | | | | | | | | * Document Unicode string states * Use _PyUnicode_CheckConsistency() to ensure that objects are always consistent. | |||||
* | | PyUnicode_READ_CHAR() ensures that the string is ready | Victor Stinner | 2011-10-02 | 1 | -8/+10 | |
| | | ||||||
* | | Use Py_UCS1 instead of unsigned char in unicodeobject.h | Victor Stinner | 2011-10-01 | 1 | -2/+2 | |
| | | ||||||
* | | PyUnicode_WriteChar() raises IndexError on invalid index | Victor Stinner | 2011-10-01 | 1 | -1/+3 | |
| | | | | | | | | | | PyUnicode_WriteChar() raises also a ValueError if the string has more than 1 reference. | |||||
* | | _PyUnicode_AsKind() is *not* part of the stable ABI | Victor Stinner | 2011-10-01 | 1 | -0/+2 | |
| | | ||||||
* | | PyUnicode_CHARACTER_SIZE(): add a reference to PyUnicode_KIND_SIZE() | Victor Stinner | 2011-10-01 | 1 | -2/+5 | |
| | | ||||||
* | | Add PyUnicode_Copy() function, include it to the public API | Victor Stinner | 2011-09-30 | 1 | -0/+5 | |
| | | ||||||
* | | _PyUnicode_Ready() cannot be used on ready strings anymore | Victor Stinner | 2011-09-29 | 1 | -7/+9 | |
| | | | | | | | | | | | | * Change its prototype: PyObject* instead of PyUnicodeoObject*. * Remove an old assertion, the result of PyUnicode_READY (_PyUnicode_Ready) must be checked instead | |||||
* | | Move _PyUnicode_UTF8() and _PyUnicode_UTF8_LENGTH() outside unicodeobject.h | Victor Stinner | 2011-09-29 | 1 | -10/+0 | |
| | | | | | | | | Move these macros to unicodeobject.c | |||||
* | | Add a note in PyUnicode_CopyCharacters() doc: it doesn't write null character | Victor Stinner | 2011-09-29 | 1 | -3/+5 | |
| | | | | | | | | Cleanup also the code (avoid the goto). | |||||
* | | PyUnicode_CopyCharacters() fails if 'to' has more than 1 reference | Victor Stinner | 2011-09-28 | 1 | -1/+2 | |
| | | ||||||
* | | Mark _PyUnicode_FindMaxCharAndNumSurrogatePairs() as private | Victor Stinner | 2011-09-28 | 1 | -12/+0 | |
| | | ||||||
* | | Strip trailing spaces in unicodeobject.[ch] | Victor Stinner | 2011-09-28 | 1 | -5/+5 | |
| | | ||||||
* | | PyUnicode_CopyCharacters() checks for buffer and character overflow | Victor Stinner | 2011-09-28 | 1 | -3/+15 | |
| | | | | | | | | It now returns the number of written characters on success. | |||||
* | | Mark PyUnicode_CONVERT_BYTES as private | Victor Stinner | 2011-09-28 | 1 | -15/+0 | |
| | |