| Commit message (Collapse) | Author | Age | Files | Lines | |
|---|---|---|---|---|---|
| * | Issue #14579: Fix error handling bug in the utf-16 decoder. | Antoine Pitrou | 2012-07-20 | 1 | -2/+26 |
| | | | | | Patch by Serhiy Storchaka. | ||||
| * | Issue #1813: Fix codec lookup and setting/getting locales under Turkish locales. | Antoine Pitrou | 2011-07-24 | 1 | -0/+14 |
| | | |||||
| * | test_codecs now removes the temporay file (created by the test) | Victor Stinner | 2011-05-23 | 1 | -7/+6 |
| | | |||||
| * | Merged revisions 86596 via svnmerge from | Ezio Melotti | 2010-11-21 | 1 | -69/+69 |
| | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r86596 | ezio.melotti | 2010-11-20 21:04:17 +0200 (Sat, 20 Nov 2010) | 1 line #9424: Replace deprecated assert* methods in the Python test suite. ........ | ||||
| * | Issue #8941: decoding big endian UTF-32 data in UCS-2 builds could crash | Antoine Pitrou | 2010-06-11 | 1 | -0/+25 |
| | | | | | | the interpreter with characters outside the Basic Multilingual Plane (higher than 0x10000). | ||||
| * | #8016: add the CP858 codec (approved by Benjamin). (Also add CP720 to the ↵ | Georg Brandl | 2010-05-24 | 1 | -0/+2 |
| | | | | | tests, it was missing there.) | ||||
| * | Issue #6268: More bugfixes about BOM, UTF-16 and UTF-32 | Victor Stinner | 2010-05-22 | 1 | -3/+39 |
| | | | | | | | | * Fix seek() method of codecs.open(), don't write the BOM twice after seek(0) * Fix reset() method of codecs, UTF-16, UTF-32 and StreamWriter classes * test_codecs: use "w+" mode instead of "wt+". "t" mode is not supported by Solaris or Windows, but does it really exist? I found it the in the issue. | ||||
| * | Issue #6268: Fix seek() method of codecs.open(), don't read the BOM twice | Victor Stinner | 2010-05-22 | 1 | -0/+21 |
| | | | | | after seek(0) | ||||
| * | fix escape_encode to return the correct consumed size | Philip Jenvey | 2010-04-05 | 1 | -0/+3 |
| | | |||||
| * | #691291: codecs.open() should not convert end of lines on reading and writing. | Florent Xicluna | 2010-02-26 | 1 | -0/+15 |
| | | |||||
| * | use assert[Not]IsInstance where appropriate | Ezio Melotti | 2010-01-24 | 1 | -2/+2 |
| | | |||||
| * | Issue #6922: Fix an infinite loop when trying to decode an invalid | Georg Brandl | 2009-09-17 | 1 | -0/+12 |
| | | | | | UTF-32 stream with a non-raising error handler like "replace" or "ignore". | ||||
| * | convert usage of fail* to assert* | Benjamin Peterson | 2009-06-30 | 1 | -4/+4 |
| | | |||||
| * | Issue 3739: The unicode-internal encoder now reports the number of *characters* | Walter Dörwald | 2009-05-06 | 1 | -2/+7 |
| | | | | | consumed like any other encoder (instead of the number of bytes). | ||||
| * | The incremental decoder for utf-7 must preserve its state between calls. | Amaury Forgeot d'Arc | 2007-11-20 | 1 | -2/+12 |
| | | | | | | | | Solves issue1460. Might not be a backport candidate: a new API function was added, and some code may rely on details in utf-7.py. | ||||
| * | Fix for #1444: utf_8_sig.StreamReader was (indirectly through decode()) | Walter Dörwald | 2007-11-19 | 1 | -0/+44 |
| | | | | | | calling codecs.utf_8_decode() with final==True, which falled with incomplete byte sequences. Fix and test by James G. Sack. | ||||
| * | Fix typo in comment. | Walter Dörwald | 2007-11-19 | 1 | -1/+1 |
| | | |||||
| * | Backport r57105 and r57145 from the py3k branch: UTF-32 codecs. | Walter Dörwald | 2007-08-17 | 1 | -0/+134 |
| | | |||||
| * | Fix utf-8-sig incremental decoder, which didn't recognise a BOM when the | Walter Dörwald | 2007-04-12 | 1 | -0/+5 |
| | | | | | first chunk fed to the decoder started with a BOM, but was longer than 3 bytes. | ||||
| * | Change decode() so that it works with a buffer (i.e. unicode(..., 'utf-8-sig')) | Walter Dörwald | 2006-11-23 | 1 | -0/+4 |
| | | | | | SF bug #1601501. | ||||
| * | Whitespace normalization. | Tim Peters | 2006-11-03 | 1 | -2/+2 |
| | | |||||
| * | I'm assuming this is correct, it fixes the tests so they pass again | Neal Norwitz | 2006-10-29 | 1 | -0/+1 |
| | | |||||
| * | Add tests for incremental codecs with an errors | Walter Dörwald | 2006-10-29 | 1 | -0/+24 |
| | | | | | argument. | ||||
| * | Bug #1586613: fix zlib and bz2 codecs' incremental en/decoders. | Georg Brandl | 2006-10-29 | 1 | -0/+2 |
| | | |||||
| * | Fix the new EncodedFile test to work with big endian platforms. | Georg Brandl | 2006-10-29 | 1 | -2/+2 |
| | | |||||
| * | Fix codecs.EncodedFile which did not use file_encoding in 2.5.0, and | Georg Brandl | 2006-10-29 | 1 | -1/+28 |
| | | | | | | fix all codecs file wrappers to work correctly with the "with" statement (bug #1586513). | ||||
| * | Verify the crash due to EncodingMap not initialized does not return | Neal Norwitz | 2006-06-13 | 1 | -0/+6 |
| | | |||||
| * | Add a BufferedIncrementalEncoder class that can be used for implementing | Walter Dörwald | 2006-04-14 | 1 | -3/+70 |
| | | | | | | | | | | an incremental encoder that must retain part of the data between calls to the encode() method. Fix the incremental encoder and decoder for the IDNA encoding. This closes SF patch #1453235. | ||||
| * | Call encode()/decode() with final==True as the last call in the | Walter Dörwald | 2006-04-14 | 1 | -0/+4 |
| | | | | | incremental codec tests. | ||||
| * | Add tests for the C APIs PyCodec_IncrementalEncoder() and | Walter Dörwald | 2006-03-18 | 1 | -2/+14 |
| | | | | | PyCodec_IncrementalDecoder(). | ||||
| * | Patch #1436130: codecs.lookup() now returns a CodecInfo object (a subclass | Walter Dörwald | 2006-03-15 | 1 | -0/+57 |
| | | | | | | | | of tuple) that provides incremental decoders and encoders (a way to use stateful codecs without the stream API). Functions codecs.getincrementaldecoder() and codecs.getincrementalencoder() have been added. | ||||
| * | If size is specified, try to read at least size characters. | Walter Dörwald | 2006-03-06 | 1 | -5/+9 |
| | | | | | This is a alternative version of patch #1379332. | ||||
| * | Patch #1177307: UTF-8-Sig codec. | Martin v. Löwis | 2006-01-08 | 1 | -0/+28 |
| | | |||||
| * | Add tests to increase code coverage in Python/codecs.c and Python/exceptions.c. | Walter Dörwald | 2005-11-17 | 1 | -0/+19 |
| | | |||||
| * | Add tests for various error cases and for readbuffer_encode() and | Walter Dörwald | 2005-11-17 | 1 | -2/+67 |
| | | | | | | charbuffer_encode(). This increases code coverage in Modules/_codecsmodule.c from 83% to 95%. | ||||
| * | Part of SF patch #1313939: Speedup charmap decoding by extending | Walter Dörwald | 2005-10-06 | 1 | -1/+36 |
| | | | | | | | | PyUnicode_DecodeCharmap() the accept a unicode string as the mapping argument which is used as a mapping table. This code isn't used by any of the codecs yet. | ||||
| * | SF bug #1251300: On UCS-4 builds the "unicode-internal" codec will now complain | Walter Dörwald | 2005-08-30 | 1 | -1/+50 |
| | | | | | | about illegal code points. The codec now supports PEP 293 style error handlers. (This is a variant of the Nik Haldimann's patch that detects truncated data) | ||||
| * | Make IDNA return an empty string when the input is empty. Fixes #1163178. | Martin v. Löwis | 2005-08-25 | 1 | -0/+6 |
| | | | | | Will backport to 2.4. | ||||
| * | Make attributes and local variables in the StreamReader str objects instead | Walter Dörwald | 2005-07-20 | 1 | -0/+17 |
| | | | | | | of unicode objects, so that codecs that do a str->str decoding won't promote the result to unicode. This fixes SF bug #1241507. | ||||
| * | Update test to the current readline() behaviour. | Walter Dörwald | 2005-04-21 | 1 | -0/+2 |
| | | |||||
| * | Fix for SF bug #1175396: readline() will now read one more character, if | Walter Dörwald | 2005-04-04 | 1 | -9/+82 |
| | | | | | | | | | | | | | the last character read is "\r" (and size is None, i.e. we're allowed to call read() multiple times), so that we can return the correct line ending (this additional character might be a "\n"). If the stream is temporarily exhausted, we might return the wrong line ending (if the last character read is "\r" and the next one (after the byte stream provides more data) is "\n", but at least the atcr member ensure that we get the correct number of lines (i.e. this "\n" will not be treated as another line ending.) | ||||
| * | Reset internal buffers when seek() is called. This fixes SF bug #1156259. | Walter Dörwald | 2005-03-14 | 1 | -0/+15 |
| | | |||||
| * | Fix stupid typo: Don't read from a writer. | Walter Dörwald | 2005-02-08 | 1 | -2/+2 |
| | | |||||
| * | Add a test for UTF-16 reading where the byte sequence doesn't start with | Walter Dörwald | 2005-02-04 | 1 | -0/+9 |
| | | | | | a BOM. | ||||
| * | Fix and test for SF bug #1098990: codec readline() splits lines apart. | Walter Dörwald | 2005-01-10 | 1 | -0/+30 |
| | | |||||
| * | Add a test that checks the basic functionality of every encoding. | Walter Dörwald | 2004-12-29 | 1 | -1/+179 |
| | | |||||
| * | The changes to the stateful codecs in 2.4 resulted in StreamReader.readline() | Walter Dörwald | 2004-12-21 | 1 | -13/+80 |
| | | | | | | | | | | | | | | | trying to return a complete line even if a size parameter was given (see http://www.python.org/sf/1076985). This leads to buffer overflows with long source lines under Windows if e.g. cp1252 is used as the source encoding. This patch reverts the behaviour of readline() to something that behaves more like Python 2.3: If a size parameter is given, read() is called only once. As a side effect of this, readline() now supports all types of linebreaks supported by unicode.splitlines(). Note that the tokenizer is still broken and it's possible to provoke segfaults (see http://www.python.org/sf/1089395). | ||||
| * | Trigger a few error cases in Modules/_codecsmodule.c. | Walter Dörwald | 2004-10-28 | 1 | -0/+14 |
| | | |||||
| * | SF #1048865: Fix a trivial typo that breaks StreamReader.readlines() | Hye-Shik Chang | 2004-10-17 | 1 | -1/+12 |
| | | |||||
| * | SF patch #998993: The UTF-8 and the UTF-16 stateful decoders now support | Walter Dörwald | 2004-09-07 | 1 | -1/+117 |
| | | | | | | | | | | | | decoding incomplete input (when the input stream is temporarily exhausted). codecs.StreamReader now implements buffering, which enables proper readline support for the UTF-16 decoders. codecs.StreamReader.read() has a new argument chars which specifies the number of characters to return. codecs.StreamReader.readline() and codecs.StreamReader.readlines() have a new argument keepends. Trailing "\n"s will be stripped from the lines if keepends is false. Added C APIs PyUnicode_DecodeUTF8Stateful and PyUnicode_DecodeUTF16Stateful. | ||||
