cpython.git - https://github.com/python/cpython.git

	Commit message (Collapse)	Author	Age	Files	Lines
*	The incremental decoder for utf-7 must preserve its state between calls.	Amaury Forgeot d'Arc	2007-11-20	1	-2/+12
\| \| \| \| \| \| \|	Solves issue1460. Might not be a backport candidate: a new API function was added, and some code may rely on details in utf-7.py.
*	Fix for #1444: utf_8_sig.StreamReader was (indirectly through decode())	Walter Dörwald	2007-11-19	1	-0/+44
\| \| \| \| \|	calling codecs.utf_8_decode() with final==True, which falled with incomplete byte sequences. Fix and test by James G. Sack.
*	Fix typo in comment.	Walter Dörwald	2007-11-19	1	-1/+1
\|
*	Backport r57105 and r57145 from the py3k branch: UTF-32 codecs.	Walter Dörwald	2007-08-17	1	-0/+134
\|
*	Fix utf-8-sig incremental decoder, which didn't recognise a BOM when the	Walter Dörwald	2007-04-12	1	-0/+5
\| \| \| \|	first chunk fed to the decoder started with a BOM, but was longer than 3 bytes.
*	Change decode() so that it works with a buffer (i.e. unicode(..., 'utf-8-sig'))	Walter Dörwald	2006-11-23	1	-0/+4
\| \| \| \|	SF bug #1601501.
*	Whitespace normalization.	Tim Peters	2006-11-03	1	-2/+2
\|
*	I'm assuming this is correct, it fixes the tests so they pass again	Neal Norwitz	2006-10-29	1	-0/+1
\|
*	Add tests for incremental codecs with an errors	Walter Dörwald	2006-10-29	1	-0/+24
\| \| \| \|	argument.
*	Bug #1586613: fix zlib and bz2 codecs' incremental en/decoders.	Georg Brandl	2006-10-29	1	-0/+2
\|
*	Fix the new EncodedFile test to work with big endian platforms.	Georg Brandl	2006-10-29	1	-2/+2
\|
*	Fix codecs.EncodedFile which did not use file_encoding in 2.5.0, and	Georg Brandl	2006-10-29	1	-1/+28
\| \| \| \| \|	fix all codecs file wrappers to work correctly with the "with" statement (bug #1586513).
*	Verify the crash due to EncodingMap not initialized does not return	Neal Norwitz	2006-06-13	1	-0/+6
\|
*	Add a BufferedIncrementalEncoder class that can be used for implementing	Walter Dörwald	2006-04-14	1	-3/+70
\| \| \| \| \| \| \| \| \|	an incremental encoder that must retain part of the data between calls to the encode() method. Fix the incremental encoder and decoder for the IDNA encoding. This closes SF patch #1453235.
*	Call encode()/decode() with final==True as the last call in the	Walter Dörwald	2006-04-14	1	-0/+4
\| \| \| \|	incremental codec tests.
*	Add tests for the C APIs PyCodec_IncrementalEncoder() and	Walter Dörwald	2006-03-18	1	-2/+14
\| \| \| \|	PyCodec_IncrementalDecoder().
*	Patch #1436130: codecs.lookup() now returns a CodecInfo object (a subclass	Walter Dörwald	2006-03-15	1	-0/+57
\| \| \| \| \| \| \|	of tuple) that provides incremental decoders and encoders (a way to use stateful codecs without the stream API). Functions codecs.getincrementaldecoder() and codecs.getincrementalencoder() have been added.
*	If size is specified, try to read at least size characters.	Walter Dörwald	2006-03-06	1	-5/+9
\| \| \| \|	This is a alternative version of patch #1379332.
*	Patch #1177307: UTF-8-Sig codec.	Martin v. Löwis	2006-01-08	1	-0/+28
\|
*	Add tests to increase code coverage in Python/codecs.c and Python/exceptions.c.	Walter Dörwald	2005-11-17	1	-0/+19
\|
*	Add tests for various error cases and for readbuffer_encode() and	Walter Dörwald	2005-11-17	1	-2/+67
\| \| \| \| \|	charbuffer_encode(). This increases code coverage in Modules/_codecsmodule.c from 83% to 95%.
*	Part of SF patch #1313939: Speedup charmap decoding by extending	Walter Dörwald	2005-10-06	1	-1/+36
\| \| \| \| \| \| \|	PyUnicode_DecodeCharmap() the accept a unicode string as the mapping argument which is used as a mapping table. This code isn't used by any of the codecs yet.
*	SF bug #1251300: On UCS-4 builds the "unicode-internal" codec will now complain	Walter Dörwald	2005-08-30	1	-1/+50
\| \| \| \| \|	about illegal code points. The codec now supports PEP 293 style error handlers. (This is a variant of the Nik Haldimann's patch that detects truncated data)
*	Make IDNA return an empty string when the input is empty. Fixes #1163178.	Martin v. Löwis	2005-08-25	1	-0/+6
\| \| \| \|	Will backport to 2.4.
*	Make attributes and local variables in the StreamReader str objects instead	Walter Dörwald	2005-07-20	1	-0/+17
\| \| \| \| \|	of unicode objects, so that codecs that do a str->str decoding won't promote the result to unicode. This fixes SF bug #1241507.
*	Update test to the current readline() behaviour.	Walter Dörwald	2005-04-21	1	-0/+2
\|
*	Fix for SF bug #1175396: readline() will now read one more character, if	Walter Dörwald	2005-04-04	1	-9/+82
\| \| \| \| \| \| \| \| \| \| \| \|	the last character read is "\r" (and size is None, i.e. we're allowed to call read() multiple times), so that we can return the correct line ending (this additional character might be a "\n"). If the stream is temporarily exhausted, we might return the wrong line ending (if the last character read is "\r" and the next one (after the byte stream provides more data) is "\n", but at least the atcr member ensure that we get the correct number of lines (i.e. this "\n" will not be treated as another line ending.)
*	Reset internal buffers when seek() is called. This fixes SF bug #1156259.	Walter Dörwald	2005-03-14	1	-0/+15
\|
*	Fix stupid typo: Don't read from a writer.	Walter Dörwald	2005-02-08	1	-2/+2
\|
*	Add a test for UTF-16 reading where the byte sequence doesn't start with	Walter Dörwald	2005-02-04	1	-0/+9
\| \| \| \|	a BOM.
*	Fix and test for SF bug #1098990: codec readline() splits lines apart.	Walter Dörwald	2005-01-10	1	-0/+30
\|
*	Add a test that checks the basic functionality of every encoding.	Walter Dörwald	2004-12-29	1	-1/+179
\|
*	The changes to the stateful codecs in 2.4 resulted in StreamReader.readline()	Walter Dörwald	2004-12-21	1	-13/+80
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	trying to return a complete line even if a size parameter was given (see http://www.python.org/sf/1076985). This leads to buffer overflows with long source lines under Windows if e.g. cp1252 is used as the source encoding. This patch reverts the behaviour of readline() to something that behaves more like Python 2.3: If a size parameter is given, read() is called only once. As a side effect of this, readline() now supports all types of linebreaks supported by unicode.splitlines(). Note that the tokenizer is still broken and it's possible to provoke segfaults (see http://www.python.org/sf/1089395).
*	Trigger a few error cases in Modules/_codecsmodule.c.	Walter Dörwald	2004-10-28	1	-0/+14
\|
*	SF #1048865: Fix a trivial typo that breaks StreamReader.readlines()	Hye-Shik Chang	2004-10-17	1	-1/+12
\|
*	SF patch #998993: The UTF-8 and the UTF-16 stateful decoders now support	Walter Dörwald	2004-09-07	1	-1/+117
\| \| \| \| \| \| \| \| \| \| \|	decoding incomplete input (when the input stream is temporarily exhausted). codecs.StreamReader now implements buffering, which enables proper readline support for the UTF-16 decoders. codecs.StreamReader.read() has a new argument chars which specifies the number of characters to return. codecs.StreamReader.readline() and codecs.StreamReader.readlines() have a new argument keepends. Trailing "\n"s will be stripped from the lines if keepends is false. Added C APIs PyUnicode_DecodeUTF8Stateful and PyUnicode_DecodeUTF16Stateful.
*	Add generic codecs.encode() and .decode() APIs that don't impose	Marc-André Lemburg	2004-07-10	1	-1/+11
\| \| \| \|	any restriction on the return type (like unicode.encode() et al. do).
*	Whitespace normalization.	Tim Peters	2004-07-08	1	-1/+1
\|
*	Add test case for unicode(somestring, "idna").	Martin v. Löwis	2004-03-24	1	-1/+6
\|
*	Combine the functionality of test_support.run_unittest()	Walter Dörwald	2003-05-01	1	-7/+7
\| \| \| \| \| \| \| \| \| \|	and test_support.run_classtests() into run_unittest() and use it wherever possible. Also don't use "from test.test_support import ...", but "from test import test_support" in a few spots. From SF patch #662807.
*	Whitespace normalization.	Tim Peters	2003-04-24	1	-2/+2
\|
*	Skip nameprep test 3.43, as we do allow unassigned characters. The test	Martin v. Löwis	2003-04-18	1	-2/+4
\| \| \| \|	fails only in UCS-2 mode, since it tests a non-BMP character.
*	Implement IDNA (Internationalized Domain Names in Applications).	Martin v. Löwis	2003-04-18	1	-0/+296
\|
*	Fix for [ 543344 ] Interpreter crashes when recoding; suggested	Marc-André Lemburg	2003-02-04	1	-0/+9
\| \| \| \| \| \|	by Michael Stone (mbrierst). Python 2.1.4, 2.2.2 candidate.
*	Check whether a string resize is necessary at the end	Walter Dörwald	2002-09-03	1	-1/+8
\| \| \| \| \| \| \| \| \|	of PyString_DecodeEscape(). This prevents a call to _PyString_Resize() for the empty string, which would result in a PyErr_BadInternalCall(), because the empty string has more than one reference. This closes SF bug http://www.python.org/sf/603937
*	Get rid of relative imports in all unittests. Now anything that	Barry Warsaw	2002-07-23	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \|	imports e.g. test_support must do so using an absolute package name such as "import test.test_support" or "from test import test_support". This also updates the README in Lib/test, and gets rid of the duplicate data dirctory in Lib/test/data (replaced by Lib/email/test/data). Now Tim and Jack can have at it. :)
*	Change the PyUnit-based tests to use the test_main() approach. This	Fred Drake	2001-09-20	1	-1/+7
\| \| \| \| \|	allows using the tests with unittest.py as a script. The tests will still run when run as a script themselves.
*	Test by Martin v. Loewis for the new UTF-16 codec handling of BOM	Marc-André Lemburg	2001-06-19	1	-0/+25
	marks.