cpython.git - https://github.com/python/cpython.git

	Commit message (Collapse)	Author	Age	Files	Lines
*	Fix my last commit (r81471) about codecs	Victor Stinner	2010-05-22	1	-3/+3
\| \| \| \|	Rememder: don't touch the code just before a commit
*	Issue #6268: More bugfixes about BOM, UTF-16 and UTF-32	Victor Stinner	2010-05-22	1	-7/+13
\| \| \| \| \| \| \|	* Fix seek() method of codecs.open(), don't write the BOM twice after seek(0) * Fix reset() method of codecs, UTF-16, UTF-32 and StreamWriter classes * test_codecs: use "w+" mode instead of "wt+". "t" mode is not supported by Solaris or Windows, but does it really exist? I found it the in the issue.
*	Patch #1436130: codecs.lookup() now returns a CodecInfo object (a subclass	Walter Dörwald	2006-03-15	1	-2/+50
\| \| \| \| \| \| \|	of tuple) that provides incremental decoders and encoders (a way to use stateful codecs without the stream API). Functions codecs.getincrementaldecoder() and codecs.getincrementalencoder() have been added.
*	Reset internal buffers when seek() is called. This fixes SF bug #1156259.	Walter Dörwald	2005-03-14	1	-0/+7
\|
*	SF patch #998993: The UTF-8 and the UTF-16 stateful decoders now support	Walter Dörwald	2004-09-07	1	-39/+25
\| \| \| \| \| \| \| \| \| \| \|	decoding incomplete input (when the input stream is temporarily exhausted). codecs.StreamReader now implements buffering, which enables proper readline support for the UTF-16 decoders. codecs.StreamReader.read() has a new argument chars which specifies the number of characters to return. codecs.StreamReader.readline() and codecs.StreamReader.readlines() have a new argument keepends. Trailing "\n"s will be stripped from the lines if keepends is false. Added C APIs PyUnicode_DecodeUTF8Stateful and PyUnicode_DecodeUTF16Stateful.
*	Whitespace normalization.	Tim Peters	2002-08-08	1	-2/+1
\|
*	Fix for bug #222395: UTF-16 et al. don't handle .readline().	Marc-André Lemburg	2002-04-05	1	-0/+3
\| \| \| \|	They now raise an NotImplementedError to hint to the truth ;-)
*	This patch by Martin v. Loewis changes the UTF-16 codec to only	Marc-André Lemburg	2001-06-19	1	-3/+33
\| \| \| \| \| \| \| \| \| \| \|	write a BOM at the start of the stream and also to only read it as BOM at the start of a stream. Subsequent reading/writing of BOMs will read/write the BOM as ZWNBSP character. This is in sync with the Unicode specifications. Note that UTF-16 files will now have to start with a BOM mark in order to be readable by the codec.
*	Marc-Andre Lemburg: Unicode encodings.	Guido van Rossum	2000-03-10	1	-0/+31