cpython.git - https://github.com/python/cpython.git

	Commit message (Collapse)	Author	Age	Files	Lines
*	Add a BufferedIncrementalEncoder class that can be used for implementing	Walter Dörwald	2006-04-14	1	-0/+27
\| \| \| \| \| \| \| \| \|	an incremental encoder that must retain part of the data between calls to the encode() method. Fix the incremental encoder and decoder for the IDNA encoding. This closes SF patch #1453235.
*	Fix wrong attribute name.	Walter Dörwald	2006-04-14	1	-1/+1
\|
*	Change raise statement to PEP 8 style.	Walter Dörwald	2006-03-18	1	-2/+1
\|
*	Add some versionadded info to new incremental codec docs and fix doco nits.	Neal Norwitz	2006-03-16	1	-2/+2
\|
*	Patch #1436130: codecs.lookup() now returns a CodecInfo object (a subclass	Walter Dörwald	2006-03-15	1	-11/+172
\| \| \| \| \| \| \|	of tuple) that provides incremental decoders and encoders (a way to use stateful codecs without the stream API). Functions codecs.getincrementaldecoder() and codecs.getincrementalencoder() have been added.
*	If size is specified, try to read at least size characters.	Walter Dörwald	2006-03-06	1	-1/+4
\| \| \| \|	This is a alternative version of patch #1379332.
*	Whitespace normalization.	Tim Peters	2005-12-25	1	-2/+2
\|
*	Patch #1268314: Cache lines in StreamReader.readlines for performance.	Martin v. Löwis	2005-09-18	1	-0/+37
\| \| \| \|	Will backport to Python 2.4.
*	SF bug #1235646: codecs.StreamRecoder.next() now reencodes the data it reads	Walter Dörwald	2005-09-01	1	-1/+3
\| \| \| \| \|	from the input stream, so that the output is a byte string in the correct encoding instead of a unicode string.
*	Return complete lines from codec stream readers	Martin v. Löwis	2005-08-24	1	-3/+17
\| \| \| \| \| \|	even if there is an exception in later lines, resulting in correct line numbers for decoding errors in source code. Fixes #1178484. Will backport to 2.4.
*	Make attributes and local variables in the StreamReader str objects instead	Walter Dörwald	2005-07-20	1	-5/+7
\| \| \| \| \|	of unicode objects, so that codecs that do a str->str decoding won't promote the result to unicode. This fixes SF bug #1241507.
*	Fix comment.	Walter Dörwald	2005-04-21	1	-2/+2
\|
*	If the data read from the bytestream in readline() ends in a '\r' read one more	Walter Dörwald	2005-04-21	1	-12/+4
\| \| \| \| \| \| \| \| \| \| \|	byte, even if the user has passed a size parameter. This extra byte shouldn't cause a buffer overflow in the tokenizer. The original plan was to return a line ending in '\r', which might be recognizable as a complete line and skip any '\n' that was read afterwards. Unfortunately this didn't work, as the tokenizer only recognizes '\n' as line ends, which in turn lead to joined lines and SyntaxErrors, so this special treatment of a split '\r\n' has been dropped. (It can only happen with a temporarily exhausted bytestream now anyway.) Fixes parts of SF bugs #1163244 and #1175396.
*	Fix typos.	Walter Dörwald	2005-04-04	1	-2/+2
\|
*	Fix for SF bug #1175396: readline() will now read one more character, if	Walter Dörwald	2005-04-04	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \|	the last character read is "\r" (and size is None, i.e. we're allowed to call read() multiple times), so that we can return the correct line ending (this additional character might be a "\n"). If the stream is temporarily exhausted, we might return the wrong line ending (if the last character read is "\r" and the next one (after the byte stream provides more data) is "\n", but at least the atcr member ensure that we get the correct number of lines (i.e. this "\n" will not be treated as another line ending.)
*	typo	Skip Montanaro	2005-03-16	1	-1/+1
\|
*	Add default value for "whence" argument.	Walter Dörwald	2005-03-14	1	-1/+1
\|
*	Reset internal buffers when seek() is called. This fixes SF bug #1156259.	Walter Dörwald	2005-03-14	1	-1/+11
\|
*	Build with --disable-unicode again. Fixes #1158607.	Martin v. Löwis	2005-03-08	1	-5/+13
\| \| \| \|	Will backport to 2.4.
*	Fix and test for SF bug #1098990: codec readline() splits lines apart.	Walter Dörwald	2005-01-10	1	-2/+2
\|
*	The changes to the stateful codecs in 2.4 resulted in StreamReader.readline()	Walter Dörwald	2004-12-21	1	-30/+41
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	trying to return a complete line even if a size parameter was given (see http://www.python.org/sf/1076985). This leads to buffer overflows with long source lines under Windows if e.g. cp1252 is used as the source encoding. This patch reverts the behaviour of readline() to something that behaves more like Python 2.3: If a size parameter is given, read() is called only once. As a side effect of this, readline() now supports all types of linebreaks supported by unicode.splitlines(). Note that the tokenizer is still broken and it's possible to provoke segfaults (see http://www.python.org/sf/1089395).
*	SF #1048865: Fix a trivial typo that breaks StreamReader.readlines()	Hye-Shik Chang	2004-10-17	1	-1/+1
\|
*	SF patch #998993: The UTF-8 and the UTF-16 stateful decoders now support	Walter Dörwald	2004-09-07	1	-43/+69
\| \| \| \| \| \| \| \| \| \| \|	decoding incomplete input (when the input stream is temporarily exhausted). codecs.StreamReader now implements buffering, which enables proper readline support for the UTF-16 decoders. codecs.StreamReader.read() has a new argument chars which specifies the number of characters to return. codecs.StreamReader.readline() and codecs.StreamReader.readlines() have a new argument keepends. Trailing "\n"s will be stripped from the lines if keepends is false. Added C APIs PyUnicode_DecodeUTF8Stateful and PyUnicode_DecodeUTF16Stateful.
*	Ignore sizehint argument. Fixes SF #844561.	Marc-André Lemburg	2004-02-26	1	-10/+4
\|
*	Fix typos.	Walter Dörwald	2003-02-02	1	-4/+4
\|
*	sys was already imported, remove second import	Neal Norwitz	2002-12-30	1	-2/+0
\|
*	Patch to make _codecs a builtin module. This is necessary since	Marc-André Lemburg	2002-12-12	1	-5/+15
\| \| \| \| \| \| \|	Python 2.3 will support source code encodings which rely on the builtin codecs being available to the parser. Remove struct dependency from codecs.py
*	Add missing documentation for the PEP 293 functionality to	Walter Dörwald	2002-11-19	1	-7/+22
\| \| \| \|	the codecs docstrings.
*	Add next() and __iter__() methods to StreamReader, StreamReaderWriter	Walter Dörwald	2002-11-06	1	-0/+27
\| \| \| \| \| \|	and StreamRecoder. This closes SF bug #634246.
*	PEP 293 implemention (from SF patch http://www.python.org/sf/432401)	Walter Dörwald	2002-09-02	1	-1/+12
\|
*	Add constants BOM_UTF8, BOM_UTF16, BOM_UTF16_LE, BOM_UTF16_BE,	Walter Dörwald	2002-06-04	1	-17/+32
\| \| \| \| \| \| \| \| \| \|	BOM_UTF32, BOM_UTF32_LE and BOM_UTF32_BE that represent the Byte Order Mark in UTF-8, UTF-16 and UTF-32 encodings for little and big endian systems. The old names BOM32_* and BOM64_* were off by a factor of 2. This closes SF bug http://www.python.org/sf/555360
*	SF 563203. Replaced 'has_key()' with 'in'.	Raymond Hettinger	2002-06-01	1	-1/+1
\|
*	Set default value for readlines.sizehint to None. Change needed for 2.2.1	Martin v. Löwis	2002-03-05	1	-1/+1
\| \| \| \|	as well.
*	Added new helpers for easy access to codecs. Docs will follow.	Marc-André Lemburg	2001-09-19	1	-0/+42
\|
*	Fix typo in comment	Andrew M. Kuchling	2001-09-18	1	-1/+1
\|
*	Patch #444359: Remove unused imports.	Martin v. Löwis	2001-08-02	1	-1/+1
\|
*	Add dead imports of modules that are "magically" imported.	Martin v. Löwis	2001-07-31	1	-0/+6
\|
*	Whitespace normalization.	Tim Peters	2001-05-29	1	-1/+1
\|
*	Moved the encoding map building logic from the individual mapping	Marc-André Lemburg	2001-05-16	1	-0/+21
\| \| \| \| \| \|	codec files to codecs.py and added logic so that multi mappings in the decoding maps now result in mappings to None (undefined mapping) in the encoding maps.
*	Just changed "x,y" to "x, y" everywhere (i.e., inserted horizontal space	Tim Peters	2001-05-15	1	-37/+34
\| \| \| \|	after commas that didn't have any).
*	added __all__ lists to a number of Python modules	Skip Montanaro	2001-01-20	1	-0/+3
\| \| \| \| \| \| \| \|	added test script and expected output file as well this closes patch 103297. __all__ attributes will be added to other modules without first submitting a patch, just adding the necessary line to the test script to verify more-or-less correct implementation.
*	Whitespace normalization.	Tim Peters	2001-01-14	1	-7/+7
\|
*	This patch changes the default behaviour of the builtin charmap	Marc-André Lemburg	2001-01-03	1	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	codec to not apply Latin-1 mappings for keys which are not found in the mapping dictionaries, but instead treat them as undefined mappings. The patch was originally written by Martin v. Loewis with some additional (cosmetic) changes and an updated test script by Marc-Andre Lemburg. The standard codecs were recreated from the most current files available at the Unicode.org site using the Tools/scripts/gencodec.py tool. This patch closes the bugs #116285 and #119960.
*	(Patch #102698) Fix for a bug reported by Wade Leftwich:	Andrew M. Kuchling	2000-12-10	1	-4/+4
\| \| \| \|	StreamReader ignores the 'errors' parameter passed to its constructor
*	Remove redundent information from a docstring.	Fred Drake	2000-10-02	1	-3/+0
\|
*	Spelling fixes supplied by Rob W. W. Hooft. All these are fixes in either	Thomas Wouters	2000-07-16	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	comments, docstrings or error messages. I fixed two minor things in test_winreg.py ("didn't" -> "Didn't" and "Didnt" -> "Didn't"). There is a minor style issue involved: Guido seems to have preferred English grammar (behaviour, honour) in a couple places. This patch changes that to American, which is the more prominent style in the source. I prefer English myself, so if English is preferred, I'd be happy to supply a patch myself ;)
*	Marc-Andre Lemburg <mal@lemburg.com>:	Marc-André Lemburg	2000-06-21	1	-1/+6
\| \| \| \|	Made codecs.open() default to 'rb' as file mode.
*	Marc-Andre Lemburg:	Guido van Rossum	2000-05-01	1	-2/+2
\| \| \| \| \|	The two methods .readline() and .readlines() in StreamReaderWriter didn't define the self argument. Found by Tom Emerson.
*	M.-A. Lemburg <mal@lemburg.com>:	Fred Drake	2000-04-13	1	-4/+40
\| \| \| \|	Added more documentation. Clarified some existing comments.
*	Deleted trailing whitespace. This is really a way to be able to add	Guido van Rossum	2000-04-11	1	-14/+14
\| \| \| \| \| \| \| \| \|	a missing part of the previous checkin message: Marc-Andre Lemburg: Added encoding name attributes to wrapper classes which allow applications to check the used encoding names.