cpython.git - https://github.com/python/cpython.git

	Commit message (Collapse)	Author	Age	Files	Lines
*	Backport stringobject.c 2.194 and unicodeobject.c 2.172:	Guido van Rossum	2002-10-11	1	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \|	Fix a nasty endcase reported by Armin Rigo in SF bug 618623: '%2147483647d' % -123 segfaults. This was because an integer overflow in a comparison caused the string resize to be skipped. After fixing the overflow, this could call _PyString_Resize() with a negative size, so I (1) test for that and raise MemoryError instead; (2) also added a test for negative newsize to _PyString_Resize(), raising SystemError as for all bad arguments. An identical bug existed in unicodeobject.c, of course.
*	Backport the relevant part of 2.192:	Guido van Rossum	2002-10-11	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \|	The string formatting code has a test to switch to Unicode when %s sees a Unicode argument. Unfortunately this test was also executed for %r, because %s and %r share almost all of their code. This meant that, if u is a unicode object while repr(u) is an 8-bit string containing ASCII characters, '%r' % u is a unicode string containing only ASCII characters! Fixed by executing the test only for %s.
*	Backport from trunk:	Guido van Rossum	2002-09-23	1	-1/+2
\| \| \| \| \| \| \| \|	unicodeobject.c 2.169 stringobject.c 2.189 Fix warnings on 64-bit platforms about casts from pointers to ints. Two of these were real bugs.
*	Fix some endcase bugs in unicode rfind()/rindex() and endswith().	Guido van Rossum	2002-08-20	1	-3/+3
\| \| \| \| \| \|	These were reported and fixed by Inyeol Lee in SF bug 595350. The endswith() bug is already fixed in 2.3; I'll fix the others in 2.3 next.
*	SF 582071 clarified the .split() method's docstring to note that sep=None	Raymond Hettinger	2002-08-05	1	-2/+2
\| \| \| \|	will trigger splitting on any whitespace.
*	Patch #554716: Use __va_copy where available.	Martin v. Löwis	2002-07-28	1	-0/+4
\|
*	Backport checkin:	Walter Dörwald	2002-05-13	1	-0/+6
\| \| \| \| \| \| \|	Add #ifdef PY_USING_UNICODE sections, so that stringobject.c compiles again with --disable-unicode. Fixes SF bug http://www.python.org/sf/554912
*	backport tim_one's patch:	Anthony Baxter	2002-04-30	1	-3/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Repair widespread misuse of _PyString_Resize. Since it's clear people don't understand how this function works, also beefed up the docs. The most common usage error is of this form (often spread out across gotos): if (_PyString_Resize(&s, n) < 0) { Py_DECREF(s); s = NULL; goto outtahere; } The error is that if _PyString_Resize runs out of memory, it automatically decrefs the input string object s (which also deallocates it, since its refcount must be 1 upon entry), and sets s to NULL. So if the "if" branch ever triggers, it's an error to call Py_DECREF(s): s is already NULL! A correct way to write the above is the simpler (and intended) if (_PyString_Resize(&s, n) < 0) goto outtahere; Bugfix candidate. Original patch(es): python/dist/src/Objects/fileobject.c:2.161 python/dist/src/Objects/stringobject.c:2.161 python/dist/src/Objects/unicodeobject.c:2.147
*	Backport checkin:	Walter Dörwald	2002-04-22	1	-12/+28
\| \| \| \| \| \| \| \| \| \|	Apply patch diff.txt from SF feature request http://www.python.org/sf/444708 This adds the optional argument for str.strip to unicode.strip too and makes it possible to call str.strip with a unicode argument and unicode.strip with a str argument.
*	Backport the following changes:	Walter Dörwald	2002-04-22	1	-16/+71
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Misc/NEWS 1.387->1.388 Lib/test/string_tests.py 1.10->1.11, 1.12->1.14, Lib/test/test_unicode.py 1.50->1.51, 1.53->1.54, 1.55->1.56 Lib/test/test_string.py 1.15->1.16 Lib/string.py 1.61->1.63 Lib/test/test_userstring.py 1.5->1.6, 1.11, 1.12 Objects/stringobject.c 2.156->2.159 Objects/unicodeobject.c 2.137->2.139 Doc/lib/libstdtypes.tec 1.87->1.88 Add a method zfill to str, unicode and UserString and change Lib/string.py accordingly (see SF patch http://www.python.org/sf/536241) This also adds Guido's fix to test_userstring.py and the subinstance checks in test_string.py and test_unicode.py.
*	backport gvanrossum's patch:	Anthony Baxter	2002-04-18	1	-15/+86
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Partially implement SF feature request 444708. Add optional arg to string methods strip(), lstrip(), rstrip(). The optional arg specifies characters to delete. Also for UserString. Still to do: - Misc/NEWS - LaTeX docs (I did the docstrings though) - Unicode methods, and Unicode support in the string methods. Original patches were: python/dist/src/Objects/stringobject.c:2.156
*	SF patch #491049 (David Jacobs): Small PyString_FromString optimization	Guido van Rossum	2001-12-10	1	-1/+1
\| \| \| \| \| \|	PyString_FromString(): Since the length of the string is already being stored in size, changed the strcpy() to a memcpy() for a small speed improvement.
*	PyString_FromString: this requires its argument be non-NULL, but doesn't	Tim Peters	2001-12-06	1	-1/+4
\| \| \| \|	check it. Added an assert() to that effect.
*	Little stuff.	Jeremy Hylton	2001-12-06	1	-8/+9
\| \| \| \| \| \| \| \| \| \| \|	Add a missing DECREF in an obscure corner. If the str() or repr() of an object passed to a string interpolation -- e.g. "%s" % obj -- returns a non-string, the returned object was leaked. Repair an indentation glitch. Replace a bunch of PyString_AsString() calls (and their ilk) with macros.
*	Add more inline documentation, as contributed in #487906.	Martin v. Löwis	2001-12-03	1	-3/+8
\|
*	PyString_FromFormatV, string_repr: document why these use sprintf	Tim Peters	2001-12-03	1	-5/+16
\| \| \| \|	instead of PyOS_snprintf; add some relevant comments and asserts.
*	Patch 487906: update inline docs.	Martin v. Löwis	2001-12-02	1	-13/+21
\|
*	sprintf -> PyOS_snprintf in some "obviously safe" cases.	Tim Peters	2001-11-28	1	-4/+8
\| \| \| \| \|	Also changed <>-style #includes to ""-style in some places where the former didn't make sense.
*	Make the error message for unsupported operand types cleaner, in	Guido van Rossum	2001-10-22	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	response to a message by Laura Creighton on c.l.py. E.g. >>> 0+'' TypeError: unsupported operand types for +: 'int' and 'str' (previously this did not mention the operand types) >>> ''+0 TypeError: cannot concatenate 'str' and 'int' objects
*	SF bug [#468061] __str__ ignored in str subclass.	Tim Peters	2001-10-16	1	-2/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	object.c, PyObject_Str: Don't try to optimize anything except exact string objects here; in particular, let str subclasses go thru tp_str, same as non-str objects. This allows overrides of tp_str to take effect. stringobject.c: + string_print (str's tp_print): If the argument isn't an exact string object, get one from PyObject_Str. + string_str (str's tp_str): Make a genuine-string copy of the object if it's of a proper str subclass type. str() applied to a str subclass that doesn't override __str__ ends up here. test_descr.py: New str_of_str_subclass() test.
*	Remove extra "]" in splitlines() docstring.	Fred Drake	2001-10-13	1	-1/+1
\| \| \| \|	Reported by Neal Norwitz.
*	Enable GC for new-style instances. This touches lots of files, since	Guido van Rossum	2001-10-05	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	many types were subclassable but had a xxx_dealloc function that called PyObject_DEL(self) directly instead of deferring to self->ob_type->tp_free(self). It is permissible to set tp_free in the type object directly to _PyObject_Del, for non-GC types, or to _PyObject_GC_Del, for GC types. Still, PyObject_DEL was a tad faster, so I'm fearing that our pystone rating is going down again. I'm not sure if doing something like void xxx_dealloc(PyObject *self) { if (PyXxxCheckExact(self)) PyObject_DEL(self); else self->ob_type->tp_free(self); } is any faster than always calling the else branch, so I haven't attempted that -- however those types whose own dealloc is fancier (int, float, unicode) do use this pattern.
*	SF bug [#467265] Compile errors on SuSe Linux on IBM/s390.	Tim Peters	2001-10-02	1	-1/+6
\| \| \| \| \| \| \|	Unknown whether this fixes it. - stringobject.c, PyString_FromFormatV: don't assume that va_list is of a type that can be copied via an initializer. - errors.c, PyErr_Format: add a va_end() to balance the va_start().
*	Merge branch changes (coercion, rich comparisons) into trunk.	Guido van Rossum	2001-09-27	1	-4/+2
\|
*	Change string comparison so that it applies even when one (or both)	Guido van Rossum	2001-09-24	1	-3/+4
\| \| \| \| \|	arguments are subclasses of str, as long as they don't override rich comparison.
*	If interning an instance of a string subclass, intern a real string object	Tim Peters	2001-09-12	1	-4/+20
\| \| \| \| \| \|	with the same value instead. This ensures that a string (or string subclass) object's ob_sinterned pointer is always a str (or NULL), and that the dict of interned strings only has strs as keys.
*	str_subtype_new, unicode_subtype_new:	Tim Peters	2001-09-12	1	-5/+15
\| \| \| \| \| \| \| \|	+ These were leaving the hash fields at 0, which all string and unicode routines believe is a legitimate hash code. As a result, hash() applied to str and unicode subclass instances always returned 0, which in turn confused dict operations, etc. + Changed local names "new"; no point to antagonizing C++ compilers.
*	More bug 460020: lots of string optimizations inhibited for string	Tim Peters	2001-09-12	1	-79/+46
\| \| \| \| \| \| \| \| \|	subclasses, all "the usual" ones (slicing etc), plus replace, translate, ljust, rjust, center and strip. I don't know how to be sure they've all been caught. Question: Should we complain if someone tries to intern an instance of a string subclass? I hate to slow any code on those paths.
*	More on SF bug [#460020] bug or feature: unicode() and subclasses.	Tim Peters	2001-09-11	1	-1/+1
\| \| \| \| \|	Repaired str(i) to return a genuine string when i is an instance of a str subclass. New PyString_CheckExact() macro.
*	Fix a memory leak in str_subtype_new(). (All the other	Guido van Rossum	2001-08-31	1	-3/+3
\| \| \| \|	xxx_subtype_new() functions are OK, but I goofed up in this one. :-( )
*	Make str and tuple types subclassable.	Guido van Rossum	2001-08-30	1	-2/+24
\|
*	Two improvements suggested by Greg Stein:	Barry Warsaw	2001-08-27	1	-2/+5
\| \| \| \| \| \| \| \| \|	PyString_FromFormatV(): In the final resize at the end, we can use PyString_AS_STRING() since we know the object is a string and can avoid the typechecking. PyString_FromFormat(): GS sez: "For safety/propriety, you should call va_end() on the vargs variable."
*	PyString_FromFormatV: Massage platform %p output to match what gcc does,	Tim Peters	2001-08-25	1	-0/+8
\| \| \| \| \| \| \|	at least in the first two characters. %p is ill-defined, and people will forever commit bad tests otherwise ("bad" in the sense that they fall over (at least on Windows) for lack of a leading '0x'; 5 of the 7 tests in test_repr.py failed on Windows for that reason this time around).
*	PyString_FromFormat() and PyString_FromFormatV(): Largely ripped from	Barry Warsaw	2001-08-24	1	-0/+155
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	PyErr_Format() these new C API methods can be used instead of sprintf()'s into hardcoded char* buffers. This allows us to fix many situation where long package, module, or class names get truncated in reprs. PyString_FromFormat() is the varargs variety. PyString_FromFormatV() is the va_list variety Original PyErr_Format() code was modified to allow %p and %ld expansions. Many reprs were converted to this, checkins coming soo. Not changed: complex_repr(), float_repr(), float_print(), float_str(), int_repr(). There may be other candidates not yet converted. Closes patch #454743.
*	Patch #445762: Support --disable-unicode	Martin v. Löwis	2001-08-17	1	-4/+56
\| \| \| \| \| \| \| \|	- Do not compile unicodeobject, unicodectype, and unicodedata if Unicode is disabled - check for Py_USING_UNICODE in all places that use Unicode functions - disables unicode literals, and the builtin functions - add the types.StringTypes list - remove Unicode literals from most tests.
*	Patch #427190: Implement and use METH_NOARGS and METH_O.	Martin v. Löwis	2001-08-16	1	-103/+56
\|
*	Merge of descr-branch back into trunk.	Tim Peters	2001-08-02	1	-26/+50
\|
*	Add _PyUnicode_AsDefaultEncodedString to unicodeobject.h.	Jeremy Hylton	2001-07-30	1	-5/+0
\| \| \| \| \| \| \|	And remove all the extern decls in the middle of .c files. Apparently, it was excluded from the header file because it is intended for internal use by the interpreter. It's still intended for internal use and documented as such in the header file.
*	Reformat decl of new _PyString_Join. Add NEWS blurb about repr() speedup.	Tim Peters	2001-06-16	1	-1/+2
\|
*	SF bug 433228: repr(list) woes when len(list) big.	Tim Peters	2001-06-16	1	-0/+17
\| \| \| \| \| \| \| \| \| \| \| \|	Gave Python linear-time repr() implementations for dicts, lists, strings. This means, e.g., that repr(range(50000)) is no longer 50x slower than pprint.pprint() in 2.2 <wink>. I don't consider this a bugfix candidate, as it's a performance boost. Added _PyString_Join() to the internal string API. If we want that in the public API, fine, but then it requires runtime error checks instead of asserts.
*	Fix for bug #432384: Recursion in PyString_AsEncodedString?	Marc-André Lemburg	2001-06-12	1	-1/+1
\|
*	Patch #424335: Implement string_richcompare, remove string_compare.	Martin v. Löwis	2001-05-24	1	-12/+77
\| \| \| \|	Use new _PyString_Eq in lookdict_string.
*	This patch changes the way the string .encode() method works slightly	Marc-André Lemburg	2001-05-15	1	-20/+90
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	and introduces a new method .decode(). The major change is that strg.encode() will no longer try to convert Unicode returns from the codec into a string, but instead pass along the Unicode object as-is. The same is now true for all other codec return types. The underlying C APIs were changed accordingly. Note that even though this does have the potential of breaking existing code, the chances are low since conversion from Unicode previously took place using the default encoding which is normally set to ASCII rendering this auto-conversion mechanism useless for most Unicode encodings. The good news is that you can now use .encode() and .decode() with much greater ease and that the door was opened for better accessibility of the builtin codecs. As demonstration of the new feature, the patch includes a few new codecs which allow string to string encoding and decoding (rot13, hex, zip, uu, base64). Written by Marc-Andre Lemburg. Copyright assigned to the PSF.
*	Heh. I need a break. After this: stropmodule & stringobject were more	Tim Peters	2001-05-10	1	-8/+6
\| \| \| \| \| \|	out of synch than I realized, and I managed to break replace's "count" argument when it was 0. All is well again. Maybe. Bugfix candidate.
*	Fudge. stropmodule and stringobject both had copies of the buggy	Tim Peters	2001-05-10	1	-32/+41
\| \| \| \| \| \|	mymemXXX stuff, and they were already out of synch. Fix the remaining bugs in both and get them back in synch. Bugfix release candidate.
*	SF patch #416247 2.1c1 stringobject: unused vrbl cleanup.	Tim Peters	2001-05-09	1	-2/+0
\| \| \| \|	Thanks to Mark Favas.
*	Sheesh -- repair the dodge around "cast isn't an lvalue" complaints to	Tim Peters	2001-05-09	1	-0/+4
\| \| \| \|	restore correct semantics.
*	Mark Favas reported that gcc caught me using casts as lvalues. Dodge it.	Tim Peters	2001-05-09	1	-6/+10
\|
*	Ack! Restore the COUNT_ALLOCS one_strings code.	Tim Peters	2001-05-09	1	-1/+5
\|
*	My change to string_item() left an extra reference to each 1-character	Tim Peters	2001-05-09	1	-4/+3
\| \| \| \| \|	interned string created by "string"[i]. Since they're immortal anyway, this was hard to notice, but it was still wrong <wink>.