cpython.git - https://github.com/python/cpython.git

	Commit message (Collapse)	Author	Age	Files	Lines
*	[3.12] gh-113993: Make interned strings mortal (GH-120520, GH-121364, ↵	Petr Viktorin	2024-09-27	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	GH-121903, GH-122303) (#123065) This backports several PRs for gh-113993, making interned strings mortal so they can be garbage-collected when no longer needed. * Allow interned strings to be mortal, and fix related issues (GH-120520) * Add an InternalDocs file describing how interning should work and how to use it. * Add internal functions to explicitly request what kind of interning is done: - `_PyUnicode_InternMortal` - `_PyUnicode_InternImmortal` - `_PyUnicode_InternStatic` * Switch uses of `PyUnicode_InternInPlace` to those. * Disallow using `_Py_SetImmortal` on strings directly. You should use `_PyUnicode_InternImmortal` instead: - Strings should be interned before immortalization, otherwise you're possibly interning a immortalizing copy. - `_Py_SetImmortal` doesn't handle the `SSTATE_INTERNED_MORTAL` to `SSTATE_INTERNED_IMMORTAL` update, and those flags can't be changed in backports, as they are now part of public API and version-specific ABI. * Add private `_only_immortal` argument for `sys.getunicodeinternedsize`, used in refleak test machinery. Make sure the statically allocated string singletons are unique. This means these sets are now disjoint: - `_Py_ID` - `_Py_STR` (including the empty string) - one-character latin-1 singletons Now, when you intern a singleton, that exact singleton will be interned. * Add a `_Py_LATIN1_CHR` macro, use it instead of `_Py_ID`/`_Py_STR` for one-character latin-1 singletons everywhere (including Clinic). * Intern `_Py_STR` singletons at startup. * Beef up the tests. Cover internal details (marked with `@cpython_only`). * Add lots of assertions * Don't immortalize in PyUnicode_InternInPlace; keep immortalizing in other API (GH-121364) * Switch PyUnicode_InternInPlace to _PyUnicode_InternMortal, clarify docs * Document immortality in some functions that take `const char ` This is PyUnicode_InternFromString; PyDict_SetItemString, PyObject_SetAttrString; PyObject_DelAttrString; PyUnicode_InternFromString; and the PyModule_Add convenience functions. Always point out a non-immortalizing alternative. Don't immortalize user-provided attr names in _ctypes * Immortalize names in code objects to avoid crash (GH-121903) * Intern latin-1 one-byte strings at startup (GH-122303) There are some 3.12-specific changes, mainly to allow statically allocated strings in deepfreeze. (In 3.13, deepfreeze switched to the general `_Py_ID`/`_Py_STR`.) Co-authored-by: Eric Snow <ericsnowcurrently@gmail.com>
*	[3.12] gh-93691: fix too broad source locations of for statement iterators ↵	Irit Katriel	2024-06-13	1	-5/+4
\| \| \| \| \| \|	(GH-120330 (#120405) [3.12] gh-93691: fix too broad source locations of for statement iterators (GH-120330). (cherry picked from commit 97b69db167be28a33688db436551a6c3c3ea4662)
*	gh-102856: Initial implementation of PEP 701 (#102855)	Pablo Galindo Salgado	2023-04-19	1	-4/+4
\| \| \| \| \| \|	Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com> Co-authored-by: Batuhan Taskaya <isidentical@gmail.com> Co-authored-by: Marta Gómez Macías <mgmacias@google.com> Co-authored-by: sunmy2019 <59365878+sunmy2019@users.noreply.github.com>
*	GH-88691: Shrink the CALL caches (GH-103230)	Brandt Bucher	2023-04-05	1	-33/+33
\|
*	GH-89987: Shrink the BINARY_SUBSCR caches (GH-103022)	Brandt Bucher	2023-03-29	1	-29/+28
\|
*	gh-101632: Add the new RETURN_CONST opcode (#101633)	penguin_wwy	2023-02-07	1	-25/+25
\|
*	GH-99554: Pack location tables more effectively (GH-99556)	Brandt Bucher	2022-12-22	1	-11/+8
\|
*	GH-96793: Change `FOR_ITER` to not pop the iterator on exhaustion. (GH-96801)	Mark Shannon	2022-10-27	1	-28/+28
\| \| \| \|	Change FOR_ITER to have the same stack effect regardless of whether it branches or not. Performance is unchanged as FOR_ITER (and specialized forms jump over the cleanup code).
*	gh-94485: Set line number of module's RESUME instruction to 0, as specified ↵	Irit Katriel	2022-07-05	1	-11/+11
\| \| \| \| \|	by PEP 626 (GH-94552) Co-authored-by: Mark Shannon <mark@hotpy.org>
*	GH-91432: Specialize FOR_ITER (GH-91713)	Dennis Sweeney	2022-06-21	1	-30/+31
\| \| \| \| \|	* Adds FOR_ITER_LIST and FOR_ITER_RANGE specializations. * Adds _PyLong_AssignValue() internal function to avoid temporary boxing of ints.
*	GH-93429: Merge `LOAD_METHOD` back into `LOAD_ATTR` (GH-93430)	Ken Jin	2022-06-14	1	-34/+35
\|
*	gh-93382: Cache result of `PyCode_GetCode` in codeobject (GH-93383)	Ken Jin	2022-06-03	1	-4/+4
\| \| \| \|	Co-authored-by: Kumar Aditya <59607654+kumaraditya303@users.noreply.github.com> Co-authored-by: Dennis Sweeney <36520290+sweeneyde@users.noreply.github.com>
*	GH-90690: Remove `PRECALL` instruction (GH-92925)	Mark Shannon	2022-05-19	1	-21/+19
\|
*	gh-78214: marshal: Stabilize FLAG_REF usage (GH-8226)	Inada Naoki	2022-05-04	1	-8/+8
\| \| \| \| \| \| \| \| \|	Use FLAG_REF always for interned strings. Refcounts of interned string is very unstable. When compiling same source, refcounts of interned string in the output may be 1 or >1. It makes FLAG_REF usage unstable. To help reproducible build, use FLAG_REF for interned string even if refcnt(obj)==1.
*	GH-88116: Use a compact format to represent end line and column offsets. ↵	Mark Shannon	2022-04-21	1	-15/+11
\| \| \| \| \| \| \| \| \| \| \| \|	(GH-91666) * Stores all location info in linetable to conform to PEP 626. * Remove column table from code objects. * Remove end-line table from code objects. * Document new location table format
*	bpo-47120: Replace the JUMP_ABSOLUTE opcode by the relative JUMP_BACKWARD ↵	Irit Katriel	2022-03-31	1	-1/+1
\| \| \| \|	(GH-32115)
*	bpo-46841: Use inline caching for calls (GH-31709)	Brandt Bucher	2022-03-07	1	-35/+40
\|
*	bpo-46841: Use inline caching for attribute accesses (GH-31640)	Brandt Bucher	2022-03-03	1	-7/+9
\|
*	bpo-46841: Use inline cache for `BINARY_SUBSCR`. (GH-31618)	Mark Shannon	2022-03-01	1	-10/+12
\|
*	bpo-46329: Streamline calling sequence a bit. (GH-31465)	Mark Shannon	2022-02-21	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Move handling of bound-methods to PRECALL. * Remove call_shape.postcall_shrink * Remove call_shape.callable * Remove call_shape.callable. Change CALL oparg to match PRECALL oparg. * Move KW_NAMES before PRECALL. * Update opcode docs in dis.rst
*	bpo-46329: Change calling sequence (again) (GH-31373)	Mark Shannon	2022-02-18	1	-31/+32
\| \| \| \|	* Change calling sequence: Add PUSH_NULL. Merge PRECALL_FUNCTION and PRECALL_METHOD into PRECALL.
*	bpo-46541: Replace core use of _Py_IDENTIFIER() with statically initialized ↵	Eric Snow	2022-02-08	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	global objects. (gh-30928) We're no longer using _Py_IDENTIFIER() (or _Py_static_string()) in any core CPython code. It is still used in a number of non-builtin stdlib modules. The replacement is: PyUnicodeObject (not pointer) fields under _PyRuntimeState, statically initialized as part of _PyRuntime. A new _Py_GET_GLOBAL_IDENTIFIER() macro facilitates lookup of the fields (along with _Py_GET_GLOBAL_STRING() for non-identifier strings). https://bugs.python.org/issue46541#msg411799 explains the rationale for this change. The core of the change is in: * (new) Include/internal/pycore_global_strings.h - the declarations for the global strings, along with the macros * Include/internal/pycore_runtime_init.h - added the static initializers for the global strings * Include/internal/pycore_global_objects.h - where the struct in pycore_global_strings.h is hooked into _PyRuntimeState * Tools/scripts/generate_global_objects.py - added generation of the global string declarations and static initializers I've also added a --check flag to generate_global_objects.py (along with make check-global-objects) to check for unused global strings. That check is added to the PR CI config. The remainder of this change updates the core code to use _Py_GET_GLOBAL_IDENTIFIER() instead of _Py_IDENTIFIER() and the related _PyId functions (likewise for _Py_GET_GLOBAL_STRING() instead of _Py_static_string()). This includes adding a few functions where there wasn't already an alternative to _PyId(), replacing the _Py_Identifier * parameter with PyObject . The following are not changed (yet): stop using _Py_IDENTIFIER() in the stdlib modules * (maybe) get rid of _Py_IDENTIFIER(), etc. entirely -- this may not be doable as at least one package on PyPI using this (private) API * (maybe) intern the strings during runtime init https://bugs.python.org/issue46541
*	bpo-46329: Split calls into precall and call instructions. (GH-30855)	Mark Shannon	2022-01-28	1	-29/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Add PRECALL_FUNCTION opcode. * Move 'call shape' varaibles into struct. * Replace CALL_NO_KW and CALL_KW with KW_NAMES and CALL instructions. * Specialize for builtin methods taking using the METH_FASTCALL \| METH_KEYWORDS protocol. * Allow kwnames for specialized calls to builtin types. * Specialize calls to tuple(arg) and str(arg).
*	bpo-45923: Handle call events in bytecode (GH-30364)	Mark Shannon	2022-01-06	1	-31/+31
\| \| \| \|	* Add a RESUME instruction to handle "call" events.
*	bpo-44525: Split calls into PRECALL and CALL (GH-30011)	Mark Shannon	2021-12-14	1	-4/+4
\| \| \| \| \| \| \| \| \| \|	* Add 3 new opcodes for calls: PRECALL_METHOD, CALL_NO_KW, CALL_KW. * Update specialization to handle new CALL opcodes. * Specialize call to method descriptors. * Remove old CALL opcodes: CALL_FUNCTION, CALL_METHOD, CALL_METHOD_KW, CALL_FUNCTION_KW.
*	bpo-44530: Add co_qualname field to PyCodeObject (GH-26941)	Gabriele N. Tornetta	2021-07-07	1	-10/+11
\|
*	bpo-43950: Add code.co_positions (PEP 657) (GH-26955)	Pablo Galindo	2021-07-02	1	-1/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This PR is part of PEP 657 and augments the compiler to emit ending line numbers as well as starting and ending columns from the AST into compiled code objects. This allows bytecodes to be correlated to the exact source code ranges that generated them. This information is made available through the following public APIs: * The `co_positions` method on code objects. * The C API function `PyCode_Addr2Location`. Co-authored-by: Batuhan Taskaya <isidentical@gmail.com> Co-authored-by: Ammar Askar <ammar@ammaraskar.com>
*	bpo-44313: generate LOAD_ATTR/CALL_FUNCTION for top-level imported objects ↵	Batuhan Taskaya	2021-06-30	1	-1/+1
\| \| \| \|	(GH-26677)
*	bpo-43693 Get rid of CO_NOFREE -- it's unused (GH-26839)	Guido van Rossum	2021-06-23	1	-1/+1
\| \| \| \| \| \|	All uses of this flag are either setting it or in doc or tests for it. So we should be able to get rid of it completely.
*	bpo-43693: Turn localspluskinds into an object (GH-26749)	Guido van Rossum	2021-06-21	1	-5/+5
\| \| \|	Managing it as a bare pointer to malloc'ed bytes is just too awkward in a few places.
*	bpo-43693: Un-revert commits 2c1e258 and b2bf2bc. (gh-26577)	Eric Snow	2021-06-07	1	-6/+5
\| \| \| \| \| \| \| \| \| \|	These were reverted in gh-26530 (commit 17c4edc) due to refleaks. * 2c1e258 - Compute deref offsets in compiler (gh-25152) * b2bf2bc - Add new internal code objects fields: co_fastlocalnames and co_fastlocalkinds. (gh-26388) This change fixes the refleaks. https://bugs.python.org/issue43693
*	bpo-43693: Revert commits 2c1e2583fdc4db6b43d163239ea42b0e8394171f and ↵	Pablo Galindo	2021-06-04	1	-5/+6
\| \| \| \| \| \| \| \| \| \| \| \| \|	b2bf2bc1ece673d387341e06c8d3c2bc6e259747 (GH-26530) * Revert "bpo-43693: Compute deref offsets in compiler (gh-25152)" This reverts commit b2bf2bc1ece673d387341e06c8d3c2bc6e259747. * Revert "bpo-43693: Add new internal code objects fields: co_fastlocalnames and co_fastlocalkinds. (gh-26388)" This reverts commit 2c1e2583fdc4db6b43d163239ea42b0e8394171f. These two commits are breaking the refleak buildbots.
*	bpo-43693: Add new internal code objects fields: co_fastlocalnames and ↵	Eric Snow	2021-06-03	1	-6/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	co_fastlocalkinds. (gh-26388) A number of places in the code base (notably ceval.c and frameobject.c) rely on mapping variable names to indices in the frame "locals plus" array (AKA fast locals), and thus opargs. Currently the compiler indirectly encodes that information on the code object as the tuples co_varnames, co_cellvars, and co_freevars. At runtime the dependent code must calculate the proper mapping from those, which isn't ideal and impacts performance-sensitive sections. This is something we can easily address in the compiler instead. This change addresses the situation by replacing internal use of co_varnames, etc. with a single combined tuple of names in locals-plus order, along with a minimal array mapping each to its kind (local vs. cell vs. free). These two new PyCodeObject fields, co_fastlocalnames and co_fastllocalkinds, are not exposed to Python code for now, but co_varnames, etc. are still available with the same values as before (though computed lazily). Aside from the (mild) performance impact, there are a number of other benefits: * there's now a clear, direct relationship between locals-plus and variables * code that relies on the locals-plus-to-name mapping is simpler * marshaled code objects are smaller and serialize/de-serialize faster Also note that we can take this approach further by expanding the possible values in co_fastlocalkinds to include specific argument types (e.g. positional-only, kwargs). Doing so would allow further speed-ups in _PyEval_MakeFrameVector(), which is where args get unpacked into the locals-plus array. It would also allow us to shrink marshaled code objects even further. https://bugs.python.org/issue43693
*	bpo-43693: Add _PyCode_New(). (gh-26375)	Eric Snow	2021-05-27	1	-25/+25
\| \| \| \| \|	This is an internal-only API that helps us manage the many values used to create a code object. https://bugs.python.org/issue43693
*	bpo-44131: Py_FrozenMain() uses PyConfig_SetBytesArgv() (GH-26201)	Victor Stinner	2021-05-20	1	-24/+22
\| \| \| \| \|	Moreover, Py_FrozenMain() relies on Py_InitializeFromConfig() to handle the PYTHONUNBUFFERED environment variable and configure C stdio streams like stdout (make the stream unbuffered).
*	bpo-44131: Fix Makefile for test_frozenmain (GH-26203)	Victor Stinner	2021-05-18	1	-1/+1
\| \| \| \|	Remove Programs/test_frozenmain.h Makefile target: it ran make in parallel which caused build errors on LTO+PGO builds.
*	bpo-44131: Test Py_FrozenMain() (GH-26126)	Victor Stinner	2021-05-17	1	-0/+30
	* Add test_frozenmain to test_embed * Add Programs/test_frozenmain.py * Add Programs/freeze_test_frozenmain.py * Add Programs/test_frozenmain.h * Add make regen-test-frozenmain * Add test_frozenmain command to Programs/_testembed * _testembed.c: add error(msg) function