cpython.git - https://github.com/python/cpython.git

	Commit message	Author	Age	Files	Lines
*	gh-109523: Raise a BlockingIOError if reading text from a non-blocking ↵	Giovanni Siragusa	2024-12-02	1	-1/+4
\| \| \| \|	stream cannot immediately return bytes. (GH-122933)
*	gh-120754: _io Ensure stat cache is cleared on fd change (#125166)	Cody Maloney	2024-11-01	1	-0/+3
\| \| \| \| \| \| \| \|	Performed an audit of `fileio.c` and `_pyio` and made sure that any time the fd changes, the stat result, if set, is also cleared or changed. There is one case where it is not cleared: where the code would clear it in `__init__`, keep the memory allocated and just do another fstat with the existing memory.
*	gh-90102: Fix pyio _isatty_open_only() (#125089)	Cody Maloney	2024-10-08	1	-1/+1
\| \| \| \| \| \| \|	Spotted by @ngnpope. `isatty` returns False to indicate the file is not a TTY. The C implementation of _io does that (`Py_RETURN_FALSE`) but I got the bool backwards in the _pyio implementation.
*	gh-90102: Remove isatty call during regular open (#124922)	Cody Maloney	2024-10-08	1	-1/+16
\| \| \|	Co-authored-by: Victor Stinner <vstinner@python.org>
*	gh-120754: Refactor I/O modules to stash whole stat result rather than ↵	Cody Maloney	2024-09-18	1	-20/+24
\| \| \| \| \| \| \| \| \| \| \| \|	individual members (#123412) Multiple places in the I/O stack optimize common cases by using the information from stat. Currently individual members are extracted from the stat and stored into the fileio struct. Refactor the code to store the whole stat struct instead. Parallels the changes to _io. The `stat` Python object doesn't allow changing members, so rather than modifying estimated_size, just clear the value.
*	gh-120754: Reduce system calls in full-file FileIO.readall() case (#120755)	Cody Maloney	2024-07-04	1	-8/+14
\|	This reduces the system call count of a simple program[0] that reads all the `.rst` files in Doc by over 10% (5706 -> 4734 system calls on my linux system, 5813 -> 4875 on my macOS).

This always reduces the number of `fstat()` calls, and reduces seek calls most of the time.

Stat was always called twice: once at open (to error early on directories), and a second time to get the size of the file to be able to read the whole file in one read. Now the size is cached with the first call.

The code keeps an optimization that if the user had previously read a lot of data, the current position is subtracted from the number of bytes to read. That is somewhat expensive, so only do it on larger files; otherwise just try to read the extra bytes and resize the PyBytes as needed.

I built a little test program to validate the behavior + assumptions around relative costs and then ran it under `strace` to get a log of the system calls. Full samples below[1].

After the changes, this is everything in one `filename.read_text()`:

```
openat(AT_FDCWD, "cpython/Doc/howto/clinic.rst", O_RDONLY|O_CLOEXEC) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=343, ...}) = 0
ioctl(3, TCGETS, 0x7ffdfac04b40) = -1 ENOTTY (Inappropriate ioctl for device)
lseek(3, 0, SEEK_CUR) = 0
read(3, ":orphan:\n\n.. This page is retain"..., 344) = 343
read(3, "", 1) = 0
close(3) = 0
```

This does make some tradeoffs:

1. If the file size changes between open() and readall(), this will still get all the data but might have more read calls.
2. I experimented with avoiding the stat + cached result for small files in general, but on my dev workstation at least that tended to reduce performance compared to using the fstat().

[0]

```python3
from pathlib import Path

nlines = []
for filename in Path("cpython/Doc").glob("**/*.rst"):
    nlines.append(len(filename.read_text()))
```

[1]

Before small file:

```
openat(AT_FDCWD, "cpython/Doc/howto/clinic.rst", O_RDONLY|O_CLOEXEC) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=343, ...}) = 0
ioctl(3, TCGETS, 0x7ffe52525930) = -1 ENOTTY (Inappropriate ioctl for device)
lseek(3, 0, SEEK_CUR) = 0
lseek(3, 0, SEEK_CUR) = 0
fstat(3, {st_mode=S_IFREG|0644, st_size=343, ...}) = 0
read(3, ":orphan:\n\n.. This page is retain"..., 344) = 343
read(3, "", 1) = 0
close(3) = 0
```

After small file:

```
openat(AT_FDCWD, "cpython/Doc/howto/clinic.rst", O_RDONLY|O_CLOEXEC) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=343, ...}) = 0
ioctl(3, TCGETS, 0x7ffdfac04b40) = -1 ENOTTY (Inappropriate ioctl for device)
lseek(3, 0, SEEK_CUR) = 0
read(3, ":orphan:\n\n.. This page is retain"..., 344) = 343
read(3, "", 1) = 0
close(3) = 0
```

Before large file:

```
openat(AT_FDCWD, "cpython/Doc/c-api/typeobj.rst", O_RDONLY|O_CLOEXEC) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=133104, ...}) = 0
ioctl(3, TCGETS, 0x7ffe52525930) = -1 ENOTTY (Inappropriate ioctl for device)
lseek(3, 0, SEEK_CUR) = 0
lseek(3, 0, SEEK_CUR) = 0
fstat(3, {st_mode=S_IFREG|0644, st_size=133104, ...}) = 0
read(3, ".. highlight:: c\n\n.. _type-struc"..., 133105) = 133104
read(3, "", 1) = 0
close(3) = 0
```

After large file:

```
openat(AT_FDCWD, "cpython/Doc/c-api/typeobj.rst", O_RDONLY|O_CLOEXEC) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=133104, ...}) = 0
ioctl(3, TCGETS, 0x7ffdfac04b40) = -1 ENOTTY (Inappropriate ioctl for device)
lseek(3, 0, SEEK_CUR) = 0
lseek(3, 0, SEEK_CUR) = 0
read(3, ".. highlight:: c\n\n.. _type-struc"..., 133105) = 133104
read(3, "", 1) = 0
close(3) = 0
```

Co-authored-by: Shantanu <12621235+hauntsaninja@users.noreply.github.com>
Co-authored-by: Erlend E. Aasland <erlend.aasland@protonmail.com>
Co-authored-by: Victor Stinner <vstinner@python.org>
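The size-caching idea in the gh-120754 `readall()` commit above can be sketched roughly as follows. This is a simplified illustration, not the actual `FileIO.readall()` code: `read_all_with_cached_size` and `demo` are hypothetical names, the 8192-byte follow-up read size is an arbitrary assumption, and the sketch skips the current-position adjustment that the commit keeps for larger files.

```python
import os
import tempfile


def read_all_with_cached_size(fd, cached_stat):
    """Read a whole file using a size remembered from an earlier fstat().

    Hypothetical helper, not the CPython implementation.
    """
    chunks = []
    # Ask for one byte more than the cached size. In the common case the
    # first read returns the whole file and the next read returns b"" (EOF),
    # so no second fstat() is needed.
    to_read = cached_stat.st_size + 1
    while True:
        chunk = os.read(fd, to_read)
        if not chunk:
            break
        chunks.append(chunk)
        # If the file grew after the stat, keep reading in fixed-size steps
        # (8192 is an arbitrary choice for this sketch).
        to_read = 8192
    return b"".join(chunks)


def demo():
    with tempfile.NamedTemporaryFile(delete=False) as tmp:
        tmp.write(b"hello world")
        path = tmp.name
    fd = os.open(path, os.O_RDONLY)
    try:
        st = os.fstat(fd)  # the single stat, as done at open time
        return read_all_with_cached_size(fd, st)
    finally:
        os.close(fd)
        os.unlink(path)
```

If the file grows between the stat and the read, the loop still returns all the data at the cost of extra `read()` calls, which matches the first tradeoff listed in the commit message.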
*	gh-120417: Add #noqa to used imports in the stdlib (#120421)	Victor Stinner	2024-06-13	1	-1/+1
\| \| \| \| \|	Tools such as ruff can ignore "imported but unused" warnings if a line ends with "# noqa: F401". This avoids the temptation to remove an import that is actually in use.
*	gh-95782: Fix io.BufferedReader.tell() etc. being able to return offsets < 0 ↵	6t8k	2024-02-17	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(GH-99709) lseek() always returns 0 for character pseudo-devices like `/dev/urandom` (for other non-regular files, e.g. `/dev/stdin`, it always returns -1, to which CPython reacts by raising appropriate exceptions). They are thus technically seekable despite not having seek semantics. When calling read() on e.g. an instance of `io.BufferedReader` that wraps such a file, `BufferedReader` reads ahead, filling its buffer, creating a discrepancy between the number of bytes read and the internal `tell()` always returning 0, which previously resulted in e.g. `BufferedReader.tell()` or `BufferedReader.seek()` being able to return positions < 0 even though these are supposed to be always >= 0. Invariably keep the return value non-negative by returning max(former_return_value, 0) instead, and add some corresponding tests.
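The arithmetic behind the gh-95782 fix can be illustrated with a small sketch. `buffered_tell` is a hypothetical helper, not the function in `_pyio`; it only assumes, as the commit message describes, that the buffered position is the raw position minus the bytes read ahead and that the result is clamped with `max(..., 0)`.

```python
def buffered_tell(raw_pos, buffered_unread):
    """Return the logical position of a buffered reader.

    Hypothetical helper illustrating the clamping described above.
    """
    # The raw stream has been read ahead of the caller, so the unread
    # buffered bytes are subtracted from the raw position.
    pos = raw_pos - buffered_unread
    # Clamp at zero for devices whose lseek() always reports 0.
    return max(pos, 0)
```

For a regular file the subtraction is already non-negative, so the clamp only changes the result for pseudo-devices whose raw position stays at 0.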
*	gh-82626: Emit a warning when bool is used as a file descriptor (GH-111275)	Serhiy Storchaka	2024-02-05	1	-0/+5
\|
*	gh-80109: Fix io.TextIOWrapper dropping the internal buffer during write() ↵	Zackery Spytz	2024-01-08	1	-4/+6
\| \| \| \| \| \| \|	(GH-22535) io.TextIOWrapper was dropping the internal decoding buffer during read() and write() calls.
*	gh-62948: IOBase finalizer logs close() errors (#105104)	Victor Stinner	2023-05-31	1	-16/+4
\|
*	bpo-45975: Simplify some while-loops with walrus operator (GH-29347)	Nick Drozd	2022-11-26	1	-4/+1
\|
*	gh-98999: Raise `ValueError` in `_pyio` on closed buffers (gh-99009)	Nikita Sobolev	2022-11-03	1	-0/+5
\|
*	gh-94169: Remove deprecated io.OpenWrapper (#94170)	Victor Stinner	2022-06-24	1	-16/+0
\| \| \| \| \| \|	Remove io.OpenWrapper and _pyio.OpenWrapper, deprecated in Python 3.10: just use :func:`open` instead. The open() (io.open()) function is a built-in function. Since Python 3.10, _pyio.open() is also a static method.
*	gh-93099: Fix _pyio to use locale module properly (gh-93136)	Dong-hee Na	2022-05-24	1	-8/+11
\|
*	gh-91952: Make TextIOWrapper.reconfigure() supports "locale" encoding (GH-91982)	Inada Naoki	2022-05-01	1	-0/+2
\|
*	gh-91526: io: Remove device encoding support from TextIOWrapper (GH-91529)	Inada Naoki	2022-04-19	1	-8/+0
\| \| \| \|	`TextIOWrapper.__init__()` called `os.device_encoding(file.fileno())` if fileno is 0-2 and encoding=None. But it very rarely works, and was never documented behavior.
*	gh-91156: Fix `encoding="locale"` in UTF-8 mode (GH-70056)	Inada Naoki	2022-04-14	1	-3/+5
\|
*	bpo-47000: Make `io.text_encoding()` respects UTF-8 mode (GH-32003)	Inada Naoki	2022-04-04	1	-3/+7
\| \| \|	Co-authored-by: Eric Snow <ericsnowcurrently@gmail.com>
*	bpo-25415: Remove confusing sentence from IOBase docstrings (PR-31631)	slateny	2022-03-04	1	-3/+2
\|
*	bpo-46522: fix concurrent.futures and io AttributeError messages (GH-30887)	Thomas Grainger	2022-02-23	1	-1/+1
\| \| \| \|	Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com> Co-authored-by: Andrew Svetlov <andrew.svetlov@gmail.com>
*	bpo-37330: open() no longer accept 'U' in file mode (GH-28118)	Victor Stinner	2021-09-02	1	-13/+1
\| \| \| \| \|	open(), io.open(), codecs.open() and fileinput.FileInput no longer accept "U" ("universal newline") in the file mode. This flag was deprecated since Python 3.3.
*	bpo-43680: Deprecate io.OpenWrapper (GH-25357)	Victor Stinner	2021-04-14	1	-12/+14
\| \| \| \| \| \| \| \|	Deprecate io.OpenWrapper and _pyio.OpenWrapper: use io.open and _pyio.open instead. Until Python 3.9, _pyio.open was not a static method and builtins.open was set to OpenWrapper to not become a bound method when set to a class variable. _io.open is a built-in function whereas _pyio.open is a Python function. In Python 3.10, _pyio.open() is now a static method, and builtins.open() is now io.open().
*	bpo-43680: _pyio.open() becomes a static method (GH-25354)	Victor Stinner	2021-04-12	1	-11/+9
\| \| \| \| \| \| \| \| \| \| \|	The Python _pyio.open() function becomes a static method to behave as io.open() built-in function: don't become a bound method when stored as a class variable. It becomes possible since static methods are now callable in Python 3.10. Moreover, _pyio.OpenWrapper becomes a simple alias to _pyio.open. init_set_builtins_open() now sets builtins.open to io.open, rather than setting it to io.OpenWrapper, since OpenWrapper is now an alias to open in the io and _pyio modules.
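The binding problem that bpo-43680 addresses can be reproduced with ordinary classes. The sketch below is illustrative only: `open_like`, `WithPlainFunction`, `WithStaticMethod` and `demo` are hypothetical names standing in for `_pyio.open` and a class that stores it as a class variable.

```python
def open_like(name):
    """Stand-in for a module-level open() function (hypothetical)."""
    return f"opened {name}"


class WithPlainFunction:
    # A plain Python function stored on a class becomes a bound method when
    # accessed through an instance, so the instance would be passed as `name`.
    opener = open_like


class WithStaticMethod:
    # Wrapping the function in staticmethod() prevents that binding.
    opener = staticmethod(open_like)


def demo():
    bound = WithPlainFunction().opener
    unbound = WithStaticMethod().opener
    # A bound method carries a __self__ attribute; the static one is the
    # original function and can be called with just the file name.
    return hasattr(bound, "__self__"), unbound("data.txt")
```

Built-in functions such as `io.open` never bind this way, which is why only the pure-Python `_pyio.open` needed the change.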
*	Revert "bpo-43510: PEP 597: Accept `encoding="locale"` in binary mode ↵	Inada Naoki	2021-03-31	1	-1/+1
\| \| \| \| \|	(GH-25103)" (#25108) This reverts commit ff3c9739bd69aa8b58007e63c9e40e6708b4761e.
*	bpo-43510: PEP 597: Accept `encoding="locale"` in binary mode (GH-25103)	Inada Naoki	2021-03-31	1	-1/+1
\| \| \| \|	It makes `encoding="locale"` usable everywhere `encoding=None` is allowed.
*	bpo-43510: Implement PEP 597 opt-in EncodingWarning. (GH-19481)	Inada Naoki	2021-03-29	1	-10/+37
\|	See [PEP 597](https://www.python.org/dev/peps/pep-0597/).
	* Add `-X warn_default_encoding` and `PYTHONWARNDEFAULTENCODING`.
	* Add EncodingWarning.
	* Add io.text_encoding().
	* open() and TextIOWrapper() emit EncodingWarning when encoding is omitted and warn_default_encoding is enabled.
	* _pyio.TextIOWrapper() uses UTF-8 as the fallback default encoding when importing the locale module fails (used while building Python).
	* bz2, configparser, gzip, lzma, pathlib, tempfile modules use io.text_encoding().
	* What's new entry
*	bpo-39674: Revert "bpo-37330: open() no longer accept 'U' in file mode ↵	Victor Stinner	2020-03-04	1	-1/+13
\| \| \| \| \| \| \|	(GH-16959)" (GH-18767) This reverts commit e471e72977c83664f13d041c78549140c86c92de. The mode will be removed from Python 3.10.
*	bpo-35950: Raise UnsupportedOperation in BufferedReader.truncate() (GH-18586)	Berker Peksag	2020-02-21	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The truncate() method of io.BufferedReader() should raise UnsupportedOperation when it is called on a read-only io.BufferedReader() instance. https://bugs.python.org/issue35950 Automerge-Triggered-By: @methane
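The behavior described in bpo-35950 can be checked with a short sketch. `truncate_is_rejected` is a hypothetical helper name, and the example assumes a Python version that includes this change (3.9 or later).

```python
import io


def truncate_is_rejected():
    """Return True if truncate() on a read-only BufferedReader is refused."""
    # BufferedReader is never writable, even over a writable raw stream.
    reader = io.BufferedReader(io.BytesIO(b"abc"))
    try:
        reader.truncate()
    except io.UnsupportedOperation:
        return True
    return False
```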
*	closes bpo-27805: Ignore ESPIPE in initializing seek of append-mode files. ↵	Benjamin Peterson	2019-11-12	1	-1/+5
\| \| \| \| \|	(GH-17112) This change, which follows the behavior of C stdio's fdopen and Python 2's file object, allows pipes to be opened in append mode.
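A minimal sketch of what bpo-27805 allows, assuming a POSIX system and a Python version that includes the change (3.9 or later). `append_to_pipe` is a hypothetical helper name.

```python
import os


def append_to_pipe(payload):
    """Write to a pipe opened in append mode and read the data back."""
    read_fd, write_fd = os.pipe()
    # Opening in append mode performs an initial seek to the end of the
    # file. Pipes reject that seek with ESPIPE, which is now ignored.
    with open(write_fd, "ab", buffering=0) as writer:
        writer.write(payload)
    # The write end is closed at this point, so read() sees EOF after
    # the payload.
    with open(read_fd, "rb", buffering=0) as reader:
        return reader.read()
```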
*	bpo-37330: open() no longer accept 'U' in file mode (GH-16959)	Victor Stinner	2019-10-28	1	-13/+1
\| \| \| \| \|	open(), io.open(), codecs.open() and fileinput.FileInput no longer accept "U" ("universal newline") in the file mode. This flag was deprecated since Python 3.3.
*	bpo-15999: Clean up of handling boolean arguments. (GH-15610)	Serhiy Storchaka	2019-09-01	1	-4/+4
\|	* Use the 'p' format unit instead of manually calling PyObject_IsTrue().
	* Pass boolean values instead of 0/1 integers to functions that need a boolean.
	* Convert some arguments to boolean only once.
*	bpo-36743: __get__ is sometimes called without the owner argument (#12992)	Raymond Hettinger	2019-08-29	1	-1/+1
\|
*	bpo-37960: Silence only necessary errors in repr() of buffered and text ↵	Serhiy Storchaka	2019-08-29	1	-4/+4
\| \| \| \|	streams. (GH-15543)
*	Fix typos in comments, docs and test names (#15018)	Min ho Kim	2019-07-30	1	-1/+1
\|	* Fix typos in comments, docs and test names
	* Update test_pyparse.py account for change in string length
	* Apply suggestion: splitable -> splittable Co-Authored-By: Terry Jan Reedy <tjreedy@udel.edu>
	* Apply suggestion: splitable -> splittable Co-Authored-By: Terry Jan Reedy <tjreedy@udel.edu>
	* Apply suggestion: Dealloccte -> Deallocate Co-Authored-By: Terry Jan Reedy <tjreedy@udel.edu>
	* Update posixmodule checksum.
	* Reverse idlelib changes.
*	bpo-37388: Development mode check encoding and errors (GH-14341)	Victor Stinner	2019-06-25	1	-0/+4
\| \| \| \| \| \| \| \| \|	In development mode and in debug build, encoding and errors arguments are now checked on string encoding and decoding operations. Examples: open(), str.encode() and bytes.decode(). By default, for best performance, the errors argument is only checked at the first encoding/decoding error, and the encoding argument is sometimes ignored for empty strings.
*	bpo-18748: Fix _pyio.IOBase destructor (closed case) (GH-13952)	Victor Stinner	2019-06-11	1	-0/+10
\| \| \| \|	_pyio.IOBase destructor now does nothing if getting the closed attribute fails, to better mimic the _io.IOBase finalizer.
*	bpo-37054, _pyio: Fix BytesIO and TextIOWrapper __del__() (GH-13601)	Victor Stinner	2019-05-27	1	-1/+10
\| \| \| \| \|	Fix the destructors of _pyio.BytesIO and _pyio.TextIOWrapper: initialize their _buffer attribute as soon as possible (in the class body), because it's used by __del__() which calls close().
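The pattern described in bpo-37054, giving an attribute a class-level default so that the finalizer can run on a partly constructed object, might look like this in isolation. `Stream` and `demo` are hypothetical names and the class is not the `_pyio` code.

```python
class Stream:
    # Class-level default so that close() works even if __init__ fails
    # before the instance attribute is assigned.
    _buffer = None

    def __init__(self, fail=False):
        if fail:
            raise ValueError("constructor failed early")
        self._buffer = bytearray()

    def close(self):
        if self._buffer is not None:
            self._buffer.clear()

    def __del__(self):
        # Runs even for objects whose __init__ raised.
        self.close()


def demo():
    try:
        Stream(fail=True)  # __del__ still runs on the half-built object
    except ValueError:
        pass
    # Build an instance without running __init__, which is the state the
    # finalizer sees after an early constructor failure.
    partial = Stream.__new__(Stream)
    partial.close()  # no AttributeError thanks to the class-level default
    return partial._buffer
```

Without the class-level `_buffer = None`, both the explicit `close()` call and the implicit `__del__()` would raise `AttributeError` on such an object.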
*	bpo-36842: Implement PEP 578 (GH-12613)	Steve Dower	2019-05-23	1	-0/+23
\| \| \|	Adds sys.audit, sys.addaudithook, io.open_code, and associated C APIs.
*	bpo-18748: _pyio.IOBase emits unraisable exception (GH-13512)	Victor Stinner	2019-05-23	1	-8/+15
\| \| \| \| \| \| \| \|	In development (-X dev) mode and in a debug build, IOBase finalizer of the _pyio module now logs the exception if the close() method fails. The exception is ignored silently by default in release build. test_io: test_error_through_destructor() now uses support.catch_unraisable_exception() rather than capturing stderr.
*	bpo-36523: Add docstring to io.IOBase.writelines (GH-12683)	Marcin Niemira	2019-04-22	1	-0/+5
\|
*	closes bpo-35848: Move all documentation regarding the readinto out of ↵	Steve Palmer	2019-04-09	1	-6/+4
\| \| \| \| \| \| \|	IOBase. (GH-11893) Move all documentation regarding the readinto method into either io.RawIOBase or io.BufferedIOBase. Corresponding changes to documentation in the _pyio.py module.
*	Use names SEEK_SET, etc instead of magic number (GH-12057)	ngie-eign	2019-03-03	1	-3/+3
\| \| \| \| \| \| \|	The previous code hardcoded `SEEK_SET`, etc. While it's very unlikely that these values will change, it's best to use the definitions to avoid there being mismatches in behavior with the code in the future. Signed-off-by: Enji Cooper <yaneurabeya@gmail.com>
*	bpo-33138: Change standard error message for non-pickleable and non-copyable ↵	Serhiy Storchaka	2018-10-31	1	-3/+2
\| \| \| \|	types. (GH-6239)
*	bpo-32236: open() emits RuntimeWarning if buffering=1 for binary mode (GH-4842)	Alexey Izbyshev	2018-10-20	1	-0/+5
\| \| \| \| \| \| \| \| \|	If buffering=1 is specified for open() in binary mode, it is silently treated as buffering=-1 (i.e., the default buffer size). Coupled with the fact that line buffering is always supported in Python 2, such behavior caused several issues (e.g., bpo-10344, bpo-21332). Warn that line buffering is not supported if open() is called with binary mode and buffering=1.
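The warning added by bpo-32236 can be observed with a short sketch. `binary_line_buffering_warnings` is a hypothetical helper name; the example assumes Python 3.8 or later and uses a temporary file so it does not touch existing data.

```python
import os
import tempfile
import warnings


def binary_line_buffering_warnings():
    """Open a file in binary mode with buffering=1 and collect warnings."""
    fd, path = tempfile.mkstemp()
    os.close(fd)
    try:
        with warnings.catch_warnings(record=True) as caught:
            # Make sure the warning is not suppressed by an earlier filter.
            warnings.simplefilter("always")
            with open(path, "wb", buffering=1) as f:
                f.write(b"x")
        return [w.category for w in caught]
    finally:
        os.unlink(path)
```

The file is still opened and written normally; the call falls back to the default buffer size and only adds the warning.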
*	Remove wording that could be deemed to be perjorative (GH-9287)	Raymond Hettinger	2018-09-14	1	-1/+1
\|
*	bpo-25862: Fix assertion failures in io.TextIOWrapper.tell(). (GH-3918)	Zackery Spytz	2018-06-29	1	-0/+1
\|
*	bpo-15216: io: TextIOWrapper.reconfigure() accepts encoding, errors and ↵	INADA Naoki	2017-12-21	1	-20/+56
\| \| \| \|	newline (GH-2343)
*	bpo-17852: Revert incorrect fix based on misunderstanding of _Py_PyAtExit() ↵	Antoine Pitrou	2017-12-13	1	-24/+0
\| \| \| \|	semantics (#4826)
*	bpo-31976: Fix race condition when flushing a file is slow. (#4331)	benfogle	2017-11-10	1	-2/+17
\|