summaryrefslogtreecommitdiffstats
path: root/Doc/library/io.rst
diff options
context:
space:
mode:
authorBenjamin Peterson <benjamin@python.org>2008-04-13 02:01:27 (GMT)
committerBenjamin Peterson <benjamin@python.org>2008-04-13 02:01:27 (GMT)
commit7bb4d2d0fa13544276b09852ed17df29994879a1 (patch)
treef511d4bd14b7ff3dc3b6db730faeb37949e6feb4 /Doc/library/io.rst
parentc161df6abd77ffc39c8686b7a7117c57f056b489 (diff)
downloadcpython-7bb4d2d0fa13544276b09852ed17df29994879a1.zip
cpython-7bb4d2d0fa13544276b09852ed17df29994879a1.tar.gz
cpython-7bb4d2d0fa13544276b09852ed17df29994879a1.tar.bz2
Backported io module docs
Diffstat (limited to 'Doc/library/io.rst')
-rw-r--r--Doc/library/io.rst627
1 files changed, 627 insertions, 0 deletions
diff --git a/Doc/library/io.rst b/Doc/library/io.rst
new file mode 100644
index 0000000..192ce90
--- /dev/null
+++ b/Doc/library/io.rst
@@ -0,0 +1,627 @@
+:mod:`io` --- Core tools for working with streams
+=================================================
+
+.. module:: io
+ :synopsis: Core tools for working with streams.
+.. moduleauthor:: Guido van Rossum <guido@python.org>
+.. moduleauthor:: Mike Verdone <mike.verdone@gmail.com>
+.. moduleauthor:: Mark Russell <mark.russell@zen.co.uk>
+.. sectionauthor:: Benjamin Peterson
+.. versionadded:: 2.6
+
+The :mod:`io` module provides the Python interfaces to stream handling. The
+builtin :func:`open` function is defined in this module.
+
+At the top of the I/O hierarchy is the abstract base class :class:`IOBase`. It
+defines the basic interface to a stream. Note, however, that there is no
+seperation between reading and writing to streams; implementations are allowed
+to throw an :exc:`IOError` if they do not support a given operation.
+
+Extending :class:`IOBase` is :class:`RawIOBase` which deals simply with the
+reading and writing of raw bytes to a stream. :class:`FileIO` subclasses
+:class:`RawIOBase` to provide an interface to OS files.
+
+:class:`BufferedIOBase` deals with buffering on a raw byte stream
+(:class:`RawIOBase`). Its subclasses, :class:`BufferedWriter`,
+:class:`BufferedReader`, and :class:`BufferedRWPair` buffer streams that are
+readable, writable, and both respectively. :class:`BufferedRandom` provides a
+buffered interface to random access streams. :class:`BytesIO` is a simple
+stream of in-memory bytes.
+
+Another :class:`IOBase` subclass, :class:`TextIOBase`, deals with the encoding
+and decoding of streams into text. :class:`TextIOWrapper`, which extends it, is
+a buffered text interface to a buffered raw stream (:class:`BufferedIOBase`).
+Finally, :class:`StringIO` is a in-memory stream for text.
+
+Argument names are not part of the specification, and only the arguments of
+:func:`open()` are intended to be used as keyword arguments.
+
+
+Module Interface
+----------------
+
+.. data:: DEFAULT_BUFFER_SIZE
+
+ An int containing the default buffer size used by the module's buffered I/O
+ classes. :func:`open()` uses the file's blksize (as obtained by
+ :func:`os.stat`) if possible.
+
+.. function:: open(file[, mode[, buffering[, encoding[, errors[, newline[, closefd=True]]]]]])
+
+ Open *file* and return a stream. If the file cannot be opened, an
+ :exc:`IOError` is raised.
+
+ *file* is either a string giving the name (and the path if the file isn't in
+ the current working directory) of the file to be opened or an integer file
+ descriptor of the file to be wrapped. (If a file descriptor is given, it is
+ closed when the returned I/O object is closed, unless *closefd* is set to
+ ``False``.)
+
+ *mode* is an optional string that specifies the mode in which the file is
+ opened. It defaults to ``'r'`` which means open for reading in text mode.
+ Other common values are ``'w'`` for writing (truncating the file if it
+ already exists), and ``'a'`` for appending (which on *some* Unix systems,
+ means that *all* writes append to the end of the file regardless of the
+ current seek position). In text mode, if *encoding* is not specified the
+ encoding used is platform dependent. (For reading and writing raw bytes use
+ binary mode and leave *encoding* unspecified.) The available modes are:
+
+ ========= ===============================================================
+ Character Meaning
+ --------- ---------------------------------------------------------------
+ ``'r'`` open for reading (default)
+ ``'w'`` open for writing, truncating the file first
+ ``'a'`` open for writing, appending to the end of the file if it exists
+ ``'b'`` binary mode
+ ``'t'`` text mode (default)
+ ``'+'`` open a disk file for updating (reading and writing)
+ ``'U'`` universal newline mode (for backwards compatibility; unneeded
+ for new code)
+ ========= ===============================================================
+
+ The default mode is ``'rt'`` (open for reading text). For binary random
+ access, the mode ``'w+b'`` opens and truncates the file to 0 bytes, while
+ ``'r+b'`` opens the file without truncation.
+
+ Python distinguishes between files opened in binary and text modes, even when
+ the underlying operating system doesn't. Files opened in binary mode
+ (appending ``'b'`` to the *mode* argument) return contents as ``bytes``
+ objects without any decoding. In text mode (the default, or when ``'t'`` is
+ appended to the *mode* argument), the contents of the file are returned as
+ strings, the bytes having been first decoded using a platform-dependent
+ encoding or using the specified *encoding* if given.
+
+ *buffering* is an optional integer used to set the buffering policy. By
+ default full buffering is on. Pass 0 to switch buffering off (only allowed
+ in binary mode), 1 to set line buffering, and an integer > 1 for full
+ buffering.
+
+ *encoding* is the name of the encoding used to decode or encode the file.
+ This should only be used in text mode. The default encoding is platform
+ dependent, but any encoding supported by Python can be passed. See the
+ :mod:`codecs` module for the list of supported encodings.
+
+ *errors* is an optional string that specifies how encoding errors are to be
+ handled---this argument should not be used in binary mode. Pass ``'strict'``
+ to raise a :exc:`ValueError` exception if there is an encoding error (the
+ default of ``None`` has the same effect), or pass ``'ignore'`` to ignore
+ errors. (Note that ignoring encoding errors can lead to data loss.) See the
+ documentation for :func:`codecs.register` for a list of the permitted
+ encoding error strings.
+
+ *newline* controls how universal newlines works (it only applies to text
+ mode). It can be ``None``, ``''``, ``'\n'``, ``'\r'``, and ``'\r\n'``. It
+ works as follows:
+
+ * On input, if *newline* is ``None``, universal newlines mode is enabled.
+ Lines in the input can end in ``'\n'``, ``'\r'``, or ``'\r\n'``, and these
+ are translated into ``'\n'`` before being returned to the caller. If it is
+ ``''``, universal newline mode is enabled, but line endings are returned to
+ the caller untranslated. If it has any of the other legal values, input
+ lines are only terminated by the given string, and the line ending is
+ returned to the caller untranslated.
+
+ * On output, if *newline* is ``None``, any ``'\n'`` characters written are
+ translated to the system default line separator, :data:`os.linesep`. If
+ *newline* is ``''``, no translation takes place. If *newline* is any of
+ the other legal values, any ``'\n'`` characters written are translated to
+ the given string.
+
+ If *closefd* is ``False``, the underlying file descriptor will be kept open
+ when the file is closed. This does not work when a file name is given and
+ must be ``True`` in that case.
+
+ :func:`open()` returns a file object whose type depends on the mode, and
+ through which the standard file operations such as reading and writing are
+ performed. When :func:`open()` is used to open a file in a text mode
+ (``'w'``, ``'r'``, ``'wt'``, ``'rt'``, etc.), it returns a
+ :class:`TextIOWrapper`. When used to open a file in a binary mode, the
+ returned class varies: in read binary mode, it returns a
+ :class:`BufferedReader`; in write binary and append binary modes, it returns
+ a :class:`BufferedWriter`, and in read/write mode, it returns a
+ :class:`BufferedRandom`.
+
+ It is also possible to use a string or bytearray as a file for both reading
+ and writing. For strings :class:`StringIO` can be used like a file opened in
+ a text mode, and for bytes a :class:`BytesIO` can be used like a file opened
+ in a binary mode.
+
+
+.. exception:: BlockingIOError
+
+ Error raised when blocking would occur on a non-blocking stream. It inherits
+ :exc:`IOError`.
+
+ In addition to those of :exc:`IOError`, :exc:`BlockingIOError` has one
+ attribute:
+
+ .. attribute:: characters_written
+
+ An integer containing the number of characters written to the stream
+ before it blocked.
+
+
+.. exception:: UnsupportedOperation
+
+ An exception inheriting :exc:`IOError` and :exc:`ValueError` that is raised
+ when an unsupported operation is called on a stream.
+
+
+I/O Base Classes
+----------------
+
+.. class:: IOBase
+
+ The abstract base class for all I/O classes, acting on streams of bytes.
+ There is no public constructor.
+
+ This class provides dummy implementations for many methods that derived
+ classes can override selectively; the default implementations represent a
+ file that cannot be read, written or seeked.
+
+ Even though :class:`IOBase` does not declare :meth:`read`, :meth:`readinto`,
+ or :meth:`write` because their signatures will vary, implementations and
+ clients should consider those methods part of the interface. Also,
+ implementations may raise a :exc:`IOError` when operations they do not
+ support are called.
+
+ The basic type used for binary data read from or written to a file is
+ :class:`bytes`. :class:`bytearray`\s are accepted too, and in some cases
+ (such as :class:`readinto`) needed. Text I/O classes work with :class:`str`
+ data.
+
+ Note that calling any method (even inquiries) on a closed stream is
+ undefined. Implementations may raise :exc:`IOError` in this case.
+
+ IOBase (and its subclasses) support the iterator protocol, meaning that an
+ :class:`IOBase` object can be iterated over yielding the lines in a stream.
+
+ IOBase also supports the :keyword:`with` statement. In this example, *fp* is
+ closed after the suite of the with statment is complete::
+
+ with open('spam.txt', 'r') as fp:
+ fp.write('Spam and eggs!')
+
+ :class:`IOBase` provides these methods:
+
+ .. method:: close()
+
+ Flush and close this stream. This method has no effect if the file is
+ already closed.
+
+ .. attribute:: closed
+
+ True if the stream is closed.
+
+ .. method:: fileno()
+
+ Return the underlying file descriptor (an integer) of the stream, if it
+ exists. An :exc:`IOError` is raised if the IO object does not use a file
+ descriptor.
+
+ .. method:: flush()
+
+ Flush the write buffers of the stream if applicable. This is not
+ implemented for read-only and non-blocking streams.
+
+ .. method:: isatty()
+
+ Tell if a stream is interactive (connected to a terminal/tty device).
+
+ .. method:: readable()
+
+ Tell if a stream can be read from. If False, :meth:`read` will raise
+ :exc:`IOError`.
+
+ .. method:: readline([limit])
+
+ Read and return a line from the stream. If *limit* is specified, at most
+ *limit* bytes will be read.
+
+ The line terminator is always ``b'\n'`` for binary files; for text files,
+ the *newlines* argument to :func:`.open()` can be used to select the line
+ terminator(s) recognized.
+
+ .. method:: readlines([hint])
+
+ Return a list of lines from the stream. *hint* can be specified to
+ control the number of lines read: no more lines will be read if the total
+ size (in bytes/characters) of all lines so far exceeds *hint*.
+
+ .. method:: seek(offset[, whence])
+
+ Change the stream position to byte offset *offset*. *offset* is
+ interpreted relative to the position indicated by *whence*. Values for
+ *whence* are:
+
+ * ``0`` -- start of stream (the default); *pos* should be zero or positive
+ * ``1`` -- current stream position; *pos* may be negative
+ * ``2`` -- end of stream; *pos* is usually negative
+
+ Return the new absolute position.
+
+ .. method:: seekable()
+
+ Tell if a stream supports random IO access. If ``False``, :meth:`seek`,
+ :meth:`tell` and :meth:`truncate` will raise :exc:`IOError`.
+
+ .. method:: tell()
+
+ Return an integer indicating the current stream position.
+
+ .. method:: truncate([pos])
+
+ Truncate the file to at most *pos* bytes. *pos* defaults to the current
+ file position, as returned by :meth:`tell`.
+
+ .. method:: writable()
+
+ Tell if a stream supports writing. If ``False``, :meth:`write` and
+ :meth:`truncate` will raise :exc:`IOError`.
+
+ .. method:: writelines(lines)
+
+ Write a list of lines to the stream. The lines will not be altered; they
+ must contain line separators.
+
+
+.. class:: RawIOBase
+
+ Base class for raw binary I/O. It inherits :class:`IOBase`. There is no
+ public constructor.
+
+ RawIOBase provides or overrides these methods in addition to those from
+ :class:`IOBase`:
+
+ .. method:: read([n])
+
+ Read and return all bytes from the stream until EOF, or if *n* is
+ specified, up to *n* bytes. An empty bytes object is returned on EOF;
+ ``None`` is returned if the object is set not to block and has no data to
+ read.
+
+ .. method:: readall()
+
+ Read and return all bytes from the stream until EOF, using multiple calls
+ to the stream.
+
+ .. method:: readinto(b)
+
+ Read up to len(b) bytes into bytearray *b* and return the number of bytes
+ read.
+
+ .. method:: write(b)
+
+ Write the given bytes, *b*, to the underlying raw stream and return the
+ number of bytes written (never less than ``len(b)``).
+
+
+Raw File I/O
+------------
+
+.. class:: FileIO(name[, mode])
+
+ :class:`FileIO` represents an OS file containing bytes data. It implements
+ the :class:`RawIOBase` interface (and therefore the :class:`IOBase`
+ interface, too).
+
+ The *mode* can be ``'r'``, ``'w'`` or ``'a'`` for reading (default), writing,
+ or appending. The file will be created if it doesn't exist when opened for
+ writing or appending; it will be truncated when opened for writing. Add a
+ ``'+'`` to the mode to allow simultaneous reading and writing.
+
+ :class:`FileIO` provides or overrides these methods in addition to those from
+ :class:`RawIOBase` and :class:`IOBase`:
+
+ .. attribute:: mode
+
+ The mode as given in the constructor.
+
+ .. attribute:: name
+
+ The file name.
+
+ .. method:: read([n])
+
+ Read and return bytes at most *n* bytes. Only one system call is made, so
+ less data than requested may be returned. In non-blocking mode, ``None``
+ is returned when no data is available.
+
+ .. method:: readall()
+
+ Read and return as bytes all the data from the file. As much as
+ immediately available is returned in non-blocking mode. If the EOF has
+ been reached, ``b''`` is returned.
+
+ .. method:: readinto(bytearray)
+
+ This method should not be used on :class:`FileIO` objects.
+
+ .. method:: write(b)
+
+ Write the bytes *b* to the file, and return the number actually written.
+ Only one system call is made, so not all of the data may be written.
+
+
+Buffered Streams
+----------------
+
+.. class:: BufferedIOBase
+
+ Base class for streams that support buffering. It inherits :class:`IOBase`.
+ There is no public constructor.
+
+ The main difference with :class:`RawIOBase` is that the :meth:`read` method
+ supports omitting the *size* argument, and does not have a default
+ implementation that defers to :meth:`readinto`.
+
+ In addition, :meth:`read`, :meth:`readinto`, and :meth:`write` may raise
+ :exc:`BlockingIOError` if the underlying raw stream is in non-blocking mode
+ and not ready; unlike their raw counterparts, they will never return
+ ``None``.
+
+ A typical implementation should not inherit from a :class:`RawIOBase`
+ implementation, but wrap one like :class:`BufferedWriter` and
+ :class:`BufferedReader`.
+
+ :class:`BufferedIOBase` provides or overrides these methods in addition to
+ those from :class:`IOBase`:
+
+ .. method:: read([n])
+
+ Read and return up to *n* bytes. If the argument is omitted, ``None``, or
+ negative, data is read and returned until EOF is reached. An empty bytes
+ object is returned if the stream is already at EOF.
+
+ If the argument is positive, and the underlying raw stream is not
+ interactive, multiple raw reads may be issued to satisfy the byte count
+ (unless EOF is reached first). But for interactive raw streams, at most
+ one raw read will be issued, and a short result does not imply that EOF is
+ imminent.
+
+ A :exc:`BlockingIOError` is raised if the underlying raw stream has no
+ data at the moment.
+
+ .. method:: readinto(b)
+
+ Read up to len(b) bytes into bytearray *b* and return the number of bytes
+ read.
+
+ Like :meth:`read`, multiple reads may be issued to the underlying raw
+ stream, unless the latter is 'interactive.'
+
+ A :exc:`BlockingIOError` is raised if the underlying raw stream has no
+ data at the moment.
+
+ .. method:: write(b)
+
+ Write the given bytes, *b*, to the underlying raw stream and return the
+ number of bytes written (never less than ``len(b)``).
+
+ A :exc:`BlockingIOError` is raised if the buffer is full, and the
+ underlying raw stream cannot accept more data at the moment.
+
+
+.. class:: BytesIO([initial_bytes])
+
+ A stream implementation using an in-memory bytes buffer. It inherits
+ :class:`BufferedIOBase`.
+
+ The argument *initial_bytes* is an optional initial bytearray.
+
+ :class:`BytesIO` provides or overrides these methods in addition to those
+ from :class:`BufferedIOBase` and :class:`IOBase`:
+
+ .. method:: getvalue()
+
+ Return the bytes value of the buffer.
+
+ .. method:: read1()
+
+ In :class:`BytesIO`, this is the same as :meth:`read()`.
+
+ .. method:: truncate([pos])
+
+ Truncate the file to at most *pos* bytes. *pos* defaults to the current
+ stream position, as returned by :meth:`tell()`.
+
+
+.. class:: BufferedReader(raw[, buffer_size])
+
+ A buffer for a readable, sequential :class:`BaseRawIO` object. It inherits
+ :class:`BufferedIOBase`.
+
+ The constructor creates a :class:`BufferedReader` for the given readable
+ *raw* stream and *buffer_size*. If *buffer_size* is omitted,
+ :data:`DEFAULT_BUFFER_SIZE` is used.
+
+ :class:`BufferedReader` provides or overrides these methods in addition to
+ those from :class:`BufferedIOBase` and :class:`IOBase`:
+
+ .. method:: peek([n])
+
+ Return bytes from a buffer without advancing the position. The argument
+ indicates a desired minimal number of bytes; only one read on the raw
+ stream is done to satisfy it. More than the buffer's size is never
+ returned.
+
+ .. method:: read([n])
+
+ Read and return *n* bytes, or if *n* is not given or negative, until EOF
+ or if the read call would block in non-blocking mode.
+
+ .. method:: read1(n)
+
+ Read and return up to *n* bytes with only one call on the raw stream. If
+ at least one byte is buffered, only buffered bytes are returned.
+ Otherwise, one raw stream read call is made.
+
+
+.. class:: BufferedWriter(raw[, buffer_size[, max_buffer_size]])
+
+ A buffer for a writeable sequential RawIO object. It inherits
+ :class:`BufferedIOBase`.
+
+ The constructor creates a :class:`BufferedWriter` for the given writeable
+ *raw* stream. If the *buffer_size* is not given, it defaults to
+ :data:`DEAFULT_BUFFER_SIZE`. If *max_buffer_size* is omitted, it defaults to
+ twice the buffer size.
+
+ :class:`BufferedWriter` provides or overrides these methods in addition to
+ those from :class:`BufferedIOBase` and :class:`IOBase`:
+
+ .. method:: flush()
+
+ Force bytes held in the buffer into the raw stream. A
+ :exc:`BlockingIOError` is be raised if the raw stream blocks.
+
+ .. method:: write(b)
+
+ Write bytes *b* onto the raw stream and return the number written. A
+ :exc:`BlockingIOError` is raised when the raw stream blocks.
+
+
+.. class:: BufferedRWPair(reader, writer[, buffer_size[, max_buffer_size]])
+
+ A buffered writer and reader object together for a raw stream that can be
+ written and read from. It has and supports both :meth:`read`, :meth:`write`,
+ and their variants. This is useful for such applications such as sockets and
+ two-way pipes. It inherits :class:`BufferedIOBase`.
+
+ *reader* and *writer* are :class:`RawIOBase` objects that are readable and
+ writeable respectively. If the *buffer_size* is omitted it defaults to
+ :data:`DEFAULT_BUFFER_SIZE`. The *max_buffer_size* (for the buffered writer)
+ defaults to twice the buffer size.
+
+ :class:`BufferedRWPair` implements all of :class:`BufferedIOBase`\'s methods.
+
+
+.. class:: BufferedRandom(raw[, buffer_size[, max_buffer_size]])
+
+ A buffered interface to random access streams. It inherits
+ :class:`BufferedReader` and :class:`BufferedWriter`.
+
+ The constructor creates a reader and writer for a seekable raw stream, given
+ in the first argument. If the *buffer_size* is omitted it defaults to
+ :data:`DEFAULT_BUFFER_SIZE`. The *max_buffer_size* (for the buffered writer)
+ defaults to twice the buffer size.
+
+ :class:`BufferedRandom` is capable of anything :class:`BufferedReader` or
+ :class:`BufferedWriter` can do.
+
+
+Text I/O
+--------
+
+.. class:: TextIOBase
+
+ Base class for text streams. This class provides a character and line based
+ interface to stream I/O. There is no :meth:`readinto` method because
+ Python's character strings are immutable. It inherits :class:`IOBase`.
+ There is no public constructor.
+
+ :class:`TextIOBase` provides or overrides these methods in addition to those
+ from :class:`IOBase`:
+
+ .. attribute:: encoding
+
+ Return the name of the encoding used to decode the stream's bytes into
+ strings, and to encode strings into bytes.
+
+ .. attribute:: newlines
+
+ Return a string, tuple of strings, or ``None`` indicating the newlines
+ translated so far.
+
+ .. method:: read(n)
+
+ Read and return at most *n* characters from the stream. If *n* is
+ negative or ``None``, read to EOF.
+
+ .. method:: readline()
+
+ Read until newline or EOF and return. If the stream is already at EOF, an
+ empty stream is returned.
+
+ .. method:: write(s)
+
+ Write string *s* to the stream and return the number of characters
+ written.
+
+
+.. class:: TextIOWrapper(buffer[, encoding[, errors[, newline[, line_buffering]]]])
+
+ A buffered text stream over a :class:`BufferedIOBase` raw stream, *buffer*.
+ It inherits :class:`TextIOBase`.
+
+ *encoding* gives the name of the encoding that the stream will be decoded or
+ encoded with. It defaults to :func:`locale.getpreferredencoding`.
+
+ *errors* determines the strictness of encoding and decoding (see the errors
+ argument of :func:`codecs.register`) and defaults to ``'strict'``.
+
+ *newline* can be ``None``, ``''``, ``'\n'``, ``'\r'``, or ``'\r\n'``. It
+ controls the handling of line endings. If it is ``None``, universal newlines
+ is enabled. With this enabled, on input, the lines endings ``'\n'``,
+ ``'\r'``, or ``'\r\n'`` are translated to ``'\n'`` before being returned to
+ the caller. Conversely, on output, ``'\n'`` is translated to the system
+ default line seperator, :data:`os.linesep`. If *newline* is any other of its
+ legal values, that newline becomes the newline when the file is read and it
+ is returned untranslated. On output, ``'\n'`` is converted to the *newline*.
+
+ If *line_buffering* is ``True``, :meth:`flush` is implied when a call to
+ write contains a newline character.
+
+ :class:`TextIOWrapper` provides these methods in addition to those of
+ :class:`TextIOBase` and its parents:
+
+ .. attribute:: errors
+
+ The encoding and decoding error setting.
+
+ .. attribute:: line_buffering
+
+ Whether line buffering is enabled.
+
+
+.. class:: StringIO([initial_value[, encoding[, errors[, newline]]]])
+
+ An in-memory stream for text. It in inherits :class:`TextIOWrapper`.
+
+ Create a new StringIO stream with an inital value, encoding, error handling,
+ and newline setting. See :class:`TextIOWrapper`\'s constructor for more
+ information.
+
+ :class:`StringIO` provides these methods in addition to those from
+ :class:`TextIOWrapper` and its parents:
+
+ .. method:: getvalue()
+
+ Return a str representation of the contents of the internal buffer.
+
+
+.. class:: IncrementalNewlineDecoder
+
+ A helper codec that decodes newlines for universal newlines mode. It
+ inherits :class:`codecs.IncrementalDecoder`.
+