summaryrefslogtreecommitdiffstats
path: root/Doc/whatsnew/2.3.rst
diff options
context:
space:
mode:
authorGeorg Brandl <georg@python.org>2007-08-15 14:28:22 (GMT)
committerGeorg Brandl <georg@python.org>2007-08-15 14:28:22 (GMT)
commit116aa62bf54a39697e25f21d6cf6799f7faa1349 (patch)
tree8db5729518ed4ca88e26f1e26cc8695151ca3eb3 /Doc/whatsnew/2.3.rst
parent739c01d47b9118d04e5722333f0e6b4d0c8bdd9e (diff)
downloadcpython-116aa62bf54a39697e25f21d6cf6799f7faa1349.zip
cpython-116aa62bf54a39697e25f21d6cf6799f7faa1349.tar.gz
cpython-116aa62bf54a39697e25f21d6cf6799f7faa1349.tar.bz2
Move the 3k reST doc tree in place.
Diffstat (limited to 'Doc/whatsnew/2.3.rst')
-rw-r--r--Doc/whatsnew/2.3.rst2084
1 files changed, 2084 insertions, 0 deletions
diff --git a/Doc/whatsnew/2.3.rst b/Doc/whatsnew/2.3.rst
new file mode 100644
index 0000000..7dd4930
--- /dev/null
+++ b/Doc/whatsnew/2.3.rst
@@ -0,0 +1,2084 @@
+****************************
+ What's New in Python 2.3
+****************************
+
+:Author: A.M. Kuchling
+
+.. |release| replace:: 1.01
+
+.. % $Id: whatsnew23.tex 55005 2007-04-27 19:54:29Z guido.van.rossum $
+
+This article explains the new features in Python 2.3. Python 2.3 was released
+on July 29, 2003.
+
+The main themes for Python 2.3 are polishing some of the features added in 2.2,
+adding various small but useful enhancements to the core language, and expanding
+the standard library. The new object model introduced in the previous version
+has benefited from 18 months of bugfixes and from optimization efforts that have
+improved the performance of new-style classes. A few new built-in functions
+have been added such as :func:`sum` and :func:`enumerate`. The :keyword:`in`
+operator can now be used for substring searches (e.g. ``"ab" in "abc"`` returns
+:const:`True`).
+
+Some of the many new library features include Boolean, set, heap, and date/time
+data types, the ability to import modules from ZIP-format archives, metadata
+support for the long-awaited Python catalog, an updated version of IDLE, and
+modules for logging messages, wrapping text, parsing CSV files, processing
+command-line options, using BerkeleyDB databases... the list of new and
+enhanced modules is lengthy.
+
+This article doesn't attempt to provide a complete specification of the new
+features, but instead provides a convenient overview. For full details, you
+should refer to the documentation for Python 2.3, such as the Python Library
+Reference and the Python Reference Manual. If you want to understand the
+complete implementation and design rationale, refer to the PEP for a particular
+new feature.
+
+.. % ======================================================================
+
+
+PEP 218: A Standard Set Datatype
+================================
+
+The new :mod:`sets` module contains an implementation of a set datatype. The
+:class:`Set` class is for mutable sets, sets that can have members added and
+removed. The :class:`ImmutableSet` class is for sets that can't be modified,
+and instances of :class:`ImmutableSet` can therefore be used as dictionary keys.
+Sets are built on top of dictionaries, so the elements within a set must be
+hashable.
+
+Here's a simple example::
+
+ >>> import sets
+ >>> S = sets.Set([1,2,3])
+ >>> S
+ Set([1, 2, 3])
+ >>> 1 in S
+ True
+ >>> 0 in S
+ False
+ >>> S.add(5)
+ >>> S.remove(3)
+ >>> S
+ Set([1, 2, 5])
+ >>>
+
+The union and intersection of sets can be computed with the :meth:`union` and
+:meth:`intersection` methods; an alternative notation uses the bitwise operators
+``&`` and ``|``. Mutable sets also have in-place versions of these methods,
+:meth:`union_update` and :meth:`intersection_update`. ::
+
+ >>> S1 = sets.Set([1,2,3])
+ >>> S2 = sets.Set([4,5,6])
+ >>> S1.union(S2)
+ Set([1, 2, 3, 4, 5, 6])
+ >>> S1 | S2 # Alternative notation
+ Set([1, 2, 3, 4, 5, 6])
+ >>> S1.intersection(S2)
+ Set([])
+ >>> S1 & S2 # Alternative notation
+ Set([])
+ >>> S1.union_update(S2)
+ >>> S1
+ Set([1, 2, 3, 4, 5, 6])
+ >>>
+
+It's also possible to take the symmetric difference of two sets. This is the
+set of all elements in the union that aren't in the intersection. Another way
+of putting it is that the symmetric difference contains all elements that are in
+exactly one set. Again, there's an alternative notation (``^``), and an in-
+place version with the ungainly name :meth:`symmetric_difference_update`. ::
+
+ >>> S1 = sets.Set([1,2,3,4])
+ >>> S2 = sets.Set([3,4,5,6])
+ >>> S1.symmetric_difference(S2)
+ Set([1, 2, 5, 6])
+ >>> S1 ^ S2
+ Set([1, 2, 5, 6])
+ >>>
+
+There are also :meth:`issubset` and :meth:`issuperset` methods for checking
+whether one set is a subset or superset of another::
+
+ >>> S1 = sets.Set([1,2,3])
+ >>> S2 = sets.Set([2,3])
+ >>> S2.issubset(S1)
+ True
+ >>> S1.issubset(S2)
+ False
+ >>> S1.issuperset(S2)
+ True
+ >>>
+
+
+.. seealso::
+
+ :pep:`218` - Adding a Built-In Set Object Type
+ PEP written by Greg V. Wilson. Implemented by Greg V. Wilson, Alex Martelli, and
+ GvR.
+
+.. % ======================================================================
+
+
+.. _section-generators:
+
+PEP 255: Simple Generators
+==========================
+
+In Python 2.2, generators were added as an optional feature, to be enabled by a
+``from __future__ import generators`` directive. In 2.3 generators no longer
+need to be specially enabled, and are now always present; this means that
+:keyword:`yield` is now always a keyword. The rest of this section is a copy of
+the description of generators from the "What's New in Python 2.2" document; if
+you read it back when Python 2.2 came out, you can skip the rest of this
+section.
+
+You're doubtless familiar with how function calls work in Python or C. When you
+call a function, it gets a private namespace where its local variables are
+created. When the function reaches a :keyword:`return` statement, the local
+variables are destroyed and the resulting value is returned to the caller. A
+later call to the same function will get a fresh new set of local variables.
+But, what if the local variables weren't thrown away on exiting a function?
+What if you could later resume the function where it left off? This is what
+generators provide; they can be thought of as resumable functions.
+
+Here's the simplest example of a generator function::
+
+ def generate_ints(N):
+ for i in range(N):
+ yield i
+
+A new keyword, :keyword:`yield`, was introduced for generators. Any function
+containing a :keyword:`yield` statement is a generator function; this is
+detected by Python's bytecode compiler which compiles the function specially as
+a result.
+
+When you call a generator function, it doesn't return a single value; instead it
+returns a generator object that supports the iterator protocol. On executing
+the :keyword:`yield` statement, the generator outputs the value of ``i``,
+similar to a :keyword:`return` statement. The big difference between
+:keyword:`yield` and a :keyword:`return` statement is that on reaching a
+:keyword:`yield` the generator's state of execution is suspended and local
+variables are preserved. On the next call to the generator's ``.next()``
+method, the function will resume executing immediately after the
+:keyword:`yield` statement. (For complicated reasons, the :keyword:`yield`
+statement isn't allowed inside the :keyword:`try` block of a :keyword:`try`...\
+:keyword:`finally` statement; read :pep:`255` for a full explanation of the
+interaction between :keyword:`yield` and exceptions.)
+
+Here's a sample usage of the :func:`generate_ints` generator::
+
+ >>> gen = generate_ints(3)
+ >>> gen
+ <generator object at 0x8117f90>
+ >>> gen.next()
+ 0
+ >>> gen.next()
+ 1
+ >>> gen.next()
+ 2
+ >>> gen.next()
+ Traceback (most recent call last):
+ File "stdin", line 1, in ?
+ File "stdin", line 2, in generate_ints
+ StopIteration
+
+You could equally write ``for i in generate_ints(5)``, or ``a,b,c =
+generate_ints(3)``.
+
+Inside a generator function, the :keyword:`return` statement can only be used
+without a value, and signals the end of the procession of values; afterwards the
+generator cannot return any further values. :keyword:`return` with a value, such
+as ``return 5``, is a syntax error inside a generator function. The end of the
+generator's results can also be indicated by raising :exc:`StopIteration`
+manually, or by just letting the flow of execution fall off the bottom of the
+function.
+
+You could achieve the effect of generators manually by writing your own class
+and storing all the local variables of the generator as instance variables. For
+example, returning a list of integers could be done by setting ``self.count`` to
+0, and having the :meth:`next` method increment ``self.count`` and return it.
+However, for a moderately complicated generator, writing a corresponding class
+would be much messier. :file:`Lib/test/test_generators.py` contains a number of
+more interesting examples. The simplest one implements an in-order traversal of
+a tree using generators recursively. ::
+
+ # A recursive generator that generates Tree leaves in in-order.
+ def inorder(t):
+ if t:
+ for x in inorder(t.left):
+ yield x
+ yield t.label
+ for x in inorder(t.right):
+ yield x
+
+Two other examples in :file:`Lib/test/test_generators.py` produce solutions for
+the N-Queens problem (placing $N$ queens on an $NxN$ chess board so that no
+queen threatens another) and the Knight's Tour (a route that takes a knight to
+every square of an $NxN$ chessboard without visiting any square twice).
+
+The idea of generators comes from other programming languages, especially Icon
+(http://www.cs.arizona.edu/icon/), where the idea of generators is central. In
+Icon, every expression and function call behaves like a generator. One example
+from "An Overview of the Icon Programming Language" at
+http://www.cs.arizona.edu/icon/docs/ipd266.htm gives an idea of what this looks
+like::
+
+ sentence := "Store it in the neighboring harbor"
+ if (i := find("or", sentence)) > 5 then write(i)
+
+In Icon the :func:`find` function returns the indexes at which the substring
+"or" is found: 3, 23, 33. In the :keyword:`if` statement, ``i`` is first
+assigned a value of 3, but 3 is less than 5, so the comparison fails, and Icon
+retries it with the second value of 23. 23 is greater than 5, so the comparison
+now succeeds, and the code prints the value 23 to the screen.
+
+Python doesn't go nearly as far as Icon in adopting generators as a central
+concept. Generators are considered part of the core Python language, but
+learning or using them isn't compulsory; if they don't solve any problems that
+you have, feel free to ignore them. One novel feature of Python's interface as
+compared to Icon's is that a generator's state is represented as a concrete
+object (the iterator) that can be passed around to other functions or stored in
+a data structure.
+
+
+.. seealso::
+
+ :pep:`255` - Simple Generators
+ Written by Neil Schemenauer, Tim Peters, Magnus Lie Hetland. Implemented mostly
+ by Neil Schemenauer and Tim Peters, with other fixes from the Python Labs crew.
+
+.. % ======================================================================
+
+
+.. _section-encodings:
+
+PEP 263: Source Code Encodings
+==============================
+
+Python source files can now be declared as being in different character set
+encodings. Encodings are declared by including a specially formatted comment in
+the first or second line of the source file. For example, a UTF-8 file can be
+declared with::
+
+ #!/usr/bin/env python
+ # -*- coding: UTF-8 -*-
+
+Without such an encoding declaration, the default encoding used is 7-bit ASCII.
+Executing or importing modules that contain string literals with 8-bit
+characters and have no encoding declaration will result in a
+:exc:`DeprecationWarning` being signalled by Python 2.3; in 2.4 this will be a
+syntax error.
+
+The encoding declaration only affects Unicode string literals, which will be
+converted to Unicode using the specified encoding. Note that Python identifiers
+are still restricted to ASCII characters, so you can't have variable names that
+use characters outside of the usual alphanumerics.
+
+
+.. seealso::
+
+ :pep:`263` - Defining Python Source Code Encodings
+ Written by Marc-André Lemburg and Martin von Löwis; implemented by Suzuki Hisao
+ and Martin von Löwis.
+
+.. % ======================================================================
+
+
+PEP 273: Importing Modules from ZIP Archives
+============================================
+
+The new :mod:`zipimport` module adds support for importing modules from a ZIP-
+format archive. You don't need to import the module explicitly; it will be
+automatically imported if a ZIP archive's filename is added to ``sys.path``.
+For example::
+
+ amk@nyman:~/src/python$ unzip -l /tmp/example.zip
+ Archive: /tmp/example.zip
+ Length Date Time Name
+ -------- ---- ---- ----
+ 8467 11-26-02 22:30 jwzthreading.py
+ -------- -------
+ 8467 1 file
+ amk@nyman:~/src/python$ ./python
+ Python 2.3 (#1, Aug 1 2003, 19:54:32)
+ >>> import sys
+ >>> sys.path.insert(0, '/tmp/example.zip') # Add .zip file to front of path
+ >>> import jwzthreading
+ >>> jwzthreading.__file__
+ '/tmp/example.zip/jwzthreading.py'
+ >>>
+
+An entry in ``sys.path`` can now be the filename of a ZIP archive. The ZIP
+archive can contain any kind of files, but only files named :file:`\*.py`,
+:file:`\*.pyc`, or :file:`\*.pyo` can be imported. If an archive only contains
+:file:`\*.py` files, Python will not attempt to modify the archive by adding the
+corresponding :file:`\*.pyc` file, meaning that if a ZIP archive doesn't contain
+:file:`\*.pyc` files, importing may be rather slow.
+
+A path within the archive can also be specified to only import from a
+subdirectory; for example, the path :file:`/tmp/example.zip/lib/` would only
+import from the :file:`lib/` subdirectory within the archive.
+
+
+.. seealso::
+
+ :pep:`273` - Import Modules from Zip Archives
+ Written by James C. Ahlstrom, who also provided an implementation. Python 2.3
+ follows the specification in :pep:`273`, but uses an implementation written by
+ Just van Rossum that uses the import hooks described in :pep:`302`. See section
+ :ref:`section-pep302` for a description of the new import hooks.
+
+.. % ======================================================================
+
+
+PEP 277: Unicode file name support for Windows NT
+=================================================
+
+On Windows NT, 2000, and XP, the system stores file names as Unicode strings.
+Traditionally, Python has represented file names as byte strings, which is
+inadequate because it renders some file names inaccessible.
+
+Python now allows using arbitrary Unicode strings (within the limitations of the
+file system) for all functions that expect file names, most notably the
+:func:`open` built-in function. If a Unicode string is passed to
+:func:`os.listdir`, Python now returns a list of Unicode strings. A new
+function, :func:`os.getcwdu`, returns the current directory as a Unicode string.
+
+Byte strings still work as file names, and on Windows Python will transparently
+convert them to Unicode using the ``mbcs`` encoding.
+
+Other systems also allow Unicode strings as file names but convert them to byte
+strings before passing them to the system, which can cause a :exc:`UnicodeError`
+to be raised. Applications can test whether arbitrary Unicode strings are
+supported as file names by checking :attr:`os.path.supports_unicode_filenames`,
+a Boolean value.
+
+Under MacOS, :func:`os.listdir` may now return Unicode filenames.
+
+
+.. seealso::
+
+ :pep:`277` - Unicode file name support for Windows NT
+ Written by Neil Hodgson; implemented by Neil Hodgson, Martin von Löwis, and Mark
+ Hammond.
+
+.. % ======================================================================
+
+
+PEP 278: Universal Newline Support
+==================================
+
+The three major operating systems used today are Microsoft Windows, Apple's
+Macintosh OS, and the various Unix derivatives. A minor irritation of cross-
+platform work is that these three platforms all use different characters to
+mark the ends of lines in text files. Unix uses the linefeed (ASCII character
+10), MacOS uses the carriage return (ASCII character 13), and Windows uses a
+two-character sequence of a carriage return plus a newline.
+
+Python's file objects can now support end of line conventions other than the one
+followed by the platform on which Python is running. Opening a file with the
+mode ``'U'`` or ``'rU'`` will open a file for reading in universal newline mode.
+All three line ending conventions will be translated to a ``'\n'`` in the
+strings returned by the various file methods such as :meth:`read` and
+:meth:`readline`.
+
+Universal newline support is also used when importing modules and when executing
+a file with the :func:`execfile` function. This means that Python modules can
+be shared between all three operating systems without needing to convert the
+line-endings.
+
+This feature can be disabled when compiling Python by specifying the
+:option:`--without-universal-newlines` switch when running Python's
+:program:`configure` script.
+
+
+.. seealso::
+
+ :pep:`278` - Universal Newline Support
+ Written and implemented by Jack Jansen.
+
+.. % ======================================================================
+
+
+.. _section-enumerate:
+
+PEP 279: enumerate()
+====================
+
+A new built-in function, :func:`enumerate`, will make certain loops a bit
+clearer. ``enumerate(thing)``, where *thing* is either an iterator or a
+sequence, returns a iterator that will return ``(0, thing[0])``, ``(1,
+thing[1])``, ``(2, thing[2])``, and so forth.
+
+A common idiom to change every element of a list looks like this::
+
+ for i in range(len(L)):
+ item = L[i]
+ # ... compute some result based on item ...
+ L[i] = result
+
+This can be rewritten using :func:`enumerate` as::
+
+ for i, item in enumerate(L):
+ # ... compute some result based on item ...
+ L[i] = result
+
+
+.. seealso::
+
+ :pep:`279` - The enumerate() built-in function
+ Written and implemented by Raymond D. Hettinger.
+
+.. % ======================================================================
+
+
+PEP 282: The logging Package
+============================
+
+A standard package for writing logs, :mod:`logging`, has been added to Python
+2.3. It provides a powerful and flexible mechanism for generating logging
+output which can then be filtered and processed in various ways. A
+configuration file written in a standard format can be used to control the
+logging behavior of a program. Python includes handlers that will write log
+records to standard error or to a file or socket, send them to the system log,
+or even e-mail them to a particular address; of course, it's also possible to
+write your own handler classes.
+
+The :class:`Logger` class is the primary class. Most application code will deal
+with one or more :class:`Logger` objects, each one used by a particular
+subsystem of the application. Each :class:`Logger` is identified by a name, and
+names are organized into a hierarchy using ``.`` as the component separator.
+For example, you might have :class:`Logger` instances named ``server``,
+``server.auth`` and ``server.network``. The latter two instances are below
+``server`` in the hierarchy. This means that if you turn up the verbosity for
+``server`` or direct ``server`` messages to a different handler, the changes
+will also apply to records logged to ``server.auth`` and ``server.network``.
+There's also a root :class:`Logger` that's the parent of all other loggers.
+
+For simple uses, the :mod:`logging` package contains some convenience functions
+that always use the root log::
+
+ import logging
+
+ logging.debug('Debugging information')
+ logging.info('Informational message')
+ logging.warning('Warning:config file %s not found', 'server.conf')
+ logging.error('Error occurred')
+ logging.critical('Critical error -- shutting down')
+
+This produces the following output::
+
+ WARNING:root:Warning:config file server.conf not found
+ ERROR:root:Error occurred
+ CRITICAL:root:Critical error -- shutting down
+
+In the default configuration, informational and debugging messages are
+suppressed and the output is sent to standard error. You can enable the display
+of informational and debugging messages by calling the :meth:`setLevel` method
+on the root logger.
+
+Notice the :func:`warning` call's use of string formatting operators; all of the
+functions for logging messages take the arguments ``(msg, arg1, arg2, ...)`` and
+log the string resulting from ``msg % (arg1, arg2, ...)``.
+
+There's also an :func:`exception` function that records the most recent
+traceback. Any of the other functions will also record the traceback if you
+specify a true value for the keyword argument *exc_info*. ::
+
+ def f():
+ try: 1/0
+ except: logging.exception('Problem recorded')
+
+ f()
+
+This produces the following output::
+
+ ERROR:root:Problem recorded
+ Traceback (most recent call last):
+ File "t.py", line 6, in f
+ 1/0
+ ZeroDivisionError: integer division or modulo by zero
+
+Slightly more advanced programs will use a logger other than the root logger.
+The :func:`getLogger(name)` function is used to get a particular log, creating
+it if it doesn't exist yet. :func:`getLogger(None)` returns the root logger. ::
+
+ log = logging.getLogger('server')
+ ...
+ log.info('Listening on port %i', port)
+ ...
+ log.critical('Disk full')
+ ...
+
+Log records are usually propagated up the hierarchy, so a message logged to
+``server.auth`` is also seen by ``server`` and ``root``, but a :class:`Logger`
+can prevent this by setting its :attr:`propagate` attribute to :const:`False`.
+
+There are more classes provided by the :mod:`logging` package that can be
+customized. When a :class:`Logger` instance is told to log a message, it
+creates a :class:`LogRecord` instance that is sent to any number of different
+:class:`Handler` instances. Loggers and handlers can also have an attached list
+of filters, and each filter can cause the :class:`LogRecord` to be ignored or
+can modify the record before passing it along. When they're finally output,
+:class:`LogRecord` instances are converted to text by a :class:`Formatter`
+class. All of these classes can be replaced by your own specially-written
+classes.
+
+With all of these features the :mod:`logging` package should provide enough
+flexibility for even the most complicated applications. This is only an
+incomplete overview of its features, so please see the package's reference
+documentation for all of the details. Reading :pep:`282` will also be helpful.
+
+
+.. seealso::
+
+ :pep:`282` - A Logging System
+ Written by Vinay Sajip and Trent Mick; implemented by Vinay Sajip.
+
+.. % ======================================================================
+
+
+.. _section-bool:
+
+PEP 285: A Boolean Type
+=======================
+
+A Boolean type was added to Python 2.3. Two new constants were added to the
+:mod:`__builtin__` module, :const:`True` and :const:`False`. (:const:`True` and
+:const:`False` constants were added to the built-ins in Python 2.2.1, but the
+2.2.1 versions are simply set to integer values of 1 and 0 and aren't a
+different type.)
+
+The type object for this new type is named :class:`bool`; the constructor for it
+takes any Python value and converts it to :const:`True` or :const:`False`. ::
+
+ >>> bool(1)
+ True
+ >>> bool(0)
+ False
+ >>> bool([])
+ False
+ >>> bool( (1,) )
+ True
+
+Most of the standard library modules and built-in functions have been changed to
+return Booleans. ::
+
+ >>> obj = []
+ >>> hasattr(obj, 'append')
+ True
+ >>> isinstance(obj, list)
+ True
+ >>> isinstance(obj, tuple)
+ False
+
+Python's Booleans were added with the primary goal of making code clearer. For
+example, if you're reading a function and encounter the statement ``return 1``,
+you might wonder whether the ``1`` represents a Boolean truth value, an index,
+or a coefficient that multiplies some other quantity. If the statement is
+``return True``, however, the meaning of the return value is quite clear.
+
+Python's Booleans were *not* added for the sake of strict type-checking. A very
+strict language such as Pascal would also prevent you performing arithmetic with
+Booleans, and would require that the expression in an :keyword:`if` statement
+always evaluate to a Boolean result. Python is not this strict and never will
+be, as :pep:`285` explicitly says. This means you can still use any expression
+in an :keyword:`if` statement, even ones that evaluate to a list or tuple or
+some random object. The Boolean type is a subclass of the :class:`int` class so
+that arithmetic using a Boolean still works. ::
+
+ >>> True + 1
+ 2
+ >>> False + 1
+ 1
+ >>> False * 75
+ 0
+ >>> True * 75
+ 75
+
+To sum up :const:`True` and :const:`False` in a sentence: they're alternative
+ways to spell the integer values 1 and 0, with the single difference that
+:func:`str` and :func:`repr` return the strings ``'True'`` and ``'False'``
+instead of ``'1'`` and ``'0'``.
+
+
+.. seealso::
+
+ :pep:`285` - Adding a bool type
+ Written and implemented by GvR.
+
+.. % ======================================================================
+
+
+PEP 293: Codec Error Handling Callbacks
+=======================================
+
+When encoding a Unicode string into a byte string, unencodable characters may be
+encountered. So far, Python has allowed specifying the error processing as
+either "strict" (raising :exc:`UnicodeError`), "ignore" (skipping the
+character), or "replace" (using a question mark in the output string), with
+"strict" being the default behavior. It may be desirable to specify alternative
+processing of such errors, such as inserting an XML character reference or HTML
+entity reference into the converted string.
+
+Python now has a flexible framework to add different processing strategies. New
+error handlers can be added with :func:`codecs.register_error`, and codecs then
+can access the error handler with :func:`codecs.lookup_error`. An equivalent C
+API has been added for codecs written in C. The error handler gets the necessary
+state information such as the string being converted, the position in the string
+where the error was detected, and the target encoding. The handler can then
+either raise an exception or return a replacement string.
+
+Two additional error handlers have been implemented using this framework:
+"backslashreplace" uses Python backslash quoting to represent unencodable
+characters and "xmlcharrefreplace" emits XML character references.
+
+
+.. seealso::
+
+ :pep:`293` - Codec Error Handling Callbacks
+ Written and implemented by Walter Dörwald.
+
+.. % ======================================================================
+
+
+.. _section-pep301:
+
+PEP 301: Package Index and Metadata for Distutils
+=================================================
+
+Support for the long-requested Python catalog makes its first appearance in 2.3.
+
+The heart of the catalog is the new Distutils :command:`register` command.
+Running ``python setup.py register`` will collect the metadata describing a
+package, such as its name, version, maintainer, description, &c., and send it to
+a central catalog server. The resulting catalog is available from
+http://www.python.org/pypi.
+
+To make the catalog a bit more useful, a new optional *classifiers* keyword
+argument has been added to the Distutils :func:`setup` function. A list of
+`Trove <http://catb.org/~esr/trove/>`_-style strings can be supplied to help
+classify the software.
+
+Here's an example :file:`setup.py` with classifiers, written to be compatible
+with older versions of the Distutils::
+
+ from distutils import core
+ kw = {'name': "Quixote",
+ 'version': "0.5.1",
+ 'description': "A highly Pythonic Web application framework",
+ # ...
+ }
+
+ if (hasattr(core, 'setup_keywords') and
+ 'classifiers' in core.setup_keywords):
+ kw['classifiers'] = \
+ ['Topic :: Internet :: WWW/HTTP :: Dynamic Content',
+ 'Environment :: No Input/Output (Daemon)',
+ 'Intended Audience :: Developers'],
+
+ core.setup(**kw)
+
+The full list of classifiers can be obtained by running ``python setup.py
+register --list-classifiers``.
+
+
+.. seealso::
+
+ :pep:`301` - Package Index and Metadata for Distutils
+ Written and implemented by Richard Jones.
+
+.. % ======================================================================
+
+
+.. _section-pep302:
+
+PEP 302: New Import Hooks
+=========================
+
+While it's been possible to write custom import hooks ever since the
+:mod:`ihooks` module was introduced in Python 1.3, no one has ever been really
+happy with it because writing new import hooks is difficult and messy. There
+have been various proposed alternatives such as the :mod:`imputil` and :mod:`iu`
+modules, but none of them has ever gained much acceptance, and none of them were
+easily usable from C code.
+
+:pep:`302` borrows ideas from its predecessors, especially from Gordon
+McMillan's :mod:`iu` module. Three new items are added to the :mod:`sys`
+module:
+
+* ``sys.path_hooks`` is a list of callable objects; most often they'll be
+ classes. Each callable takes a string containing a path and either returns an
+ importer object that will handle imports from this path or raises an
+ :exc:`ImportError` exception if it can't handle this path.
+
+* ``sys.path_importer_cache`` caches importer objects for each path, so
+ ``sys.path_hooks`` will only need to be traversed once for each path.
+
+* ``sys.meta_path`` is a list of importer objects that will be traversed before
+ ``sys.path`` is checked. This list is initially empty, but user code can add
+ objects to it. Additional built-in and frozen modules can be imported by an
+ object added to this list.
+
+Importer objects must have a single method, :meth:`find_module(fullname,
+path=None)`. *fullname* will be a module or package name, e.g. ``string`` or
+``distutils.core``. :meth:`find_module` must return a loader object that has a
+single method, :meth:`load_module(fullname)`, that creates and returns the
+corresponding module object.
+
+Pseudo-code for Python's new import logic, therefore, looks something like this
+(simplified a bit; see :pep:`302` for the full details)::
+
+ for mp in sys.meta_path:
+ loader = mp(fullname)
+ if loader is not None:
+ <module> = loader.load_module(fullname)
+
+ for path in sys.path:
+ for hook in sys.path_hooks:
+ try:
+ importer = hook(path)
+ except ImportError:
+ # ImportError, so try the other path hooks
+ pass
+ else:
+ loader = importer.find_module(fullname)
+ <module> = loader.load_module(fullname)
+
+ # Not found!
+ raise ImportError
+
+
+.. seealso::
+
+ :pep:`302` - New Import Hooks
+ Written by Just van Rossum and Paul Moore. Implemented by Just van Rossum.
+
+.. % ======================================================================
+
+
+.. _section-pep305:
+
+PEP 305: Comma-separated Files
+==============================
+
+Comma-separated files are a format frequently used for exporting data from
+databases and spreadsheets. Python 2.3 adds a parser for comma-separated files.
+
+Comma-separated format is deceptively simple at first glance::
+
+ Costs,150,200,3.95
+
+Read a line and call ``line.split(',')``: what could be simpler? But toss in
+string data that can contain commas, and things get more complicated::
+
+ "Costs",150,200,3.95,"Includes taxes, shipping, and sundry items"
+
+A big ugly regular expression can parse this, but using the new :mod:`csv`
+package is much simpler::
+
+ import csv
+
+ input = open('datafile', 'rb')
+ reader = csv.reader(input)
+ for line in reader:
+ print line
+
+The :func:`reader` function takes a number of different options. The field
+separator isn't limited to the comma and can be changed to any character, and so
+can the quoting and line-ending characters.
+
+Different dialects of comma-separated files can be defined and registered;
+currently there are two dialects, both used by Microsoft Excel. A separate
+:class:`csv.writer` class will generate comma-separated files from a succession
+of tuples or lists, quoting strings that contain the delimiter.
+
+
+.. seealso::
+
+ :pep:`305` - CSV File API
+ Written and implemented by Kevin Altis, Dave Cole, Andrew McNamara, Skip
+ Montanaro, Cliff Wells.
+
+.. % ======================================================================
+
+
+.. _section-pep307:
+
+PEP 307: Pickle Enhancements
+============================
+
+The :mod:`pickle` and :mod:`cPickle` modules received some attention during the
+2.3 development cycle. In 2.2, new-style classes could be pickled without
+difficulty, but they weren't pickled very compactly; :pep:`307` quotes a trivial
+example where a new-style class results in a pickled string three times longer
+than that for a classic class.
+
+The solution was to invent a new pickle protocol. The :func:`pickle.dumps`
+function has supported a text-or-binary flag for a long time. In 2.3, this
+flag is redefined from a Boolean to an integer: 0 is the old text-mode pickle
+format, 1 is the old binary format, and now 2 is a new 2.3-specific format. A
+new constant, :const:`pickle.HIGHEST_PROTOCOL`, can be used to select the
+fanciest protocol available.
+
+Unpickling is no longer considered a safe operation. 2.2's :mod:`pickle`
+provided hooks for trying to prevent unsafe classes from being unpickled
+(specifically, a :attr:`__safe_for_unpickling__` attribute), but none of this
+code was ever audited and therefore it's all been ripped out in 2.3. You should
+not unpickle untrusted data in any version of Python.
+
+To reduce the pickling overhead for new-style classes, a new interface for
+customizing pickling was added using three special methods:
+:meth:`__getstate__`, :meth:`__setstate__`, and :meth:`__getnewargs__`. Consult
+:pep:`307` for the full semantics of these methods.
+
+As a way to compress pickles yet further, it's now possible to use integer codes
+instead of long strings to identify pickled classes. The Python Software
+Foundation will maintain a list of standardized codes; there's also a range of
+codes for private use. Currently no codes have been specified.
+
+
+.. seealso::
+
+ :pep:`307` - Extensions to the pickle protocol
+ Written and implemented by Guido van Rossum and Tim Peters.
+
+.. % ======================================================================
+
+
+.. _section-slices:
+
+Extended Slices
+===============
+
+Ever since Python 1.4, the slicing syntax has supported an optional third "step"
+or "stride" argument. For example, these are all legal Python syntax:
+``L[1:10:2]``, ``L[:-1:1]``, ``L[::-1]``. This was added to Python at the
+request of the developers of Numerical Python, which uses the third argument
+extensively. However, Python's built-in list, tuple, and string sequence types
+have never supported this feature, raising a :exc:`TypeError` if you tried it.
+Michael Hudson contributed a patch to fix this shortcoming.
+
+For example, you can now easily extract the elements of a list that have even
+indexes::
+
+ >>> L = range(10)
+ >>> L[::2]
+ [0, 2, 4, 6, 8]
+
+Negative values also work to make a copy of the same list in reverse order::
+
+ >>> L[::-1]
+ [9, 8, 7, 6, 5, 4, 3, 2, 1, 0]
+
+This also works for tuples, arrays, and strings::
+
+ >>> s='abcd'
+ >>> s[::2]
+ 'ac'
+ >>> s[::-1]
+ 'dcba'
+
+If you have a mutable sequence such as a list or an array you can assign to or
+delete an extended slice, but there are some differences between assignment to
+extended and regular slices. Assignment to a regular slice can be used to
+change the length of the sequence::
+
+ >>> a = range(3)
+ >>> a
+ [0, 1, 2]
+ >>> a[1:3] = [4, 5, 6]
+ >>> a
+ [0, 4, 5, 6]
+
+Extended slices aren't this flexible. When assigning to an extended slice, the
+list on the right hand side of the statement must contain the same number of
+items as the slice it is replacing::
+
+ >>> a = range(4)
+ >>> a
+ [0, 1, 2, 3]
+ >>> a[::2]
+ [0, 2]
+ >>> a[::2] = [0, -1]
+ >>> a
+ [0, 1, -1, 3]
+ >>> a[::2] = [0,1,2]
+ Traceback (most recent call last):
+ File "<stdin>", line 1, in ?
+ ValueError: attempt to assign sequence of size 3 to extended slice of size 2
+
+Deletion is more straightforward::
+
+ >>> a = range(4)
+ >>> a
+ [0, 1, 2, 3]
+ >>> a[::2]
+ [0, 2]
+ >>> del a[::2]
+ >>> a
+ [1, 3]
+
+One can also now pass slice objects to the :meth:`__getitem__` methods of the
+built-in sequences::
+
+ >>> range(10).__getitem__(slice(0, 5, 2))
+ [0, 2, 4]
+
+Or use slice objects directly in subscripts::
+
+ >>> range(10)[slice(0, 5, 2)]
+ [0, 2, 4]
+
+To simplify implementing sequences that support extended slicing, slice objects
+now have a method :meth:`indices(length)` which, given the length of a sequence,
+returns a ``(start, stop, step)`` tuple that can be passed directly to
+:func:`range`. :meth:`indices` handles omitted and out-of-bounds indices in a
+manner consistent with regular slices (and this innocuous phrase hides a welter
+of confusing details!). The method is intended to be used like this::
+
+ class FakeSeq:
+ ...
+ def calc_item(self, i):
+ ...
+ def __getitem__(self, item):
+ if isinstance(item, slice):
+ indices = item.indices(len(self))
+ return FakeSeq([self.calc_item(i) for i in range(*indices)])
+ else:
+ return self.calc_item(i)
+
+From this example you can also see that the built-in :class:`slice` object is
+now the type object for the slice type, and is no longer a function. This is
+consistent with Python 2.2, where :class:`int`, :class:`str`, etc., underwent
+the same change.
+
+.. % ======================================================================
+
+
+Other Language Changes
+======================
+
+Here are all of the changes that Python 2.3 makes to the core Python language.
+
+* The :keyword:`yield` statement is now always a keyword, as described in
+ section :ref:`section-generators` of this document.
+
+* A new built-in function :func:`enumerate` was added, as described in section
+ :ref:`section-enumerate` of this document.
+
+* Two new constants, :const:`True` and :const:`False` were added along with the
+ built-in :class:`bool` type, as described in section :ref:`section-bool` of this
+ document.
+
+* The :func:`int` type constructor will now return a long integer instead of
+ raising an :exc:`OverflowError` when a string or floating-point number is too
+ large to fit into an integer. This can lead to the paradoxical result that
+ ``isinstance(int(expression), int)`` is false, but that seems unlikely to cause
+ problems in practice.
+
+* Built-in types now support the extended slicing syntax, as described in
+ section :ref:`section-slices` of this document.
+
+* A new built-in function, :func:`sum(iterable, start=0)`, adds up the numeric
+ items in the iterable object and returns their sum. :func:`sum` only accepts
+ numbers, meaning that you can't use it to concatenate a bunch of strings.
+ (Contributed by Alex Martelli.)
+
+* ``list.insert(pos, value)`` used to insert *value* at the front of the list
+ when *pos* was negative. The behaviour has now been changed to be consistent
+ with slice indexing, so when *pos* is -1 the value will be inserted before the
+ last element, and so forth.
+
+* ``list.index(value)``, which searches for *value* within the list and returns
+ its index, now takes optional *start* and *stop* arguments to limit the search
+ to only part of the list.
+
+* Dictionaries have a new method, :meth:`pop(key[, *default*])`, that returns
+ the value corresponding to *key* and removes that key/value pair from the
+ dictionary. If the requested key isn't present in the dictionary, *default* is
+ returned if it's specified and :exc:`KeyError` raised if it isn't. ::
+
+ >>> d = {1:2}
+ >>> d
+ {1: 2}
+ >>> d.pop(4)
+ Traceback (most recent call last):
+ File "stdin", line 1, in ?
+ KeyError: 4
+ >>> d.pop(1)
+ 2
+ >>> d.pop(1)
+ Traceback (most recent call last):
+ File "stdin", line 1, in ?
+ KeyError: 'pop(): dictionary is empty'
+ >>> d
+ {}
+ >>>
+
+ There's also a new class method, :meth:`dict.fromkeys(iterable, value)`, that
+ creates a dictionary with keys taken from the supplied iterator *iterable* and
+ all values set to *value*, defaulting to ``None``.
+
+ (Patches contributed by Raymond Hettinger.)
+
+ Also, the :func:`dict` constructor now accepts keyword arguments to simplify
+ creating small dictionaries::
+
+ >>> dict(red=1, blue=2, green=3, black=4)
+ {'blue': 2, 'black': 4, 'green': 3, 'red': 1}
+
+ (Contributed by Just van Rossum.)
+
+* The :keyword:`assert` statement no longer checks the ``__debug__`` flag, so
+ you can no longer disable assertions by assigning to ``__debug__``. Running
+ Python with the :option:`-O` switch will still generate code that doesn't
+ execute any assertions.
+
+* Most type objects are now callable, so you can use them to create new objects
+ such as functions, classes, and modules. (This means that the :mod:`new` module
+ can be deprecated in a future Python version, because you can now use the type
+ objects available in the :mod:`types` module.) For example, you can create a new
+ module object with the following code:
+
+ .. % XXX should new.py use PendingDeprecationWarning?
+
+ ::
+
+ >>> import types
+ >>> m = types.ModuleType('abc','docstring')
+ >>> m
+ <module 'abc' (built-in)>
+ >>> m.__doc__
+ 'docstring'
+
+* A new warning, :exc:`PendingDeprecationWarning` was added to indicate features
+ which are in the process of being deprecated. The warning will *not* be printed
+ by default. To check for use of features that will be deprecated in the future,
+ supply :option:`-Walways::PendingDeprecationWarning::` on the command line or
+ use :func:`warnings.filterwarnings`.
+
+* The process of deprecating string-based exceptions, as in ``raise "Error
+ occurred"``, has begun. Raising a string will now trigger
+ :exc:`PendingDeprecationWarning`.
+
+* Using ``None`` as a variable name will now result in a :exc:`SyntaxWarning`
+ warning. In a future version of Python, ``None`` may finally become a keyword.
+
+* The :meth:`xreadlines` method of file objects, introduced in Python 2.1, is no
+ longer necessary because files now behave as their own iterator.
+ :meth:`xreadlines` was originally introduced as a faster way to loop over all
+ the lines in a file, but now you can simply write ``for line in file_obj``.
+ File objects also have a new read-only :attr:`encoding` attribute that gives the
+ encoding used by the file; Unicode strings written to the file will be
+ automatically converted to bytes using the given encoding.
+
+* The method resolution order used by new-style classes has changed, though
+ you'll only notice the difference if you have a really complicated inheritance
+ hierarchy. Classic classes are unaffected by this change. Python 2.2
+ originally used a topological sort of a class's ancestors, but 2.3 now uses the
+ C3 algorithm as described in the paper `"A Monotonic Superclass Linearization
+ for Dylan" <http://www.webcom.com/haahr/dylan/linearization-oopsla96.html>`_. To
+ understand the motivation for this change, read Michele Simionato's article
+ `"Python 2.3 Method Resolution Order" <http://www.python.org/2.3/mro.html>`_, or
+ read the thread on python-dev starting with the message at
+ http://mail.python.org/pipermail/python-dev/2002-October/029035.html. Samuele
+ Pedroni first pointed out the problem and also implemented the fix by coding the
+ C3 algorithm.
+
+* Python runs multithreaded programs by switching between threads after
+ executing N bytecodes. The default value for N has been increased from 10 to
+ 100 bytecodes, speeding up single-threaded applications by reducing the
+ switching overhead. Some multithreaded applications may suffer slower response
+ time, but that's easily fixed by setting the limit back to a lower number using
+ :func:`sys.setcheckinterval(N)`. The limit can be retrieved with the new
+ :func:`sys.getcheckinterval` function.
+
+* One minor but far-reaching change is that the names of extension types defined
+ by the modules included with Python now contain the module and a ``'.'`` in
+ front of the type name. For example, in Python 2.2, if you created a socket and
+ printed its :attr:`__class__`, you'd get this output::
+
+ >>> s = socket.socket()
+ >>> s.__class__
+ <type 'socket'>
+
+ In 2.3, you get this::
+
+ >>> s.__class__
+ <type '_socket.socket'>
+
+* One of the noted incompatibilities between old- and new-style classes has been
+ removed: you can now assign to the :attr:`__name__` and :attr:`__bases__`
+ attributes of new-style classes. There are some restrictions on what can be
+ assigned to :attr:`__bases__` along the lines of those relating to assigning to
+ an instance's :attr:`__class__` attribute.
+
+.. % ======================================================================
+
+
+String Changes
+--------------
+
+* The :keyword:`in` operator now works differently for strings. Previously, when
+ evaluating ``X in Y`` where *X* and *Y* are strings, *X* could only be a single
+ character. That's now changed; *X* can be a string of any length, and ``X in Y``
+ will return :const:`True` if *X* is a substring of *Y*. If *X* is the empty
+ string, the result is always :const:`True`. ::
+
+ >>> 'ab' in 'abcd'
+ True
+ >>> 'ad' in 'abcd'
+ False
+ >>> '' in 'abcd'
+ True
+
+ Note that this doesn't tell you where the substring starts; if you need that
+ information, use the :meth:`find` string method.
+
+* The :meth:`strip`, :meth:`lstrip`, and :meth:`rstrip` string methods now have
+ an optional argument for specifying the characters to strip. The default is
+ still to remove all whitespace characters::
+
+ >>> ' abc '.strip()
+ 'abc'
+ >>> '><><abc<><><>'.strip('<>')
+ 'abc'
+ >>> '><><abc<><><>\n'.strip('<>')
+ 'abc<><><>\n'
+ >>> u'\u4000\u4001abc\u4000'.strip(u'\u4000')
+ u'\u4001abc'
+ >>>
+
+ (Suggested by Simon Brunning and implemented by Walter Dörwald.)
+
+* The :meth:`startswith` and :meth:`endswith` string methods now accept negative
+ numbers for the *start* and *end* parameters.
+
+* Another new string method is :meth:`zfill`, originally a function in the
+ :mod:`string` module. :meth:`zfill` pads a numeric string with zeros on the
+ left until it's the specified width. Note that the ``%`` operator is still more
+ flexible and powerful than :meth:`zfill`. ::
+
+ >>> '45'.zfill(4)
+ '0045'
+ >>> '12345'.zfill(4)
+ '12345'
+ >>> 'goofy'.zfill(6)
+ '0goofy'
+
+ (Contributed by Walter Dörwald.)
+
+* A new type object, :class:`basestring`, has been added. Both 8-bit strings and
+ Unicode strings inherit from this type, so ``isinstance(obj, basestring)`` will
+ return :const:`True` for either kind of string. It's a completely abstract
+ type, so you can't create :class:`basestring` instances.
+
+* Interned strings are no longer immortal and will now be garbage-collected in
+ the usual way when the only reference to them is from the internal dictionary of
+ interned strings. (Implemented by Oren Tirosh.)
+
+.. % ======================================================================
+
+
+Optimizations
+-------------
+
+* The creation of new-style class instances has been made much faster; they're
+ now faster than classic classes!
+
+* The :meth:`sort` method of list objects has been extensively rewritten by Tim
+ Peters, and the implementation is significantly faster.
+
+* Multiplication of large long integers is now much faster thanks to an
+ implementation of Karatsuba multiplication, an algorithm that scales better than
+ the O(n\*n) required for the grade-school multiplication algorithm. (Original
+ patch by Christopher A. Craig, and significantly reworked by Tim Peters.)
+
+* The ``SET_LINENO`` opcode is now gone. This may provide a small speed
+ increase, depending on your compiler's idiosyncrasies. See section
+ :ref:`section-other` for a longer explanation. (Removed by Michael Hudson.)
+
+* :func:`xrange` objects now have their own iterator, making ``for i in
+ xrange(n)`` slightly faster than ``for i in range(n)``. (Patch by Raymond
+ Hettinger.)
+
+* A number of small rearrangements have been made in various hotspots to improve
+ performance, such as inlining a function or removing some code. (Implemented
+ mostly by GvR, but lots of people have contributed single changes.)
+
+The net result of the 2.3 optimizations is that Python 2.3 runs the pystone
+benchmark around 25% faster than Python 2.2.
+
+.. % ======================================================================
+
+
+New, Improved, and Deprecated Modules
+=====================================
+
+As usual, Python's standard library received a number of enhancements and bug
+fixes. Here's a partial list of the most notable changes, sorted alphabetically
+by module name. Consult the :file:`Misc/NEWS` file in the source tree for a more
+complete list of changes, or look through the CVS logs for all the details.
+
+* The :mod:`array` module now supports arrays of Unicode characters using the
+ ``'u'`` format character. Arrays also now support using the ``+=`` assignment
+ operator to add another array's contents, and the ``*=`` assignment operator to
+ repeat an array. (Contributed by Jason Orendorff.)
+
+* The :mod:`bsddb` module has been replaced by version 4.1.6 of the `PyBSDDB
+ <http://pybsddb.sourceforge.net>`_ package, providing a more complete interface
+ to the transactional features of the BerkeleyDB library.
+
+ The old version of the module has been renamed to :mod:`bsddb185` and is no
+ longer built automatically; you'll have to edit :file:`Modules/Setup` to enable
+ it. Note that the new :mod:`bsddb` package is intended to be compatible with
+ the old module, so be sure to file bugs if you discover any incompatibilities.
+ When upgrading to Python 2.3, if the new interpreter is compiled with a new
+ version of the underlying BerkeleyDB library, you will almost certainly have to
+ convert your database files to the new version. You can do this fairly easily
+ with the new scripts :file:`db2pickle.py` and :file:`pickle2db.py` which you
+ will find in the distribution's :file:`Tools/scripts` directory. If you've
+ already been using the PyBSDDB package and importing it as :mod:`bsddb3`, you
+ will have to change your ``import`` statements to import it as :mod:`bsddb`.
+
+* The new :mod:`bz2` module is an interface to the bz2 data compression library.
+ bz2-compressed data is usually smaller than corresponding :mod:`zlib`\
+ -compressed data. (Contributed by Gustavo Niemeyer.)
+
+* A set of standard date/time types has been added in the new :mod:`datetime`
+ module. See the following section for more details.
+
+* The Distutils :class:`Extension` class now supports an extra constructor
+ argument named *depends* for listing additional source files that an extension
+ depends on. This lets Distutils recompile the module if any of the dependency
+ files are modified. For example, if :file:`sampmodule.c` includes the header
+ file :file:`sample.h`, you would create the :class:`Extension` object like
+ this::
+
+ ext = Extension("samp",
+ sources=["sampmodule.c"],
+ depends=["sample.h"])
+
+ Modifying :file:`sample.h` would then cause the module to be recompiled.
+ (Contributed by Jeremy Hylton.)
+
+* Other minor changes to Distutils: it now checks for the :envvar:`CC`,
+ :envvar:`CFLAGS`, :envvar:`CPP`, :envvar:`LDFLAGS`, and :envvar:`CPPFLAGS`
+ environment variables, using them to override the settings in Python's
+ configuration (contributed by Robert Weber).
+
+* Previously the :mod:`doctest` module would only search the docstrings of
+ public methods and functions for test cases, but it now also examines private
+ ones as well. The :func:`DocTestSuite(` function creates a
+ :class:`unittest.TestSuite` object from a set of :mod:`doctest` tests.
+
+* The new :func:`gc.get_referents(object)` function returns a list of all the
+ objects referenced by *object*.
+
+* The :mod:`getopt` module gained a new function, :func:`gnu_getopt`, that
+ supports the same arguments as the existing :func:`getopt` function but uses
+ GNU-style scanning mode. The existing :func:`getopt` stops processing options as
+ soon as a non-option argument is encountered, but in GNU-style mode processing
+ continues, meaning that options and arguments can be mixed. For example::
+
+ >>> getopt.getopt(['-f', 'filename', 'output', '-v'], 'f:v')
+ ([('-f', 'filename')], ['output', '-v'])
+ >>> getopt.gnu_getopt(['-f', 'filename', 'output', '-v'], 'f:v')
+ ([('-f', 'filename'), ('-v', '')], ['output'])
+
+ (Contributed by Peter Åstrand.)
+
+* The :mod:`grp`, :mod:`pwd`, and :mod:`resource` modules now return enhanced
+ tuples::
+
+ >>> import grp
+ >>> g = grp.getgrnam('amk')
+ >>> g.gr_name, g.gr_gid
+ ('amk', 500)
+
+* The :mod:`gzip` module can now handle files exceeding 2 GiB.
+
+* The new :mod:`heapq` module contains an implementation of a heap queue
+ algorithm. A heap is an array-like data structure that keeps items in a
+ partially sorted order such that, for every index *k*, ``heap[k] <=
+ heap[2*k+1]`` and ``heap[k] <= heap[2*k+2]``. This makes it quick to remove the
+ smallest item, and inserting a new item while maintaining the heap property is
+ O(lg n). (See http://www.nist.gov/dads/HTML/priorityque.html for more
+ information about the priority queue data structure.)
+
+ The :mod:`heapq` module provides :func:`heappush` and :func:`heappop` functions
+ for adding and removing items while maintaining the heap property on top of some
+ other mutable Python sequence type. Here's an example that uses a Python list::
+
+ >>> import heapq
+ >>> heap = []
+ >>> for item in [3, 7, 5, 11, 1]:
+ ... heapq.heappush(heap, item)
+ ...
+ >>> heap
+ [1, 3, 5, 11, 7]
+ >>> heapq.heappop(heap)
+ 1
+ >>> heapq.heappop(heap)
+ 3
+ >>> heap
+ [5, 7, 11]
+
+ (Contributed by Kevin O'Connor.)
+
+* The IDLE integrated development environment has been updated using the code
+ from the IDLEfork project (http://idlefork.sf.net). The most notable feature is
+ that the code being developed is now executed in a subprocess, meaning that
+ there's no longer any need for manual ``reload()`` operations. IDLE's core code
+ has been incorporated into the standard library as the :mod:`idlelib` package.
+
+* The :mod:`imaplib` module now supports IMAP over SSL. (Contributed by Piers
+ Lauder and Tino Lange.)
+
+* The :mod:`itertools` contains a number of useful functions for use with
+ iterators, inspired by various functions provided by the ML and Haskell
+ languages. For example, ``itertools.ifilter(predicate, iterator)`` returns all
+ elements in the iterator for which the function :func:`predicate` returns
+ :const:`True`, and ``itertools.repeat(obj, N)`` returns ``obj`` *N* times.
+ There are a number of other functions in the module; see the package's reference
+ documentation for details.
+ (Contributed by Raymond Hettinger.)
+
+* Two new functions in the :mod:`math` module, :func:`degrees(rads)` and
+ :func:`radians(degs)`, convert between radians and degrees. Other functions in
+ the :mod:`math` module such as :func:`math.sin` and :func:`math.cos` have always
+ required input values measured in radians. Also, an optional *base* argument
+ was added to :func:`math.log` to make it easier to compute logarithms for bases
+ other than ``e`` and ``10``. (Contributed by Raymond Hettinger.)
+
+* Several new POSIX functions (:func:`getpgid`, :func:`killpg`, :func:`lchown`,
+ :func:`loadavg`, :func:`major`, :func:`makedev`, :func:`minor`, and
+ :func:`mknod`) were added to the :mod:`posix` module that underlies the
+ :mod:`os` module. (Contributed by Gustavo Niemeyer, Geert Jansen, and Denis S.
+ Otkidach.)
+
+* In the :mod:`os` module, the :func:`\*stat` family of functions can now report
+ fractions of a second in a timestamp. Such time stamps are represented as
+ floats, similar to the value returned by :func:`time.time`.
+
+ During testing, it was found that some applications will break if time stamps
+ are floats. For compatibility, when using the tuple interface of the
+ :class:`stat_result` time stamps will be represented as integers. When using
+ named fields (a feature first introduced in Python 2.2), time stamps are still
+ represented as integers, unless :func:`os.stat_float_times` is invoked to enable
+ float return values::
+
+ >>> os.stat("/tmp").st_mtime
+ 1034791200
+ >>> os.stat_float_times(True)
+ >>> os.stat("/tmp").st_mtime
+ 1034791200.6335014
+
+ In Python 2.4, the default will change to always returning floats.
+
+ Application developers should enable this feature only if all their libraries
+ work properly when confronted with floating point time stamps, or if they use
+ the tuple API. If used, the feature should be activated on an application level
+ instead of trying to enable it on a per-use basis.
+
+* The :mod:`optparse` module contains a new parser for command-line arguments
+ that can convert option values to a particular Python type and will
+ automatically generate a usage message. See the following section for more
+ details.
+
+* The old and never-documented :mod:`linuxaudiodev` module has been deprecated,
+ and a new version named :mod:`ossaudiodev` has been added. The module was
+ renamed because the OSS sound drivers can be used on platforms other than Linux,
+ and the interface has also been tidied and brought up to date in various ways.
+ (Contributed by Greg Ward and Nicholas FitzRoy-Dale.)
+
+* The new :mod:`platform` module contains a number of functions that try to
+ determine various properties of the platform you're running on. There are
+ functions for getting the architecture, CPU type, the Windows OS version, and
+ even the Linux distribution version. (Contributed by Marc-André Lemburg.)
+
+* The parser objects provided by the :mod:`pyexpat` module can now optionally
+ buffer character data, resulting in fewer calls to your character data handler
+ and therefore faster performance. Setting the parser object's
+ :attr:`buffer_text` attribute to :const:`True` will enable buffering.
+
+* The :func:`sample(population, k)` function was added to the :mod:`random`
+ module. *population* is a sequence or :class:`xrange` object containing the
+ elements of a population, and :func:`sample` chooses *k* elements from the
+ population without replacing chosen elements. *k* can be any value up to
+ ``len(population)``. For example::
+
+ >>> days = ['Mo', 'Tu', 'We', 'Th', 'Fr', 'St', 'Sn']
+ >>> random.sample(days, 3) # Choose 3 elements
+ ['St', 'Sn', 'Th']
+ >>> random.sample(days, 7) # Choose 7 elements
+ ['Tu', 'Th', 'Mo', 'We', 'St', 'Fr', 'Sn']
+ >>> random.sample(days, 7) # Choose 7 again
+ ['We', 'Mo', 'Sn', 'Fr', 'Tu', 'St', 'Th']
+ >>> random.sample(days, 8) # Can't choose eight
+ Traceback (most recent call last):
+ File "<stdin>", line 1, in ?
+ File "random.py", line 414, in sample
+ raise ValueError, "sample larger than population"
+ ValueError: sample larger than population
+ >>> random.sample(xrange(1,10000,2), 10) # Choose ten odd nos. under 10000
+ [3407, 3805, 1505, 7023, 2401, 2267, 9733, 3151, 8083, 9195]
+
+ The :mod:`random` module now uses a new algorithm, the Mersenne Twister,
+ implemented in C. It's faster and more extensively studied than the previous
+ algorithm.
+
+ (All changes contributed by Raymond Hettinger.)
+
+* The :mod:`readline` module also gained a number of new functions:
+ :func:`get_history_item`, :func:`get_current_history_length`, and
+ :func:`redisplay`.
+
+* The :mod:`rexec` and :mod:`Bastion` modules have been declared dead, and
+ attempts to import them will fail with a :exc:`RuntimeError`. New-style classes
+ provide new ways to break out of the restricted execution environment provided
+ by :mod:`rexec`, and no one has interest in fixing them or time to do so. If
+ you have applications using :mod:`rexec`, rewrite them to use something else.
+
+ (Sticking with Python 2.2 or 2.1 will not make your applications any safer
+ because there are known bugs in the :mod:`rexec` module in those versions. To
+ repeat: if you're using :mod:`rexec`, stop using it immediately.)
+
+* The :mod:`rotor` module has been deprecated because the algorithm it uses for
+ encryption is not believed to be secure. If you need encryption, use one of the
+ several AES Python modules that are available separately.
+
+* The :mod:`shutil` module gained a :func:`move(src, dest)` function that
+ recursively moves a file or directory to a new location.
+
+* Support for more advanced POSIX signal handling was added to the :mod:`signal`
+ but then removed again as it proved impossible to make it work reliably across
+ platforms.
+
+* The :mod:`socket` module now supports timeouts. You can call the
+ :meth:`settimeout(t)` method on a socket object to set a timeout of *t* seconds.
+ Subsequent socket operations that take longer than *t* seconds to complete will
+ abort and raise a :exc:`socket.timeout` exception.
+
+ The original timeout implementation was by Tim O'Malley. Michael Gilfix
+ integrated it into the Python :mod:`socket` module and shepherded it through a
+ lengthy review. After the code was checked in, Guido van Rossum rewrote parts
+ of it. (This is a good example of a collaborative development process in
+ action.)
+
+* On Windows, the :mod:`socket` module now ships with Secure Sockets Layer
+ (SSL) support.
+
+* The value of the C :const:`PYTHON_API_VERSION` macro is now exposed at the
+ Python level as ``sys.api_version``. The current exception can be cleared by
+ calling the new :func:`sys.exc_clear` function.
+
+* The new :mod:`tarfile` module allows reading from and writing to
+ :program:`tar`\ -format archive files. (Contributed by Lars Gustäbel.)
+
+* The new :mod:`textwrap` module contains functions for wrapping strings
+ containing paragraphs of text. The :func:`wrap(text, width)` function takes a
+ string and returns a list containing the text split into lines of no more than
+ the chosen width. The :func:`fill(text, width)` function returns a single
+ string, reformatted to fit into lines no longer than the chosen width. (As you
+ can guess, :func:`fill` is built on top of :func:`wrap`. For example::
+
+ >>> import textwrap
+ >>> paragraph = "Not a whit, we defy augury: ... more text ..."
+ >>> textwrap.wrap(paragraph, 60)
+ ["Not a whit, we defy augury: there's a special providence in",
+ "the fall of a sparrow. If it be now, 'tis not to come; if it",
+ ...]
+ >>> print textwrap.fill(paragraph, 35)
+ Not a whit, we defy augury: there's
+ a special providence in the fall of
+ a sparrow. If it be now, 'tis not
+ to come; if it be not to come, it
+ will be now; if it be not now, yet
+ it will come: the readiness is all.
+ >>>
+
+ The module also contains a :class:`TextWrapper` class that actually implements
+ the text wrapping strategy. Both the :class:`TextWrapper` class and the
+ :func:`wrap` and :func:`fill` functions support a number of additional keyword
+ arguments for fine-tuning the formatting; consult the module's documentation
+ for details. (Contributed by Greg Ward.)
+
+* The :mod:`thread` and :mod:`threading` modules now have companion modules,
+ :mod:`dummy_thread` and :mod:`dummy_threading`, that provide a do-nothing
+ implementation of the :mod:`thread` module's interface for platforms where
+ threads are not supported. The intention is to simplify thread-aware modules
+ (ones that *don't* rely on threads to run) by putting the following code at the
+ top::
+
+ try:
+ import threading as _threading
+ except ImportError:
+ import dummy_threading as _threading
+
+ In this example, :mod:`_threading` is used as the module name to make it clear
+ that the module being used is not necessarily the actual :mod:`threading`
+ module. Code can call functions and use classes in :mod:`_threading` whether or
+ not threads are supported, avoiding an :keyword:`if` statement and making the
+ code slightly clearer. This module will not magically make multithreaded code
+ run without threads; code that waits for another thread to return or to do
+ something will simply hang forever.
+
+* The :mod:`time` module's :func:`strptime` function has long been an annoyance
+ because it uses the platform C library's :func:`strptime` implementation, and
+ different platforms sometimes have odd bugs. Brett Cannon contributed a
+ portable implementation that's written in pure Python and should behave
+ identically on all platforms.
+
+* The new :mod:`timeit` module helps measure how long snippets of Python code
+ take to execute. The :file:`timeit.py` file can be run directly from the
+ command line, or the module's :class:`Timer` class can be imported and used
+ directly. Here's a short example that figures out whether it's faster to
+ convert an 8-bit string to Unicode by appending an empty Unicode string to it or
+ by using the :func:`unicode` function::
+
+ import timeit
+
+ timer1 = timeit.Timer('unicode("abc")')
+ timer2 = timeit.Timer('"abc" + u""')
+
+ # Run three trials
+ print timer1.repeat(repeat=3, number=100000)
+ print timer2.repeat(repeat=3, number=100000)
+
+ # On my laptop this outputs:
+ # [0.36831796169281006, 0.37441694736480713, 0.35304892063140869]
+ # [0.17574405670166016, 0.18193507194519043, 0.17565798759460449]
+
+* The :mod:`Tix` module has received various bug fixes and updates for the
+ current version of the Tix package.
+
+* The :mod:`Tkinter` module now works with a thread-enabled version of Tcl.
+ Tcl's threading model requires that widgets only be accessed from the thread in
+ which they're created; accesses from another thread can cause Tcl to panic. For
+ certain Tcl interfaces, :mod:`Tkinter` will now automatically avoid this when a
+ widget is accessed from a different thread by marshalling a command, passing it
+ to the correct thread, and waiting for the results. Other interfaces can't be
+ handled automatically but :mod:`Tkinter` will now raise an exception on such an
+ access so that you can at least find out about the problem. See
+ http://mail.python.org/pipermail/python-dev/2002-December/031107.html for a more
+ detailed explanation of this change. (Implemented by Martin von Löwis.)
+
+ .. %
+
+* Calling Tcl methods through :mod:`_tkinter` no longer returns only strings.
+ Instead, if Tcl returns other objects those objects are converted to their
+ Python equivalent, if one exists, or wrapped with a :class:`_tkinter.Tcl_Obj`
+ object if no Python equivalent exists. This behavior can be controlled through
+ the :meth:`wantobjects` method of :class:`tkapp` objects.
+
+ When using :mod:`_tkinter` through the :mod:`Tkinter` module (as most Tkinter
+ applications will), this feature is always activated. It should not cause
+ compatibility problems, since Tkinter would always convert string results to
+ Python types where possible.
+
+ If any incompatibilities are found, the old behavior can be restored by setting
+ the :attr:`wantobjects` variable in the :mod:`Tkinter` module to false before
+ creating the first :class:`tkapp` object. ::
+
+ import Tkinter
+ Tkinter.wantobjects = 0
+
+ Any breakage caused by this change should be reported as a bug.
+
+* The :mod:`UserDict` module has a new :class:`DictMixin` class which defines
+ all dictionary methods for classes that already have a minimum mapping
+ interface. This greatly simplifies writing classes that need to be
+ substitutable for dictionaries, such as the classes in the :mod:`shelve`
+ module.
+
+ Adding the mix-in as a superclass provides the full dictionary interface
+ whenever the class defines :meth:`__getitem__`, :meth:`__setitem__`,
+ :meth:`__delitem__`, and :meth:`keys`. For example::
+
+ >>> import UserDict
+ >>> class SeqDict(UserDict.DictMixin):
+ ... """Dictionary lookalike implemented with lists."""
+ ... def __init__(self):
+ ... self.keylist = []
+ ... self.valuelist = []
+ ... def __getitem__(self, key):
+ ... try:
+ ... i = self.keylist.index(key)
+ ... except ValueError:
+ ... raise KeyError
+ ... return self.valuelist[i]
+ ... def __setitem__(self, key, value):
+ ... try:
+ ... i = self.keylist.index(key)
+ ... self.valuelist[i] = value
+ ... except ValueError:
+ ... self.keylist.append(key)
+ ... self.valuelist.append(value)
+ ... def __delitem__(self, key):
+ ... try:
+ ... i = self.keylist.index(key)
+ ... except ValueError:
+ ... raise KeyError
+ ... self.keylist.pop(i)
+ ... self.valuelist.pop(i)
+ ... def keys(self):
+ ... return list(self.keylist)
+ ...
+ >>> s = SeqDict()
+ >>> dir(s) # See that other dictionary methods are implemented
+ ['__cmp__', '__contains__', '__delitem__', '__doc__', '__getitem__',
+ '__init__', '__iter__', '__len__', '__module__', '__repr__',
+ '__setitem__', 'clear', 'get', 'has_key', 'items', 'iteritems',
+ 'iterkeys', 'itervalues', 'keylist', 'keys', 'pop', 'popitem',
+ 'setdefault', 'update', 'valuelist', 'values']
+
+ (Contributed by Raymond Hettinger.)
+
+* The DOM implementation in :mod:`xml.dom.minidom` can now generate XML output
+ in a particular encoding by providing an optional encoding argument to the
+ :meth:`toxml` and :meth:`toprettyxml` methods of DOM nodes.
+
+* The :mod:`xmlrpclib` module now supports an XML-RPC extension for handling nil
+ data values such as Python's ``None``. Nil values are always supported on
+ unmarshalling an XML-RPC response. To generate requests containing ``None``,
+ you must supply a true value for the *allow_none* parameter when creating a
+ :class:`Marshaller` instance.
+
+* The new :mod:`DocXMLRPCServer` module allows writing self-documenting XML-RPC
+ servers. Run it in demo mode (as a program) to see it in action. Pointing the
+ Web browser to the RPC server produces pydoc-style documentation; pointing
+ xmlrpclib to the server allows invoking the actual methods. (Contributed by
+ Brian Quinlan.)
+
+* Support for internationalized domain names (RFCs 3454, 3490, 3491, and 3492)
+ has been added. The "idna" encoding can be used to convert between a Unicode
+ domain name and the ASCII-compatible encoding (ACE) of that name. ::
+
+ >{}>{}> u"www.Alliancefrançaise.nu".encode("idna")
+ 'www.xn--alliancefranaise-npb.nu'
+
+ The :mod:`socket` module has also been extended to transparently convert
+ Unicode hostnames to the ACE version before passing them to the C library.
+ Modules that deal with hostnames such as :mod:`httplib` and :mod:`ftplib`)
+ also support Unicode host names; :mod:`httplib` also sends HTTP ``Host``
+ headers using the ACE version of the domain name. :mod:`urllib` supports
+ Unicode URLs with non-ASCII host names as long as the ``path`` part of the URL
+ is ASCII only.
+
+ To implement this change, the :mod:`stringprep` module, the ``mkstringprep``
+ tool and the ``punycode`` encoding have been added.
+
+.. % ======================================================================
+
+
+Date/Time Type
+--------------
+
+Date and time types suitable for expressing timestamps were added as the
+:mod:`datetime` module. The types don't support different calendars or many
+fancy features, and just stick to the basics of representing time.
+
+The three primary types are: :class:`date`, representing a day, month, and year;
+:class:`time`, consisting of hour, minute, and second; and :class:`datetime`,
+which contains all the attributes of both :class:`date` and :class:`time`.
+There's also a :class:`timedelta` class representing differences between two
+points in time, and time zone logic is implemented by classes inheriting from
+the abstract :class:`tzinfo` class.
+
+You can create instances of :class:`date` and :class:`time` by either supplying
+keyword arguments to the appropriate constructor, e.g.
+``datetime.date(year=1972, month=10, day=15)``, or by using one of a number of
+class methods. For example, the :meth:`date.today` class method returns the
+current local date.
+
+Once created, instances of the date/time classes are all immutable. There are a
+number of methods for producing formatted strings from objects::
+
+ >>> import datetime
+ >>> now = datetime.datetime.now()
+ >>> now.isoformat()
+ '2002-12-30T21:27:03.994956'
+ >>> now.ctime() # Only available on date, datetime
+ 'Mon Dec 30 21:27:03 2002'
+ >>> now.strftime('%Y %d %b')
+ '2002 30 Dec'
+
+The :meth:`replace` method allows modifying one or more fields of a
+:class:`date` or :class:`datetime` instance, returning a new instance::
+
+ >>> d = datetime.datetime.now()
+ >>> d
+ datetime.datetime(2002, 12, 30, 22, 15, 38, 827738)
+ >>> d.replace(year=2001, hour = 12)
+ datetime.datetime(2001, 12, 30, 12, 15, 38, 827738)
+ >>>
+
+Instances can be compared, hashed, and converted to strings (the result is the
+same as that of :meth:`isoformat`). :class:`date` and :class:`datetime`
+instances can be subtracted from each other, and added to :class:`timedelta`
+instances. The largest missing feature is that there's no standard library
+support for parsing strings and getting back a :class:`date` or
+:class:`datetime`.
+
+For more information, refer to the module's reference documentation.
+(Contributed by Tim Peters.)
+
+.. % ======================================================================
+
+
+The optparse Module
+-------------------
+
+The :mod:`getopt` module provides simple parsing of command-line arguments. The
+new :mod:`optparse` module (originally named Optik) provides more elaborate
+command-line parsing that follows the Unix conventions, automatically creates
+the output for :option:`--help`, and can perform different actions for different
+options.
+
+You start by creating an instance of :class:`OptionParser` and telling it what
+your program's options are. ::
+
+ import sys
+ from optparse import OptionParser
+
+ op = OptionParser()
+ op.add_option('-i', '--input',
+ action='store', type='string', dest='input',
+ help='set input filename')
+ op.add_option('-l', '--length',
+ action='store', type='int', dest='length',
+ help='set maximum length of output')
+
+Parsing a command line is then done by calling the :meth:`parse_args` method. ::
+
+ options, args = op.parse_args(sys.argv[1:])
+ print options
+ print args
+
+This returns an object containing all of the option values, and a list of
+strings containing the remaining arguments.
+
+Invoking the script with the various arguments now works as you'd expect it to.
+Note that the length argument is automatically converted to an integer. ::
+
+ $ ./python opt.py -i data arg1
+ <Values at 0x400cad4c: {'input': 'data', 'length': None}>
+ ['arg1']
+ $ ./python opt.py --input=data --length=4
+ <Values at 0x400cad2c: {'input': 'data', 'length': 4}>
+ []
+ $
+
+The help message is automatically generated for you::
+
+ $ ./python opt.py --help
+ usage: opt.py [options]
+
+ options:
+ -h, --help show this help message and exit
+ -iINPUT, --input=INPUT
+ set input filename
+ -lLENGTH, --length=LENGTH
+ set maximum length of output
+ $
+
+See the module's documentation for more details.
+
+
+Optik was written by Greg Ward, with suggestions from the readers of the Getopt
+SIG.
+
+.. % ======================================================================
+
+
+.. _section-pymalloc:
+
+Pymalloc: A Specialized Object Allocator
+========================================
+
+Pymalloc, a specialized object allocator written by Vladimir Marangozov, was a
+feature added to Python 2.1. Pymalloc is intended to be faster than the system
+:cfunc:`malloc` and to have less memory overhead for allocation patterns typical
+of Python programs. The allocator uses C's :cfunc:`malloc` function to get large
+pools of memory and then fulfills smaller memory requests from these pools.
+
+In 2.1 and 2.2, pymalloc was an experimental feature and wasn't enabled by
+default; you had to explicitly enable it when compiling Python by providing the
+:option:`--with-pymalloc` option to the :program:`configure` script. In 2.3,
+pymalloc has had further enhancements and is now enabled by default; you'll have
+to supply :option:`--without-pymalloc` to disable it.
+
+This change is transparent to code written in Python; however, pymalloc may
+expose bugs in C extensions. Authors of C extension modules should test their
+code with pymalloc enabled, because some incorrect code may cause core dumps at
+runtime.
+
+There's one particularly common error that causes problems. There are a number
+of memory allocation functions in Python's C API that have previously just been
+aliases for the C library's :cfunc:`malloc` and :cfunc:`free`, meaning that if
+you accidentally called mismatched functions the error wouldn't be noticeable.
+When the object allocator is enabled, these functions aren't aliases of
+:cfunc:`malloc` and :cfunc:`free` any more, and calling the wrong function to
+free memory may get you a core dump. For example, if memory was allocated using
+:cfunc:`PyObject_Malloc`, it has to be freed using :cfunc:`PyObject_Free`, not
+:cfunc:`free`. A few modules included with Python fell afoul of this and had to
+be fixed; doubtless there are more third-party modules that will have the same
+problem.
+
+As part of this change, the confusing multiple interfaces for allocating memory
+have been consolidated down into two API families. Memory allocated with one
+family must not be manipulated with functions from the other family. There is
+one family for allocating chunks of memory and another family of functions
+specifically for allocating Python objects.
+
+* To allocate and free an undistinguished chunk of memory use the "raw memory"
+ family: :cfunc:`PyMem_Malloc`, :cfunc:`PyMem_Realloc`, and :cfunc:`PyMem_Free`.
+
+* The "object memory" family is the interface to the pymalloc facility described
+ above and is biased towards a large number of "small" allocations:
+ :cfunc:`PyObject_Malloc`, :cfunc:`PyObject_Realloc`, and :cfunc:`PyObject_Free`.
+
+* To allocate and free Python objects, use the "object" family
+ :cfunc:`PyObject_New`, :cfunc:`PyObject_NewVar`, and :cfunc:`PyObject_Del`.
+
+Thanks to lots of work by Tim Peters, pymalloc in 2.3 also provides debugging
+features to catch memory overwrites and doubled frees in both extension modules
+and in the interpreter itself. To enable this support, compile a debugging
+version of the Python interpreter by running :program:`configure` with
+:option:`--with-pydebug`.
+
+To aid extension writers, a header file :file:`Misc/pymemcompat.h` is
+distributed with the source to Python 2.3 that allows Python extensions to use
+the 2.3 interfaces to memory allocation while compiling against any version of
+Python since 1.5.2. You would copy the file from Python's source distribution
+and bundle it with the source of your extension.
+
+
+.. seealso::
+
+ http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/python/python/dist/src/Objects/obmalloc.c
+ For the full details of the pymalloc implementation, see the comments at the top
+ of the file :file:`Objects/obmalloc.c` in the Python source code. The above
+ link points to the file within the SourceForge CVS browser.
+
+.. % ======================================================================
+
+
+Build and C API Changes
+=======================
+
+Changes to Python's build process and to the C API include:
+
+* The cycle detection implementation used by the garbage collection has proven
+ to be stable, so it's now been made mandatory. You can no longer compile Python
+ without it, and the :option:`--with-cycle-gc` switch to :program:`configure` has
+ been removed.
+
+* Python can now optionally be built as a shared library
+ (:file:`libpython2.3.so`) by supplying :option:`--enable-shared` when running
+ Python's :program:`configure` script. (Contributed by Ondrej Palkovsky.)
+
+* The :cmacro:`DL_EXPORT` and :cmacro:`DL_IMPORT` macros are now deprecated.
+ Initialization functions for Python extension modules should now be declared
+ using the new macro :cmacro:`PyMODINIT_FUNC`, while the Python core will
+ generally use the :cmacro:`PyAPI_FUNC` and :cmacro:`PyAPI_DATA` macros.
+
+* The interpreter can be compiled without any docstrings for the built-in
+ functions and modules by supplying :option:`--without-doc-strings` to the
+ :program:`configure` script. This makes the Python executable about 10% smaller,
+ but will also mean that you can't get help for Python's built-ins. (Contributed
+ by Gustavo Niemeyer.)
+
+* The :cfunc:`PyArg_NoArgs` macro is now deprecated, and code that uses it
+ should be changed. For Python 2.2 and later, the method definition table can
+ specify the :const:`METH_NOARGS` flag, signalling that there are no arguments,
+ and the argument checking can then be removed. If compatibility with pre-2.2
+ versions of Python is important, the code could use ``PyArg_ParseTuple(args,
+ "")`` instead, but this will be slower than using :const:`METH_NOARGS`.
+
+* :cfunc:`PyArg_ParseTuple` accepts new format characters for various sizes of
+ unsigned integers: ``B`` for :ctype:`unsigned char`, ``H`` for :ctype:`unsigned
+ short int`, ``I`` for :ctype:`unsigned int`, and ``K`` for :ctype:`unsigned
+ long long`.
+
+* A new function, :cfunc:`PyObject_DelItemString(mapping, char \*key)` was added
+ as shorthand for ``PyObject_DelItem(mapping, PyString_New(key))``.
+
+* File objects now manage their internal string buffer differently, increasing
+ it exponentially when needed. This results in the benchmark tests in
+ :file:`Lib/test/test_bufio.py` speeding up considerably (from 57 seconds to 1.7
+ seconds, according to one measurement).
+
+* It's now possible to define class and static methods for a C extension type by
+ setting either the :const:`METH_CLASS` or :const:`METH_STATIC` flags in a
+ method's :ctype:`PyMethodDef` structure.
+
+* Python now includes a copy of the Expat XML parser's source code, removing any
+ dependence on a system version or local installation of Expat.
+
+* If you dynamically allocate type objects in your extension, you should be
+ aware of a change in the rules relating to the :attr:`__module__` and
+ :attr:`__name__` attributes. In summary, you will want to ensure the type's
+ dictionary contains a ``'__module__'`` key; making the module name the part of
+ the type name leading up to the final period will no longer have the desired
+ effect. For more detail, read the API reference documentation or the source.
+
+.. % ======================================================================
+
+
+Port-Specific Changes
+---------------------
+
+Support for a port to IBM's OS/2 using the EMX runtime environment was merged
+into the main Python source tree. EMX is a POSIX emulation layer over the OS/2
+system APIs. The Python port for EMX tries to support all the POSIX-like
+capability exposed by the EMX runtime, and mostly succeeds; :func:`fork` and
+:func:`fcntl` are restricted by the limitations of the underlying emulation
+layer. The standard OS/2 port, which uses IBM's Visual Age compiler, also
+gained support for case-sensitive import semantics as part of the integration of
+the EMX port into CVS. (Contributed by Andrew MacIntyre.)
+
+On MacOS, most toolbox modules have been weaklinked to improve backward
+compatibility. This means that modules will no longer fail to load if a single
+routine is missing on the current OS version. Instead calling the missing
+routine will raise an exception. (Contributed by Jack Jansen.)
+
+The RPM spec files, found in the :file:`Misc/RPM/` directory in the Python
+source distribution, were updated for 2.3. (Contributed by Sean Reifschneider.)
+
+Other new platforms now supported by Python include AtheOS
+(http://www.atheos.cx/), GNU/Hurd, and OpenVMS.
+
+.. % ======================================================================
+
+
+.. _section-other:
+
+Other Changes and Fixes
+=======================
+
+As usual, there were a bunch of other improvements and bugfixes scattered
+throughout the source tree. A search through the CVS change logs finds there
+were 523 patches applied and 514 bugs fixed between Python 2.2 and 2.3. Both
+figures are likely to be underestimates.
+
+Some of the more notable changes are:
+
+* If the :envvar:`PYTHONINSPECT` environment variable is set, the Python
+ interpreter will enter the interactive prompt after running a Python program, as
+ if Python had been invoked with the :option:`-i` option. The environment
+ variable can be set before running the Python interpreter, or it can be set by
+ the Python program as part of its execution.
+
+* The :file:`regrtest.py` script now provides a way to allow "all resources
+ except *foo*." A resource name passed to the :option:`-u` option can now be
+ prefixed with a hyphen (``'-'``) to mean "remove this resource." For example,
+ the option '``-uall,-bsddb``' could be used to enable the use of all resources
+ except ``bsddb``.
+
+* The tools used to build the documentation now work under Cygwin as well as
+ Unix.
+
+* The ``SET_LINENO`` opcode has been removed. Back in the mists of time, this
+ opcode was needed to produce line numbers in tracebacks and support trace
+ functions (for, e.g., :mod:`pdb`). Since Python 1.5, the line numbers in
+ tracebacks have been computed using a different mechanism that works with
+ "python -O". For Python 2.3 Michael Hudson implemented a similar scheme to
+ determine when to call the trace function, removing the need for ``SET_LINENO``
+ entirely.
+
+ It would be difficult to detect any resulting difference from Python code, apart
+ from a slight speed up when Python is run without :option:`-O`.
+
+ C extensions that access the :attr:`f_lineno` field of frame objects should
+ instead call ``PyCode_Addr2Line(f->f_code, f->f_lasti)``. This will have the
+ added effect of making the code work as desired under "python -O" in earlier
+ versions of Python.
+
+ A nifty new feature is that trace functions can now assign to the
+ :attr:`f_lineno` attribute of frame objects, changing the line that will be
+ executed next. A ``jump`` command has been added to the :mod:`pdb` debugger
+ taking advantage of this new feature. (Implemented by Richie Hindle.)
+
+.. % ======================================================================
+
+
+Porting to Python 2.3
+=====================
+
+This section lists previously described changes that may require changes to your
+code:
+
+* :keyword:`yield` is now always a keyword; if it's used as a variable name in
+ your code, a different name must be chosen.
+
+* For strings *X* and *Y*, ``X in Y`` now works if *X* is more than one
+ character long.
+
+* The :func:`int` type constructor will now return a long integer instead of
+ raising an :exc:`OverflowError` when a string or floating-point number is too
+ large to fit into an integer.
+
+* If you have Unicode strings that contain 8-bit characters, you must declare
+ the file's encoding (UTF-8, Latin-1, or whatever) by adding a comment to the top
+ of the file. See section :ref:`section-encodings` for more information.
+
+* Calling Tcl methods through :mod:`_tkinter` no longer returns only strings.
+ Instead, if Tcl returns other objects those objects are converted to their
+ Python equivalent, if one exists, or wrapped with a :class:`_tkinter.Tcl_Obj`
+ object if no Python equivalent exists.
+
+* Large octal and hex literals such as ``0xffffffff`` now trigger a
+ :exc:`FutureWarning`. Currently they're stored as 32-bit numbers and result in a
+ negative value, but in Python 2.4 they'll become positive long integers.
+
+ There are a few ways to fix this warning. If you really need a positive number,
+ just add an ``L`` to the end of the literal. If you're trying to get a 32-bit
+ integer with low bits set and have previously used an expression such as ``~(1
+ << 31)``, it's probably clearest to start with all bits set and clear the
+ desired upper bits. For example, to clear just the top bit (bit 31), you could
+ write ``0xffffffffL &~(1L<<31)``.
+
+ .. % The empty groups below prevent conversion to guillemets.
+
+* You can no longer disable assertions by assigning to ``__debug__``.
+
+* The Distutils :func:`setup` function has gained various new keyword arguments
+ such as *depends*. Old versions of the Distutils will abort if passed unknown
+ keywords. A solution is to check for the presence of the new
+ :func:`get_distutil_options` function in your :file:`setup.py` and only uses the
+ new keywords with a version of the Distutils that supports them::
+
+ from distutils import core
+
+ kw = {'sources': 'foo.c', ...}
+ if hasattr(core, 'get_distutil_options'):
+ kw['depends'] = ['foo.h']
+ ext = Extension(**kw)
+
+* Using ``None`` as a variable name will now result in a :exc:`SyntaxWarning`
+ warning.
+
+* Names of extension types defined by the modules included with Python now
+ contain the module and a ``'.'`` in front of the type name.
+
+.. % ======================================================================
+
+
+.. _acks:
+
+Acknowledgements
+================
+
+The author would like to thank the following people for offering suggestions,
+corrections and assistance with various drafts of this article: Jeff Bauer,
+Simon Brunning, Brett Cannon, Michael Chermside, Andrew Dalke, Scott David
+Daniels, Fred L. Drake, Jr., David Fraser, Kelly Gerber, Raymond Hettinger,
+Michael Hudson, Chris Lambert, Detlef Lannert, Martin von Löwis, Andrew
+MacIntyre, Lalo Martins, Chad Netzer, Gustavo Niemeyer, Neal Norwitz, Hans
+Nowak, Chris Reedy, Francesco Ricciardi, Vinay Sajip, Neil Schemenauer, Roman
+Suzi, Jason Tishler, Just van Rossum.
+