summaryrefslogtreecommitdiffstats
path: root/Doc/library/xmllib.rst
diff options
context:
space:
mode:
Diffstat (limited to 'Doc/library/xmllib.rst')
-rw-r--r--Doc/library/xmllib.rst271
1 files changed, 140 insertions, 131 deletions
diff --git a/Doc/library/xmllib.rst b/Doc/library/xmllib.rst
index d14754c..43cbce9 100644
--- a/Doc/library/xmllib.rst
+++ b/Doc/library/xmllib.rst
@@ -28,214 +28,223 @@ parsing text files formatted in XML (Extensible Markup Language).
The :class:`XMLParser` class must be instantiated without arguments. [#]_
-This class provides the following interface methods and instance variables:
+ This class provides the following interface methods and instance variables:
-.. attribute:: XMLParser.attributes
+ .. attribute:: attributes
- A mapping of element names to mappings. The latter mapping maps attribute names
- that are valid for the element to the default value of the attribute, or if
- there is no default to ``None``. The default value is the empty dictionary.
- This variable is meant to be overridden, not extended since the default is
- shared by all instances of :class:`XMLParser`.
+ A mapping of element names to mappings. The latter mapping maps attribute
+ names that are valid for the element to the default value of the
+ attribute, or if there is no default to ``None``. The default value is
+ the empty dictionary. This variable is meant to be overridden, not
+ extended since the default is shared by all instances of
+ :class:`XMLParser`.
-.. attribute:: XMLParser.elements
+ .. attribute:: elements
- A mapping of element names to tuples. The tuples contain a function for
- handling the start and end tag respectively of the element, or ``None`` if the
- method :meth:`unknown_starttag` or :meth:`unknown_endtag` is to be called. The
- default value is the empty dictionary. This variable is meant to be overridden,
- not extended since the default is shared by all instances of :class:`XMLParser`.
+ A mapping of element names to tuples. The tuples contain a function for
+ handling the start and end tag respectively of the element, or ``None`` if
+ the method :meth:`unknown_starttag` or :meth:`unknown_endtag` is to be
+ called. The default value is the empty dictionary. This variable is
+ meant to be overridden, not extended since the default is shared by all
+ instances of :class:`XMLParser`.
-.. attribute:: XMLParser.entitydefs
+ .. attribute:: entitydefs
- A mapping of entitynames to their values. The default value contains
- definitions for ``'lt'``, ``'gt'``, ``'amp'``, ``'quot'``, and ``'apos'``.
+ A mapping of entitynames to their values. The default value contains
+ definitions for ``'lt'``, ``'gt'``, ``'amp'``, ``'quot'``, and ``'apos'``.
-.. method:: XMLParser.reset()
+ .. method:: reset()
- Reset the instance. Loses all unprocessed data. This is called implicitly at
- the instantiation time.
+ Reset the instance. Loses all unprocessed data. This is called
+ implicitly at the instantiation time.
-.. method:: XMLParser.setnomoretags()
+ .. method:: setnomoretags()
- Stop processing tags. Treat all following input as literal input (CDATA).
+ Stop processing tags. Treat all following input as literal input (CDATA).
-.. method:: XMLParser.setliteral()
+ .. method:: setliteral()
- Enter literal mode (CDATA mode). This mode is automatically exited when the
- close tag matching the last unclosed open tag is encountered.
+ Enter literal mode (CDATA mode). This mode is automatically exited when
+ the close tag matching the last unclosed open tag is encountered.
-.. method:: XMLParser.feed(data)
+ .. method:: feed(data)
- Feed some text to the parser. It is processed insofar as it consists of
- complete tags; incomplete data is buffered until more data is fed or
- :meth:`close` is called.
+ Feed some text to the parser. It is processed insofar as it consists of
+ complete tags; incomplete data is buffered until more data is fed or
+ :meth:`close` is called.
-.. method:: XMLParser.close()
+ .. method:: close()
- Force processing of all buffered data as if it were followed by an end-of-file
- mark. This method may be redefined by a derived class to define additional
- processing at the end of the input, but the redefined version should always call
- :meth:`close`.
+ Force processing of all buffered data as if it were followed by an
+ end-of-file mark. This method may be redefined by a derived class to
+ define additional processing at the end of the input, but the redefined
+ version should always call :meth:`close`.
-.. method:: XMLParser.translate_references(data)
+ .. method:: translate_references(data)
- Translate all entity and character references in *data* and return the
- translated string.
+ Translate all entity and character references in *data* and return the
+ translated string.
-.. method:: XMLParser.getnamespace()
+ .. method:: getnamespace()
- Return a mapping of namespace abbreviations to namespace URIs that are currently
- in effect.
+ Return a mapping of namespace abbreviations to namespace URIs that are
+ currently in effect.
-.. method:: XMLParser.handle_xml(encoding, standalone)
+ .. method:: handle_xml(encoding, standalone)
- This method is called when the ``<?xml ...?>`` tag is processed. The arguments
- are the values of the encoding and standalone attributes in the tag. Both
- encoding and standalone are optional. The values passed to :meth:`handle_xml`
- default to ``None`` and the string ``'no'`` respectively.
+ This method is called when the ``<?xml ...?>`` tag is processed. The
+ arguments are the values of the encoding and standalone attributes in the
+ tag. Both encoding and standalone are optional. The values passed to
+ :meth:`handle_xml` default to ``None`` and the string ``'no'``
+ respectively.
-.. method:: XMLParser.handle_doctype(tag, pubid, syslit, data)
+ .. method:: handle_doctype(tag, pubid, syslit, data)
- .. index::
- single: DOCTYPE declaration
- single: Formal Public Identifier
+ .. index::
+ single: DOCTYPE declaration
+ single: Formal Public Identifier
- This method is called when the ``<!DOCTYPE...>`` declaration is processed. The
- arguments are the tag name of the root element, the Formal Public Identifier (or
- ``None`` if not specified), the system identifier, and the uninterpreted
- contents of the internal DTD subset as a string (or ``None`` if not present).
+ This method is called when the ``<!DOCTYPE...>`` declaration is processed.
+ The arguments are the tag name of the root element, the Formal Public
+ Identifier (or ``None`` if not specified), the system identifier, and the
+ uninterpreted contents of the internal DTD subset as a string (or ``None``
+ if not present).
-.. method:: XMLParser.handle_starttag(tag, method, attributes)
+ .. method:: handle_starttag(tag, method, attributes)
- This method is called to handle start tags for which a start tag handler is
- defined in the instance variable :attr:`elements`. The *tag* argument is the
- name of the tag, and the *method* argument is the function (method) which should
- be used to support semantic interpretation of the start tag. The *attributes*
- argument is a dictionary of attributes, the key being the *name* and the value
- being the *value* of the attribute found inside the tag's ``<>`` brackets.
- Character and entity references in the *value* have been interpreted. For
- instance, for the start tag ``<A HREF="http://www.cwi.nl/">``, this method would
- be called as ``handle_starttag('A', self.elements['A'][0], {'HREF':
- 'http://www.cwi.nl/'})``. The base implementation simply calls *method* with
- *attributes* as the only argument.
+ This method is called to handle start tags for which a start tag handler
+ is defined in the instance variable :attr:`elements`. The *tag* argument
+ is the name of the tag, and the *method* argument is the function (method)
+ which should be used to support semantic interpretation of the start tag.
+ The *attributes* argument is a dictionary of attributes, the key being the
+ *name* and the value being the *value* of the attribute found inside the
+ tag's ``<>`` brackets. Character and entity references in the *value*
+ have been interpreted. For instance, for the start tag ``<A
+ HREF="http://www.cwi.nl/">``, this method would be called as
+ ``handle_starttag('A', self.elements['A'][0], {'HREF':
+ 'http://www.cwi.nl/'})``. The base implementation simply calls *method*
+ with *attributes* as the only argument.
-.. method:: XMLParser.handle_endtag(tag, method)
+ .. method:: handle_endtag(tag, method)
- This method is called to handle endtags for which an end tag handler is defined
- in the instance variable :attr:`elements`. The *tag* argument is the name of
- the tag, and the *method* argument is the function (method) which should be used
- to support semantic interpretation of the end tag. For instance, for the endtag
- ``</A>``, this method would be called as ``handle_endtag('A',
- self.elements['A'][1])``. The base implementation simply calls *method*.
+ This method is called to handle endtags for which an end tag handler is
+ defined in the instance variable :attr:`elements`. The *tag* argument is
+ the name of the tag, and the *method* argument is the function (method)
+ which should be used to support semantic interpretation of the end tag.
+ For instance, for the endtag ``</A>``, this method would be called as
+ ``handle_endtag('A', self.elements['A'][1])``. The base implementation
+ simply calls *method*.
-.. method:: XMLParser.handle_data(data)
+ .. method:: handle_data(data)
- This method is called to process arbitrary data. It is intended to be
- overridden by a derived class; the base class implementation does nothing.
+ This method is called to process arbitrary data. It is intended to be
+ overridden by a derived class; the base class implementation does nothing.
-.. method:: XMLParser.handle_charref(ref)
+ .. method:: handle_charref(ref)
- This method is called to process a character reference of the form ``&#ref;``.
- *ref* can either be a decimal number, or a hexadecimal number when preceded by
- an ``'x'``. In the base implementation, *ref* must be a number in the range
- 0-255. It translates the character to ASCII and calls the method
- :meth:`handle_data` with the character as argument. If *ref* is invalid or out
- of range, the method ``unknown_charref(ref)`` is called to handle the error. A
- subclass must override this method to provide support for character references
- outside of the ASCII range.
+ This method is called to process a character reference of the form
+ ``&#ref;``. *ref* can either be a decimal number, or a hexadecimal number
+ when preceded by an ``'x'``. In the base implementation, *ref* must be a
+ number in the range 0-255. It translates the character to ASCII and calls
+ the method :meth:`handle_data` with the character as argument. If *ref*
+ is invalid or out of range, the method ``unknown_charref(ref)`` is called
+ to handle the error. A subclass must override this method to provide
+ support for character references outside of the ASCII range.
-.. method:: XMLParser.handle_comment(comment)
+ .. method:: handle_comment(comment)
- This method is called when a comment is encountered. The *comment* argument is
- a string containing the text between the ``<!--`` and ``-->`` delimiters, but
- not the delimiters themselves. For example, the comment ``<!--text-->`` will
- cause this method to be called with the argument ``'text'``. The default method
- does nothing.
+ This method is called when a comment is encountered. The *comment*
+ argument is a string containing the text between the ``<!--`` and ``-->``
+ delimiters, but not the delimiters themselves. For example, the comment
+ ``<!--text-->`` will cause this method to be called with the argument
+ ``'text'``. The default method does nothing.
-.. method:: XMLParser.handle_cdata(data)
+ .. method:: handle_cdata(data)
- This method is called when a CDATA element is encountered. The *data* argument
- is a string containing the text between the ``<![CDATA[`` and ``]]>``
- delimiters, but not the delimiters themselves. For example, the entity
- ``<![CDATA[text]]>`` will cause this method to be called with the argument
- ``'text'``. The default method does nothing, and is intended to be overridden.
+ This method is called when a CDATA element is encountered. The *data*
+ argument is a string containing the text between the ``<![CDATA[`` and
+ ``]]>`` delimiters, but not the delimiters themselves. For example, the
+ entity ``<![CDATA[text]]>`` will cause this method to be called with the
+ argument ``'text'``. The default method does nothing, and is intended to
+ be overridden.
-.. method:: XMLParser.handle_proc(name, data)
+ .. method:: handle_proc(name, data)
- This method is called when a processing instruction (PI) is encountered. The
- *name* is the PI target, and the *data* argument is a string containing the text
- between the PI target and the closing delimiter, but not the delimiter itself.
- For example, the instruction ``<?XML text?>`` will cause this method to be
- called with the arguments ``'XML'`` and ``'text'``. The default method does
- nothing. Note that if a document starts with ``<?xml ..?>``, :meth:`handle_xml`
- is called to handle it.
+ This method is called when a processing instruction (PI) is encountered.
+ The *name* is the PI target, and the *data* argument is a string
+ containing the text between the PI target and the closing delimiter, but
+ not the delimiter itself. For example, the instruction ``<?XML text?>``
+ will cause this method to be called with the arguments ``'XML'`` and
+ ``'text'``. The default method does nothing. Note that if a document
+ starts with ``<?xml ..?>``, :meth:`handle_xml` is called to handle it.
-.. method:: XMLParser.handle_special(data)
+ .. method:: handle_special(data)
- .. index:: single: ENTITY declaration
+ .. index:: single: ENTITY declaration
- This method is called when a declaration is encountered. The *data* argument is
- a string containing the text between the ``<!`` and ``>`` delimiters, but not
- the delimiters themselves. For example, the entity declaration ``<!ENTITY
- text>`` will cause this method to be called with the argument ``'ENTITY text'``.
- The default method does nothing. Note that ``<!DOCTYPE ...>`` is handled
- separately if it is located at the start of the document.
+ This method is called when a declaration is encountered. The *data*
+ argument is a string containing the text between the ``<!`` and ``>``
+ delimiters, but not the delimiters themselves. For example, the entity
+ declaration ``<!ENTITY text>`` will cause this method to be called with
+ the argument ``'ENTITY text'``. The default method does nothing. Note
+ that ``<!DOCTYPE ...>`` is handled separately if it is located at the
+ start of the document.
-.. method:: XMLParser.syntax_error(message)
+ .. method:: syntax_error(message)
- This method is called when a syntax error is encountered. The *message* is a
- description of what was wrong. The default method raises a :exc:`RuntimeError`
- exception. If this method is overridden, it is permissible for it to return.
- This method is only called when the error can be recovered from. Unrecoverable
- errors raise a :exc:`RuntimeError` without first calling :meth:`syntax_error`.
+ This method is called when a syntax error is encountered. The *message*
+ is a description of what was wrong. The default method raises a
+ :exc:`RuntimeError` exception. If this method is overridden, it is
+ permissible for it to return. This method is only called when the error
+ can be recovered from. Unrecoverable errors raise a :exc:`RuntimeError`
+ without first calling :meth:`syntax_error`.
-.. method:: XMLParser.unknown_starttag(tag, attributes)
+ .. method:: unknown_starttag(tag, attributes)
- This method is called to process an unknown start tag. It is intended to be
- overridden by a derived class; the base class implementation does nothing.
+ This method is called to process an unknown start tag. It is intended to
+ be overridden by a derived class; the base class implementation does nothing.
-.. method:: XMLParser.unknown_endtag(tag)
+ .. method:: unknown_endtag(tag)
- This method is called to process an unknown end tag. It is intended to be
- overridden by a derived class; the base class implementation does nothing.
+ This method is called to process an unknown end tag. It is intended to be
+ overridden by a derived class; the base class implementation does nothing.
-.. method:: XMLParser.unknown_charref(ref)
+ .. method:: unknown_charref(ref)
- This method is called to process unresolvable numeric character references. It
- is intended to be overridden by a derived class; the base class implementation
- does nothing.
+ This method is called to process unresolvable numeric character
+ references. It is intended to be overridden by a derived class; the base
+ class implementation does nothing.
-.. method:: XMLParser.unknown_entityref(ref)
+ .. method:: unknown_entityref(ref)
- This method is called to process an unknown entity reference. It is intended to
- be overridden by a derived class; the base class implementation calls
- :meth:`syntax_error` to signal an error.
+ This method is called to process an unknown entity reference. It is
+ intended to be overridden by a derived class; the base class
+ implementation calls :meth:`syntax_error` to signal an error.
.. seealso::