diff options
author | Fred Drake <fdrake@acm.org> | 2004-09-10 01:20:21 (GMT) |
---|---|---|
committer | Fred Drake <fdrake@acm.org> | 2004-09-10 01:20:21 (GMT) |
commit | 961c2882a920f137543585175db9957bf48223f3 (patch) | |
tree | df8ee0408a4d7890b09a51611012ab746bda48e7 /Doc/lib | |
parent | a2544ee7f0ecbab945d1bb6bb8de6a72c71768ed (diff) | |
download | cpython-961c2882a920f137543585175db9957bf48223f3.zip cpython-961c2882a920f137543585175db9957bf48223f3.tar.gz cpython-961c2882a920f137543585175db9957bf48223f3.tar.bz2 |
document the exceptions raised by sgmllib, htmllib, and HTMLParser
Diffstat (limited to 'Doc/lib')
-rw-r--r-- | Doc/lib/libhtmllib.tex | 15 | ||||
-rw-r--r-- | Doc/lib/libhtmlparser.tex | 13 | ||||
-rw-r--r-- | Doc/lib/libsgmllib.tex | 12 |
3 files changed, 34 insertions, 6 deletions
diff --git a/Doc/lib/libhtmllib.tex b/Doc/lib/libhtmllib.tex index 1cddd0a..a84dd85 100644 --- a/Doc/lib/libhtmllib.tex +++ b/Doc/lib/libhtmllib.tex @@ -35,8 +35,8 @@ The interface to feed data to an instance is through the \method{feed()} method, which takes a string argument. This can be called with as little or as much text at a time as desired; \samp{p.feed(a); p.feed(b)} has the same effect as \samp{p.feed(a+b)}. When the data -contains complete HTML tags, these are processed immediately; -incomplete elements are saved in a buffer. To force processing of all +contains complete HTML markup constructs, these are processed immediately; +incomplete constructs are saved in a buffer. To force processing of all unprocessed data, call the \method{close()} method. For example, to parse the entire contents of a file, use: @@ -60,7 +60,7 @@ should define the \method{do_\var{tag}()} method. \end{itemize} -The module defines a single class: +The module defines a parser class and an exception: \begin{classdesc}{HTMLParser}{formatter} This is the basic HTML parser class. It supports all entity names @@ -68,6 +68,12 @@ required by the XHTML 1.0 Recommendation (\url{http://www.w3.org/TR/xhtml1}). It also defines handlers for all HTML 2.0 and many HTML 3.0 and 3.2 elements. \end{classdesc} +\begin{excdesc}{HTMLParseError} +Exception raised by the \class{HTMLParser} class when it encounters an +error while parsing. +\versionadded{2.4} +\end{excdesc} + \begin{seealso} \seemodule{formatter}{Interface definition for transforming an @@ -118,7 +124,8 @@ implementation adds a textual footnote marker using an index into the list of hyperlinks created by \method{anchor_bgn()}. \end{methoddesc} -\begin{methoddesc}{handle_image}{source, alt\optional{, ismap\optional{, align\optional{, width\optional{, height}}}}} +\begin{methoddesc}{handle_image}{source, alt\optional{, ismap\optional{, + align\optional{, width\optional{, height}}}}} This method is called to handle images. The default implementation simply passes the \var{alt} value to the \method{handle_data()} method. diff --git a/Doc/lib/libhtmlparser.tex b/Doc/lib/libhtmlparser.tex index 5ba8e86..b85ba56 100644 --- a/Doc/lib/libhtmlparser.tex +++ b/Doc/lib/libhtmlparser.tex @@ -4,6 +4,8 @@ \declaremodule{standard}{HTMLParser} \modulesynopsis{A simple parser that can handle HTML and XHTML.} +\versionadded{2.2} + This module defines a class \class{HTMLParser} which serves as the basis for parsing text files formatted in HTML\index{HTML} (HyperText Mark-up Language) and XHTML.\index{XHTML} Unlike the parser in @@ -23,6 +25,17 @@ that end tags match start tags or call the end-tag handler for elements which are closed implicitly by closing an outer element. \end{classdesc} +An exception is defined as well: + +\begin{excdesc}{HTMLParseError} +Exception raised by the \class{HTMLParser} class when it encounters an +error while parsing. This exception provides three attributes: +\member{msg} is a brief message explaining the error, \member{lineno} +is the number of the line on which the broken construct was detected, +and \member{offset} is the number of characters into the line at which +the construct starts. +\end{excdesc} + \class{HTMLParser} instances have the following methods: diff --git a/Doc/lib/libsgmllib.tex b/Doc/lib/libsgmllib.tex index 3699d24..27bf0b0 100644 --- a/Doc/lib/libsgmllib.tex +++ b/Doc/lib/libsgmllib.tex @@ -14,7 +14,6 @@ only exists as a base for the \refmodule{htmllib} module. Another HTML parser which supports XHTML and offers a somewhat different interface is available in the \refmodule{HTMLParser} module. - \begin{classdesc}{SGMLParser}{} The \class{SGMLParser} class is instantiated without arguments. The parser is hardcoded to recognize the following @@ -40,7 +39,16 @@ spaces, tabs, and newlines are allowed between the trailing \end{itemize} \end{classdesc} -\class{SGMLParser} instances have the following interface methods: +A single exception is defined as well: + +\begin{excdesc}{SGMLParseError} +Exception raised by the \class{SGMLParser} class when it encounters an +error while parsing. +\versionadded{2.1} +\end{excdesc} + + +\class{SGMLParser} instances have the following methods: \begin{methoddesc}{reset}{} |