summaryrefslogtreecommitdiffstats
path: root/Doc/lib/liburllib.tex
diff options
context:
space:
mode:
authorFred Drake <fdrake@acm.org>1999-02-22 22:42:14 (GMT)
committerFred Drake <fdrake@acm.org>1999-02-22 22:42:14 (GMT)
commit1ec71cb5563b8444246559d0935418610546f13b (patch)
treedce75dc88caf88c3fc89ec1b86ad5150aef1625d /Doc/lib/liburllib.tex
parent4505895e68b1866de819566b87de44543993d5cf (diff)
downloadcpython-1ec71cb5563b8444246559d0935418610546f13b.zip
cpython-1ec71cb5563b8444246559d0935418610546f13b.tar.gz
cpython-1ec71cb5563b8444246559d0935418610546f13b.tar.bz2
Incorporated updates to describe geturl() by Sjoerd Mullender
<Sjoerd.Mullender@cwi.nl>.
Diffstat (limited to 'Doc/lib/liburllib.tex')
-rw-r--r--Doc/lib/liburllib.tex35
1 files changed, 20 insertions, 15 deletions
diff --git a/Doc/lib/liburllib.tex b/Doc/lib/liburllib.tex
index c47afe8..73898f5 100644
--- a/Doc/lib/liburllib.tex
+++ b/Doc/lib/liburllib.tex
@@ -26,10 +26,10 @@ server somewhere on the network. If the connection cannot be made, or
if the server returns an error code, the \exception{IOError} exception
is raised. If all went well, a file-like object is returned. This
supports the following methods: \method{read()}, \method{readline()},
-\method{readlines()}, \method{fileno()}, \method{close()} and
-\method{info()}.
+\method{readlines()}, \method{fileno()}, \method{close()},
+\method{info()} and \method{geturl()}.
-Except for the \method{info()} method,
+Except for the \method{info()} and \method{geturl()} methods,
these methods have the same interface as for
file objects --- see section \ref{bltin-file-objects} in this
manual. (It is not a built-in file object, however, so it can't be
@@ -47,7 +47,14 @@ request. When the method is local-file, returned headers will include
a Date representing the file's last-modified time, a Content-Length
giving file size, and a Content-Type containing a guess at the file's
type. See also the description of the
-\module{mimetools}\refstmodindex{mimetools} module.
+\refmodule{mimetools}\refstmodindex{mimetools} module.
+
+The \method{geturl()} method returns the real URL of the page. In
+some cases, the HTTP server redirects a client to another URL. The
+\function{urlopen()} function handles this transparently, but in some
+cases the caller needs to know which URL the client was redirected
+to. The \method{geturl()} method can be used to get at this
+redirected URL.
If the \var{url} uses the \file{http:} scheme identifier, the optional
\var{data} argument may be given to specify a \code{POST} request
@@ -57,7 +64,7 @@ see the \function{urlencode()} function below.
\end{funcdesc}
-\begin{funcdesc}{urlretrieve}{url\optional{, filename}\optional{, hook}}
+\begin{funcdesc}{urlretrieve}{url\optional{, filename\optional{, hook}}}
Copy a network object denoted by a URL to a local file, if necessary.
If the URL points to a local file, or a valid cached copy of the
object exists, the object is not copied. Return a tuple
@@ -154,19 +161,17 @@ web client using these functions without using threads.
\item
The data returned by \function{urlopen()} or \function{urlretrieve()}
is the raw data returned by the server. This may be binary data
-(e.g. an image), plain text or (for example) HTML. The HTTP protocol
-provides type information in the reply header, which can be inspected
-by looking at the \code{content-type} header. For the Gopher protocol,
-type information is encoded in the URL; there is currently no easy way
-to extract it. If the returned data is HTML, you can use the module
-\module{htmllib}\refstmodindex{htmllib} to parse it.
-\index{HTML}
-\indexii{HTTP}{protocol}
-\indexii{Gopher}{protocol}
+(e.g. an image), plain text or (for example) HTML\index{HTML}. The
+HTTP\indexii{HTTP}{protocol} protocol provides type information in the
+reply header, which can be inspected by looking at the
+\code{content-type} header. For the Gopher\indexii{Gopher}{protocol}
+protocol, type information is encoded in the URL; there is currently
+no easy way to extract it. If the returned data is HTML, you can use
+the module \refmodule{htmllib}\refstmodindex{htmllib} to parse it.
\item
Although the \module{urllib} module contains (undocumented) routines
to parse and unparse URL strings, the recommended interface for URL
-manipulation is in module \module{urlparse}\refstmodindex{urlparse}.
+manipulation is in module \refmodule{urlparse}\refstmodindex{urlparse}.
\end{itemize}