summaryrefslogtreecommitdiffstats
path: root/Lib/urllib.py
Commit message (Collapse)AuthorAgeFilesLines
* Sjoerd Mullender:Guido van Rossum1998-12-181-6/+16
| | | | | | | | | | | | | | | | | | | | | | | | | File names with "funny" characters get translated wrong by pathname2url (any variety). E.g. the (Unix) file "/ufs/sjoerd/#tmp" gets translated into "/ufs/sjoerd/#tmp" which, when interpreted as a URL is file "/ufs/sjoerd/" with fragment ID "tmp". Here's an easy fix. (An alternative fix would be to change the various implementations of pathname2url and url2pathname to include calls to quote and unquote. [The main problem is with the normal use of URLs: url = url2pathname(file) transmit url url, tag = splittag(url) urlopen(url) ] In addition, this patch fixes some uses of unquote: - the host part of URLs should be unquoted - the file path in the FTP URL should be unquoted before it is split into components. - because of the latter, I removed all unquoting from ftpwrapper, and moved it to the caller, but that is not essential
* Changes by Eric Raymond:Guido van Rossum1998-09-281-14/+32
| | | | | | | | | | | | 1. Generate a correct Content-Length header visible through the info() method if a request to open an FTP URL gets a length in the response to RETR. 2. Take a third argument to urlretrieve() that makes it possible to progress- meter an urlretrieve call (this is what I needed the above change for). See the second patch band below for details. 3. To avoid spurious errors, I commented out the gopher test. The target document no longer exists.
* On the Mac, use Internet Config to find the proxies (Jack Jansen).Guido van Rossum1998-08-061-15/+49
| | | | Also added two XXX comments about lingering thread unsafeness.
* Speed up the implementation of quote().Guido van Rossum1998-07-221-8/+17
| | | | | | | | Fix the implementation of quote_plus(). (It wouldn't treat '+' in the original data right.) Add urlencode(dict) which is handy to create the data for sending a POST request with urlopen().
* Oops! Of course, Tim is right -- when the item is not a hex number,Guido van Rossum1998-06-291-2/+2
| | | | the '%' should be put back in.
* Speed-up unquote(), inspired by post from Daniel Walton.Guido van Rossum1998-06-281-18/+16
|
* Rewrite the (test) main program so that when used as a script, it canGuido van Rossum1998-06-251-5/+29
| | | | retrieve one or more URLs to stdout. Use -t to run the self-test.
* Use the getpass module instead of having platform-specific echo on/offGuido van Rossum1998-06-121-18/+5
| | | | code here.
* Patches to make the proxy code work again. (Why does that always breakGuido van Rossum1998-05-051-2/+3
| | | | | | as soon as I change things even just a little bit? :-) Even works when accessing a password-protected page through the proxy. Prompted by complaints from, and correct operation verified by, Nigel O'Brian.
* Feeble attempt at making urlopen more robust -- don't call splituser()Guido van Rossum1998-04-271-2/+6
| | | | | when splithost() returned no useable host, to avoid calling splituser() on None.
* Oops -- remove some debug print statements!Guido van Rossum1998-04-111-4/+0
|
* Oops, pulled over by the tab police :-)Guido van Rossum1998-04-031-9/+9
|
* Change by Sjoerd (with minor reformatting):Guido van Rossum1998-04-031-4/+18
| | | | | | | | | | guess the mime type of a local file. Change suggested by Sjoerd (with different implementation): when retrieve() creates a temporary file, preserve the suffix. Corrollary of the first change: also return the mime type of a local file in retrieve().
* A few lines were indented using spaces instead of tabs -- fix them.Guido van Rossum1998-03-301-8/+8
|
* A few lines were indented using spaces instead of tabs -- fix them.Guido van Rossum1998-03-261-64/+65
|
* Added support for "data" URL, by Sjoerd Mullender.Guido van Rossum1998-03-121-0/+40
|
* Fix a horrible race condition -- various routines were storing theGuido van Rossum1998-02-131-9/+7
| | | | | | | | | | | most recently opened URL in self.openedurl of the URLopener instance. This doesn't really work if multiple threads share the same opener instance! Fix: openedurl was actually simply the type prefix (e.g. "http:") followed by the rest of the URL; since the rest of the URL is available and the type is effectively determined by where you are in the code, I can reconstruct the full URL easily, e.g. "http:" + url.
* Two suggested features by Sjoerd:Guido van Rossum1998-02-051-6/+7
| | | | | | - use the tempcache in the open() method, too. - use the "unwrap"ped url as key for the tempcache.
* (1) Use matchobj.groups(), not matchbj.group() to get all groups.Guido van Rossum1998-02-051-4/+8
| | | | | (2) Provisional hack to avoid dying when trying to turn echo on or off on Macs, where os.system() doesn't exist.
* Fix bad new bug in ftp code -- the test for existing file using NLSTGuido van Rossum1998-01-191-0/+4
| | | | would set the transfer to text mode instead of the specified mode.
* Solve two annoying problems with ftp URLs for Jack: when repeatedGuido van Rossum1997-12-281-0/+14
| | | | | | | | | | retrieving files from the same host and directory, you had to close the previous instance before opening a new one; and retrieving a non-existent file would return an empty file. (The latter fix relies on maybe an undocumented property of NLST -- NLST of a file returns just that file, while NLST of a non-existent file returns nothing. A side effect, unfortunately, seems to be that now ftp-retrieving an *empty* directory may fail. Ah well.)
* When a port is specified in an ftp:// URL, must convert it to a number!Guido van Rossum1997-12-021-0/+2
|
* Assert that the proxies object passed in to the URLopener constructorGuido van Rossum1997-11-181-0/+1
| | | | is indeed a dictionary (or a mapping).
* This should hopefully finally clean up the remaining __del__ relatedGuido van Rossum1997-10-271-9/+11
| | | | | problems with this module, even if an instance of a derived class is kept alive longer than the urllib module itself...
* splitpasswd(): The parameter is named "user", not "host".Fred Drake1997-10-141-1/+1
|
* Use sys.exc_info() where needed.Guido van Rossum1997-09-291-44/+106
| | | | Use "re" module, making it threadsafe.
* Initialize self.__tempfiles to [] in the constructor (else it remainsGuido van Rossum1997-09-081-0/+1
| | | | | a shared class variable -- but each instance will attempt to clean it up entirely ob cleanup).
* One patch from Sjoerd and one from Jack.Guido van Rossum1997-09-031-7/+14
| | | | | | | | | | | Sjoerd: add separate administration of temporary files created y URLopener.retrieve() so cleanup can properly remove them. The old code removed everything in tempcache which was a bad idea if the user had passed a non-temp file into it. (I added a line to delete the tempcache in cleanup() -- it still seems to make sense.) Jack: in basejoin(), interpret relative paths starting in "../". This is necessary if the server uses symbolic links.
* Explicitly close the socket and temp file in URLopener.retrieve(), soGuido van Rossum1997-08-261-0/+2
| | | | | | | | | that multiple retrievals using the same connection will work. This leaves open the more general problem that after f = urlopen("ftp://...") f must be closed before another retrieval from the same host should be attempted.
* Interpret three slashes in file: URL as local file (for Netscape onGuido van Rossum1997-08-201-4/+4
| | | | Windows/Mac).
* Clear the ftp cache when it contains more than 10 entries.Guido van Rossum1997-06-061-2/+21
|
* Catch *all* errors that ftplib can raise (ftplib.all_errors) ratherGuido van Rossum1997-06-061-8/+10
| | | | | | than just the four. Also folded some long lines.
* Add Host: header to URL request.Guido van Rossum1997-06-031-1/+5
|
* lowercase proxies env variables, for Windows.Guido van Rossum1997-05-281-0/+1
|
* spliturl() should not throw away everything past first newlineGuido van Rossum1997-04-161-2/+4
|
* (Jack+Guido:) interpret ../ in basejoin() according to RFC1808.Guido van Rossum1997-04-111-0/+7
|
* Open files in binary mode.Guido van Rossum1997-04-111-4/+4
|
* Fix two small bugs with proxies.Guido van Rossum1997-04-021-1/+2
|
* Put a new, more useful, set of references in the leading comment.Guido van Rossum1997-02-101-6/+13
|
* Move the 'import os' in URLopener.cleanup() to inside the blockGuido van Rossum1997-01-301-1/+1
| | | | | guarded by 'if self.tempcache', to reduce the likelihood of this causing an exception when invoked during __del__...
* Added quote_plus() and unquote_plus(), to do space/plus substitutionsGuido van Rossum1996-12-131-0/+14
| | | | for form field values.
* Add optional data argument to urlopen() and open_http(), to trigger POST.Guido van Rossum1996-12-101-9/+26
|
* Added safeguard against failure in __del__.Guido van Rossum1996-11-271-0/+2
|
* When re-raising an exception raised by a module used internally asGuido van Rossum1996-11-201-3/+5
| | | | IOError, keep the traceback.
* Fix the way the Authorization header is sent (how could this have worked?).Guido van Rossum1996-11-111-1/+1
|
* Fix another case where...Guido van Rossum1996-10-221-0/+1
|
* Fix some cases where self.openedurl wasn't set.Guido van Rossum1996-10-101-2/+3
|
* One fix by sjoerd and one suggested by him. Bumped __version__ to 1.5.Guido van Rossum1996-09-101-9/+24
| | | | | Fix 1: add a method geturl() to the file-like object returned by urlopen(). Fix 2: treat http error 301 the same as error 302.
* Bump exposed __version__ to 1.4.Guido van Rossum1996-08-261-6/+9
| | | | Correctly handle a URL containing user:password@host when using a proxy.
* Slightly faster (un)quoting.Guido van Rossum1996-08-261-8/+8
|