summaryrefslogtreecommitdiffstats
path: root/Lib/urlparse.py
Commit message (Collapse)AuthorAgeFilesLines
* Merge p3yk branch with the trunk up to revision 45595. This breaks a fairThomas Wouters2006-04-211-13/+118
| | | | | | | | | | | | | | | | | | | | number of tests, all because of the codecs/_multibytecodecs issue described here (it's not a Py3K issue, just something Py3K discovers): http://mail.python.org/pipermail/python-dev/2006-April/064051.html Hye-Shik Chang promised to look for a fix, so no need to fix it here. The tests that are expected to break are: test_codecencodings_cn test_codecencodings_hk test_codecencodings_jp test_codecencodings_kr test_codecencodings_tw test_codecs test_multibytecodec This merge fixes an actual test failure (test_weakref) in this branch, though, so I believe merging is the right thing to do anyway.
* Bug #1407902: Added support for sftp:// URIs to urlparse.Georg Brandl2006-01-201-11/+11
|
* add support for svn: and svn+ssh: URL schemes to urlparseFred Drake2005-07-291-1/+2
|
* Patch #712317: In URLs such as http://www.example.com?query=spam, treat '?' asJohannes Gijsbers2005-01-091-13/+12
| | | | | | a delimiter. Previously, the 'network location' (<authority> in RFC 2396) would become 'www.example.com?query=spam', while RFC 2396 does not allow a '?' in <authority>. See bug #548176 for further discussion.
* Use cStringIO where available.Raymond Hettinger2004-12-311-2/+5
|
* rsync is now a recognized protocol that uses "netloc" (i.e. specifies a networkBrett Cannon2004-06-291-1/+1
| | | | | | location) in its addressing. Closes bug #981299.
* Revert last change.Raymond Hettinger2004-05-071-12/+12
|
* Use sets instead of lists for membership testingRaymond Hettinger2004-05-061-12/+12
|
* See rev. 1.42 for log messageBrett Cannon2003-10-121-6/+2
|
* Patch #712124: Remove obsolete comment.Martin v. Löwis2003-03-301-1/+0
|
* Revert change 1.37.Raymond Hettinger2003-01-071-12/+12
| | | | | The nanoseconds saved by using dict.fromkeys aren't worth the loss in clarity. Linear searches live on.
* * add mms (windows media) as another schemeSkip Montanaro2003-01-061-19/+15
| | | | * reformat schemes to 80 columns
* Used dictionaries rather than lists for membership testing.Raymond Hettinger2003-01-061-12/+12
|
* SF feature #618024, urlparse fails on imap://Neal Norwitz2003-01-061-5/+5
|
* Added missing entries to __all__.Fred Drake2002-10-161-1/+2
|
* Fix for 1.33: urlsplit() should only add '//' if scheme != ''.Guido van Rossum2002-10-141-1/+1
| | | | Will add test and backport.
* Fix SF # 591713, Fix "file:" URL to have right no. of /'s, by Bruce AthertonNeal Norwitz2002-09-251-1/+1
| | | | | | Add a test too. urljoin() would make file:/tmp/foo instead of file:///tmp/foo Bugfix candidate, I will backport.
* amk's fix attached toMichael W. Hudson2002-03-181-1/+3
| | | | [ 516299 ] urlparse can get fragments wrong
* Fix parsing of parameters from a URL; urlparse() did not check that it onlyFred Drake2001-11-161-34/+46
| | | | | | | | split parameters from the last path segment. Introduces two new functions, urlsplit() and urlunsplit(), that do the simpler job of splitting the URL without monkeying around with the parameters field, since that was not being handled properly. This closes bug #478038.
* Remove unused variableAndrew M. Kuchling2001-08-131-1/+1
|
* final round of __all__ lists (I hope) - skipped urllib2 because Moshe may beSkip Montanaro2001-03-011-0/+2
| | | | giving it a slight facelift
* Whitespace normalization. Top level of Lib now fixed-point for reindent.py!Tim Peters2001-01-151-185/+185
|
* urlunparse(): Do not add a leading slash to the path if it is empty.Fred Drake2001-01-051-6/+12
| | | | | | | | | urljoin(): Make this conform to RFC 1808 for all examples given in that RFC (both "Normal" and "Abnormal"), so long as that RFC does not conflict the older RFC 1630, which also specified relative URL resolution. This closes SF bug #110832 (Jitterbug PR#194).
* Be explicit about scheme_chars -- string.letters is locale dependentGuido van Rossum2000-12-191-20/+18
| | | | | | | | so we can't use it. While I'm at it, got rid of string module use. (Found several new hard special cases for a hypothetical conversion tool: from string import join, find, rfind; and a local assignment "find=string.find".)
* Pekka Pessi <Pekka.Pessi@nokia.com>:Fred Drake2000-06-201-3/+3
| | | | | Patch to add support for sip: (Session Initiation Protocol, RFC2543) URLs.
* Anthony Baxter <anthony@interlink.com.au>:Fred Drake2000-04-141-4/+4
| | | | | | | | The following adds support for RTSP (RFC2326) URLs to the standard urlparse.py module. (Augmented by FLD to include rtspu:, specified in the same RFC & OK'd by Anthony.)
* Some cleanup -- don't use splitfields/joinfields, standardizeGuido van Rossum2000-04-101-19/+17
| | | | indentation (tabs only), rationalize some code in urljoin...
* The third and final doc-string sweep by Ka-Ping Yee.Guido van Rossum2000-02-041-13/+16
| | | | | | | | The attached patches update the standard library so that all modules have docstrings beginning with one-line summaries. A new docstring was added to formatter. The docstring for os.py was updated to mention nt, os2, ce in addition to posix, dos, mac.
* No need to import find(). (Andrew Dalke & kjpylint)Guido van Rossum1999-05-031-1/+1
|
* Sjoerd Mullender writes:Guido van Rossum1999-03-181-2/+2
| | | | | | | | | | If a filename on Windows starts with \\, it is converted to a URL which starts with ////. If this URL is passed to urlparse.urlparse you get a path that starts with // (and an empty netloc). If you pass the result back to urlparse.urlunparse, you get a URL that starts with //, which is parsed differently by urlparse.urlparse. The fix is to add the (empty) netloc with accompanying slashes if the path in urlunparse starts with //. Do this for all schemes that use a netloc.
* Delete non-standard-conforming code in urljoin() that would use theGuido van Rossum1999-03-171-12/+0
| | | | | | | | | | | | | | | | | | | | | | netloc from the base url as the default netloc for the resulting url even if the schemes differ. Once upon a time, when the web was wild, this was a valuable hack because some people had a URL referencing an ftp server colocated with an http server without having the host in the ftp URL (so they could replicate it or change the hostname easily). More recently, after the file: scheme got added back to the list of schemes that accept a netloc, it turns out that this caused weirdness when joining an http: URL with a file: URL -- the resulting file: URL would always inherit the host from the http: URL because the file: scheme supports a netloc but in practice never has one. There are two reasons to get rid of the old, once-valuable hack, instead of removing the file: scheme from the uses_netloc list. One, the RFC says that file: uses the netloc syntax, and does not endorse the old hack. Two, neither netscape 4.5 nor IE 4.0 support the old hack.
* Steve Clift pointed out that 'file' allows a netloc.Guido van Rossum1999-02-221-0/+1
|
* Fixed bug in the common-case code for HTTP URLs; it would lose the query,Andrew M. Kuchling1999-01-061-4/+7
| | | | | fragment, and/or parameter information. 3 cases added to the test suite to check for this bug.
* Add XXX comment about a test that doesn't seem right -- no time toGuido van Rossum1998-12-211-0/+1
| | | | explore this now.
* Easy optimizations of urlparse for the common case of parsing an http URL.Jeremy Hylton1998-09-021-8/+33
| | | | | | | 1. use dict.get instead of try/except KeyError 2. if the url scheme is 'http' then avoid the series of 'if var in [someseq]:'. instead, inline all of the code. 3. find = string.find
* fix typo in keyword argument 'allow_frament' should be 'allow_fragment'Jeremy Hylton1998-08-251-6/+6
|
* Add Gopher to list of protocols that support query strings.Guido van Rossum1998-01-191-0/+1
|
* Patch my Marc Lemburg to fix urljoin("/a", "..") and urljoin("/a", "..#1").Guido van Rossum1997-12-031-1/+3
|
* After some discussion with Jeremy and Fred, decided to limit theGuido van Rossum1997-07-141-1/+1
| | | | | | | | | default urlparse cache size to 20 instead of 2000. The main use of the cache seems to be to gain some speed in Grail, which is calling urljoin with the same base for each anchor. 2000 is a bit too big for Jeremy, who doesn't need the cache at all. 20 should keep at least 95% of the Grail speedup while wasting an insignificant amount of memory in Jeremy's application.
* Test urlparse cache with try/except instead of has_key.Guido van Rossum1997-07-111-1/+3
| | | | This makes it thread-safe again.
* Added characteristics of shttp, https, and snews.Guido van Rossum1997-01-021-3/+12
|
* Crude but effective hack to clear the parser cache every so often.Guido van Rossum1996-12-271-0/+4
| | | | (Fred Drake.)
* optimizations due to Fred Drake; added urldefrag() functionGuido van Rossum1996-05-281-18/+35
|
* added hdl protocol propertiesGuido van Rossum1996-05-281-3/+3
|
* Update reference (it's now RFC 1808); added http to list of protocolsGuido van Rossum1996-03-291-9/+3
| | | | that use parameters.
* remove file: from list of protocols taking hostGuido van Rossum1995-08-101-1/+1
|
* subtle changes to relative rurl joinsGuido van Rossum1995-08-041-3/+3
|
* Add hacks for switching protocol and path but leaving host unchangedGuido van Rossum1994-12-301-0/+12
|
* New tty/pty modules by Steen; new urlparser.Guido van Rossum1994-09-121-0/+195