diff options
author | Jeremy Hylton <jeremy@alum.mit.edu> | 2004-08-07 17:40:50 (GMT) |
---|---|---|
committer | Jeremy Hylton <jeremy@alum.mit.edu> | 2004-08-07 17:40:50 (GMT) |
commit | 5d9c3031c805ffb634688a6fcae0e7790688ce53 (patch) | |
tree | 43acb77a31e43498fe410844c30b83461fa66e4e /Lib/test/test_urllib2.py | |
parent | 1baa2480215a2cd168e2fde10e640650c1807496 (diff) | |
download | cpython-5d9c3031c805ffb634688a6fcae0e7790688ce53.zip cpython-5d9c3031c805ffb634688a6fcae0e7790688ce53.tar.gz cpython-5d9c3031c805ffb634688a6fcae0e7790688ce53.tar.bz2 |
Fix urllib2.urlopen() handling of chunked content encoding.
The change to use the newer httplib interface admitted the possibility
that we'd get an HTTP/1.1 chunked response, but the code didn't handle
it correctly. The raw socket object can't be pass to addinfourl(),
because it would read the undecoded response. Instead, addinfourl()
must call HTTPResponse.read(), which will handle the decoding.
One extra wrinkle is that the HTTPReponse object can't be passed to
addinfourl() either, because it doesn't implement readline() or
readlines(). As a quick hack, use socket._fileobject(), which
implements those methods on top of a read buffer. (suggested by mwh)
Finally, add some tests based on test_urllibnet.
Thanks to Andrew Sawyers for originally reporting the chunked problem.
Diffstat (limited to 'Lib/test/test_urllib2.py')
-rw-r--r-- | Lib/test/test_urllib2.py | 2 |
1 files changed, 2 insertions, 0 deletions
diff --git a/Lib/test/test_urllib2.py b/Lib/test/test_urllib2.py index 6e9901f..c68d244 100644 --- a/Lib/test/test_urllib2.py +++ b/Lib/test/test_urllib2.py @@ -423,6 +423,8 @@ class HandlerTests(unittest.TestCase): self.msg = msg self.status = status self.reason = reason + def read(self): + return '' class MockHTTPClass: def __init__(self): self.req_headers = [] |