Using urllib's urlopen read function but getting None

By : user2956942
Date : November 22 2020, 03:03 PM
I tried with the Python package requests and at first got an error: requests.exceptions.TooManyRedirects: Exceeded 30 redirects.
It seems the site redirects from one URL to another in a loop, which may be why it failed with urllib. I also checked the urlopen documentation, and it seems to have some problems with HTTPS requests.
code :
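import requests

# Placeholder: the original post does not show the URL being fetched.
url = 'http://example.com/'

s = requests.Session()
# Send a browser-like User-Agent header with every request in this session
s.headers['User-Agent'] = 'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/37.0.2062.120 Safari/537.36'

response = s.get(url)
print(response.text)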

Why urllib.urlopen.read() does not correspond to source code?

By : Shinto Jose
Date : March 29 2020, 07:55 AM
What you get from urlopen is the raw web page: no JavaScript is executed and no CSS is applied. What you see in Chrome (or other browsers) is the final page, after JavaScript has run (which may alter the HTML) and CSS has been rendered. None of that happens in urlopen.
Hence the difference. Hope this is clear.
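A small sketch of the point above, assuming Python 3. To stay offline it serves a made-up page through a data: URL instead of a real site; the page content and the name raw_source are illustrative only. The bytes returned by read() still contain the script tag and the original "loading" text, because nothing executes the JavaScript.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Hypothetical page: a browser would run the script and show "done".
PAGE = (
    "<html><body><p id='msg'>loading</p>"
    "<script>document.getElementById('msg').textContent = 'done';</script>"
    "</body></html>"
)


def raw_source(page=PAGE):
    """Return what urlopen(...).read() yields for the page: unprocessed bytes."""
    # A data: URL lets urlopen "fetch" the page without any network access.
    with urlopen("data:text/html," + quote(page)) as response:
        return response.read()


if __name__ == "__main__":
    body = raw_source()
    print(b"loading" in body)   # the text is as served, not as rendered
    print(b"<script>" in body)  # the script is present but was never run
```

To get the page as the browser shows it, something has to actually execute the JavaScript, which urlopen never does.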
Python Urllib UrlOpen Read

By : Fred
Date : March 29 2020, 07:55 AM
There are a few things you can do. If the URLs are on different domains, you can simply fan the work out to threads, each downloading a page from a different domain.
If your URLs all point to the same server and you do not want to stress it, you can just retrieve the URLs sequentially. If the server is happy with a couple of parallel requests, then you can look into pools of workers. You could start, say, a pool of four workers and add all your URLs to a queue, from which the workers pull new URLs.
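The worker-pool idea could be sketched roughly as follows, assuming Python 3 and the standard library's ThreadPoolExecutor, which manages the queue of pending work internally. The names fetch and fetch_all and the fetcher parameter are made up for this illustration, and the pool size of four is just the figure mentioned above.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.request import urlopen


def fetch(url, timeout=10):
    """Download one URL and return its raw bytes."""
    with urlopen(url, timeout=timeout) as response:
        return response.read()


def fetch_all(urls, max_workers=4, fetcher=fetch):
    """Fetch every URL using a small pool of worker threads.

    Returns a dict mapping each URL to its body, or to the exception
    that was raised while downloading it.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submitting queues the work; idle workers pick up the next URL.
        futures = {url: pool.submit(fetcher, url) for url in urls}
        for url, future in futures.items():
            try:
                results[url] = future.result()
            except Exception as exc:  # keep going if one download fails
                results[url] = exc
    return results
```

Setting max_workers=1 gives the sequential behaviour suggested for a server that should not be stressed.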
what does read() in urlopen('http.....').read() do? [urllib]

By : Ian Webb
Date : March 29 2020, 07:55 AM
Quoting BS docs:
Mocking urllib.request.urlopen's read function returns MagicMock signature

By : Aniket
Date : March 29 2020, 07:55 AM
To follow up on @jonrsharpe's comment and the "Python: Mocking a context manager" thread: to properly mock the context manager in this case, you need this interesting-looking line:
code :
mock_urlopen.return_value.__enter__.return_value.read.return_value = 'ok'
#^^^^^^context manager to return response^^^^^^^|^^^read method^^^
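For context, here is a minimal self-contained sketch of how that line might be used, assuming Python 3's unittest.mock. The function read_page and the URL are hypothetical stand-ins for the code under test, and the return value is set to bytes because a real read() returns bytes.

```python
import urllib.request
from unittest import mock


def read_page(url):
    """Hypothetical code under test: uses urlopen as a context manager."""
    with urllib.request.urlopen(url) as response:
        return response.read()


def demo():
    """Patch urlopen so that read_page never touches the network."""
    with mock.patch("urllib.request.urlopen") as mock_urlopen:
        # urlopen(...) -> return_value; "with ... as response" -> __enter__()
        mock_urlopen.return_value.__enter__.return_value.read.return_value = b"ok"
        return read_page("http://example.invalid/")


if __name__ == "__main__":
    print(demo())
```

If read.return_value were set directly on mock_urlopen.return_value, the with-block would still receive a fresh MagicMock from __enter__, which is where the MagicMock in the question's output comes from.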
Why can't I decode the zhihu.com response using urllib.request.urlopen(url).read()?

By : testachio testronaut
Date : March 29 2020, 07:55 AM
The site returns a gzipped response, even though you did not request one.
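One way to handle this, as a hedged sketch assuming Python 3: check the Content-Encoding header (or the gzip magic bytes) and decompress before decoding. The helper names decode_body and fetch_text are made up for this example, and UTF-8 is only an assumed fallback charset.

```python
import gzip
import urllib.request


def decode_body(raw, content_encoding=None, charset="utf-8"):
    """Decompress the body if it is gzipped, then decode it to text."""
    # Gzip streams always start with the two magic bytes 0x1f 0x8b.
    if content_encoding == "gzip" or raw[:2] == b"\x1f\x8b":
        raw = gzip.decompress(raw)
    return raw.decode(charset)


def fetch_text(url):
    """Fetch a URL and return its text, handling an unrequested gzip body."""
    with urllib.request.urlopen(url) as response:
        raw = response.read()
        encoding = response.headers.get("Content-Encoding")
        charset = response.headers.get_content_charset() or "utf-8"
    return decode_body(raw, encoding, charset)
```

Calling .decode() directly on the compressed bytes is what fails, since gzip data is not valid UTF-8.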
