首页 > 其他 > 详细

爬虫基础之Request类

时间:2020-01-16 20:44:16      阅读:73      评论:0      收藏:0      [点我收藏+]

categories:

  • 爬虫
    tags:
  • Request
  • 模拟请求拉钩网

request.Request类

如果想要在请求的时候增加一些请求头,用request.Request

from urllib import request
headers = {
    'userAgent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/78.0.3904.108 Safari/537.36'
}
req = request.Request("http://www.baidu.com/",headers = headers)
resp = request.urlopen(req)
print(resp.read()[:10])
#返回结果如下
b'<!DOCTYPE '

模拟请求拉钩网站

from urllib import request,parse
headers = {
    'userAgent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/78.0.3904.108 Safari/537.36',
    'Referer':'https://www.lagou.com/jobs/list_python?labelWords=&fromSearch=true&suginput=',
}
url = 'https://www.lagou.com/jobs/list_python?labelWords=&fromSearch=true&suginput='
data = {
    'first':'true',
    'pn':1,
    'kd':'python'
}
req = request.Request(url,headers=headers,data=parse.urlencode(data).encode('utf-8'),method='POST')
resp = request.urlopen(req)
print(resp.read().decode('utf-8'))

爬虫基础之Request类

原文:https://www.cnblogs.com/anyux/p/12203062.html

(0)
(0)
   
举报
评论 一句话评论(0
关于我们 - 联系我们 - 留言反馈 - 联系我们:wmxa8@hotmail.com
© 2014 bubuko.com 版权所有
打开技术之扣,分享程序人生!