【Python3 爬虫】U05_requests库

时间：2020-03-26 14:42:47 阅读：49 评论：0 收藏：0 [点我收藏+]

1.安装和文档地址
2.发送get请求
- 2.1 发送最简单的get请求
- 2.2 添加headers和查询参数
3.发送post请求
- 3.1 发送post请求

虽然Python中的标准库urllib模块已经可以满足我们的大多数需求，但是它的API使用起来让人感觉不是很好，而requests宣传是HTTP for Humans,说明使用更简洁方便。

1.安装和文档地址

安装

pip install requests

文档地址
中文文档：https://cn.python-requests.org/zh_CN/latest/
开源地址：https://github.com/kennethreitz/requests

2.发送get请求

2.1 发送最简单的get请求

import requests
resp = requests.get("https://www.sohu.com/")
print(resp.content.decode(‘utf-8‘))

2.2 添加headers和查询参数

如果想要添加headers,可以传入headers参数来增加请求头中的headers信息。如果要将参数放在url中传递，可以利用params参数。
实例代码：

import requests
params = {
    ‘q‘: ‘Python‘
}
headers = {
    ‘user-agent‘:‘Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.149 Safari/537.36‘
}

# params接受一个字典或者字符串查询参数，字典类型自动转换为url编码
response = requests.get(‘https://www.so.com/s‘, params = params, headers = headers)

# 查看完整的url地址
print(response.url)

# 查看响应内容，response.text返回的是Unicode格式的数据
print(response.text)

# 查看响应内容，response.content返回的字节流数据
print(response.content.decode(‘utf-8‘))

# 查看响应头的字符编码
print(response.encoding)

# 查看响应码
print(response.status_code)

response.text和response.content的区别

response.text
类型：str
解码类型：根据HTTP头部对响应的编码作出有根据的推测，推测的文本编码
如何修改编码方式：response.encoding="GBK"
response.content
类型：bytes
解码类型：没有指定
如何修改编码方式：response.content.deocde("utf-8")
推荐使用response.content.deocde()的方式获取响应的html页面

3.发送post请求

3.1 发送post请求


import requests

url = ‘http://httpbin.org/post‘
d = {‘key1‘: ‘value1‘, ‘key2‘: ‘value2‘}
r = requests.post(url, data=d)
print(r.text)

输出结果：
技术分享图片

【Python3 爬虫】U05_requests库

原文：https://www.cnblogs.com/OliverQin/p/12574093.html

踩

(1)

(2)

评论一句话评论（0）

分享档案

更多>

2021年09月23日 (328)
2021年09月24日 (313)
2021年09月17日 (191)
2021年09月15日 (369)
2021年09月16日 (411)
2021年09月13日 (439)
2021年09月11日 (398)
2021年09月12日 (393)
2021年09月10日 (160)
2021年09月08日 (222)