爬虫----异步---高性能爬虫----aiohttp 和asycio 的使用

时间：2019-05-25 17:40:06 阅读：239 评论：0 收藏：0 [点我收藏+]

前情提要:

　　首先膜拜loco大佬

　　　　肯定有人像我一样.不会异步,发一下.

一:性能比对

　　　　多进程,多线程,(这里不建议使用,太消耗性能)

　　　　进程池和线程池 (可以适当的使用)

　　　　单线程+异步协程 (推荐使用)

二:案例演示

　　　　1->1: 普通的啥也不用的

　　　　1->2:

　　　　　　2->1:

　　　　　　使用线程池

　　　　　　技术分享图片

　　　　　　2->2:结果

　　　　技术分享图片

三:异步协程

　　　　1: 协程的参数设定

　　　　　　2:协程的简单使用

技术分享图片

　　　　　　3:task的使用

技术分享图片

4:future 的使用

技术分享图片

回调函数的使用

技术分享图片

四:支持异步请求网络的模块: aiohttp

import aiohttp
import asyncio

async def get_page(url):             
    async with aiohttp.ClientSession() as session:      #with 前面都要加async
        async with await session.get(url=url) as response:  # 有io阻塞的都要加await 
挂起
            page_text = await response.text() #read()  json()
            print(page_text)
start = time.time()
urls = [
    ‘http://127.0.0.1:5000/bobo‘,
    ‘http://127.0.0.1:5000/jay‘,
    ‘http://127.0.0.1:5000/tom‘,
    ‘http://127.0.0.1:5000/bobo‘,
    ‘http://127.0.0.1:5000/jay‘,
    ‘http://127.0.0.1:5000/tom‘,
    ‘http://127.0.0.1:5000/bobo‘,
    ‘http://127.0.0.1:5000/jay‘,
    ‘http://127.0.0.1:5000/tom‘
]
tasks = []
loop = asyncio.get_event_loop()
for url in urls:
    c = get_page(url)
    task = asyncio.ensure_future(c)
    tasks.append(task)
loop.run_until_complete(asyncio.wait(tasks))
print(‘总耗时：‘,time.time()-start)

爬虫----异步---高性能爬虫----aiohttp 和asycio 的使用

原文：https://www.cnblogs.com/baili-luoyun/p/10923183.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年09月23日 (328)
2021年09月24日 (313)
2021年09月17日 (191)
2021年09月15日 (369)
2021年09月16日 (411)
2021年09月13日 (439)
2021年09月11日 (398)
2021年09月12日 (393)
2021年09月10日 (160)
2021年09月08日 (222)