
Looking at Girls Out of Boredom

Date: 2020-02-28 12:55:13

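This script browses the gallery with selenium and saves each image with requests, so pictures are stored as you view them. Selenium is fairly slow, and the script pauses for one second on each image, but at that pace the site does not ban your IP.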

Code

import os
from time import sleep

import requests
from selenium import webdriver
from selenium.webdriver.common.by import By

url = "https://www.mzitu.com/192254"
# XPath of the main image on each page; clicking it moves on to the next image
img_xpath = "/html/body/div[2]/div[1]/div[3]/p/a/img"
# Browser-like headers sent with every image request
headers = {
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.110 Safari/537.36",
    "Referer": "https://www.mzitu.com/xinggan/"
}

driver = webdriver.Chrome()
driver.get(url)
driver.implicitly_wait(3)
driver.maximize_window()
sleep(3)
file_num = 1  # index of the current output folder
num = 1       # running image counter, also used as the file name
while True:
    try:
        img = driver.find_element(By.XPATH, img_xpath)
        link = img.get_attribute("src")
        print(link)

        title = img.get_attribute("alt")
        print(title)
        res = requests.get(link, headers=headers).content
        file_path = r"D:\jar\pic12p{}".format(file_num)
        # Switch to a new folder after every 300 images
        if num % 300 == 0:
            file_num += 1
        os.makedirs(file_path, exist_ok=True)
        jpg_path = os.path.join(file_path, "{}.jpg".format(num))
        try:
            with open(jpg_path, "wb") as f:
                f.write(res)
        except OSError:
            # Skip images that cannot be written and keep browsing
            pass
        num += 1
        sleep(1)
        img.click()
    except KeyboardInterrupt:
        # Ctrl+C ends the loop so the browser can be closed below
        break
    except Exception:
        # Ignore any other error and try again
        pass

sleep(3)
driver.quit()
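
The folder rotation in the loop above (a new folder after every 300 images) can be pulled out into a small pure function, which makes the rule easy to check without opening a browser. This is only a minimal sketch and not part of the original script: `image_path`, `base_dir`, and `per_folder` are hypothetical names introduced here for illustration.

```python
import os


def image_path(base_dir, num, per_folder=300):
    """Return the save path for the num-th image (1-based).

    Hypothetical helper: images 1..per_folder go to folder 1,
    the next per_folder images go to folder 2, and so on,
    matching the counter logic of the script above.
    """
    if num < 1:
        raise ValueError("num must be >= 1")
    folder_index = (num - 1) // per_folder + 1
    # The folder name is the base prefix followed by the folder index
    folder = "{}{}".format(base_dir, folder_index)
    return os.path.join(folder, "{}.jpg".format(num))
```

With a prefix such as `r"D:\jar\pic12p"`, image 300 would still land in the first folder and image 301 would start the second one, which is what the `num % 300` check in the script produces.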

 


Original: https://www.cnblogs.com/sleep10000years/p/12376553.html
