
My first crawler program: scraping images

Posted: 2020-09-17 18:31:24

from typing import Dict

import os
import requests
from bs4 import BeautifulSoup

myheaders: Dict[str, str] = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 '
                  '(KHTML, like Gecko) Chrome/70.0.3538.110 Safari/537.36'}
url = 'http://www.cntour.cn/'
strtext = requests.get(url, headers=myheaders)
soup = BeautifulSoup(strtext.text, 'lxml')
# Links in the front-page news list (selected here but not used below)
data = soup.select('#main>div>div.mtop.firstMod.clearfix>div.centerBox>ul.newsList>li>a')

fileDir = 'tupian'
dataList = soup.find_all('img')

if not os.path.exists(fileDir):
    print('create a filepath')
    os.mkdir(os.path.join(os.getcwd(), fileDir))

fileDir = os.path.join(os.getcwd(), fileDir)
j = 1
for b in dataList:
    picName = str(j) + '.jpg'
    # Assumes every src is a site-relative path; see the note below on absolute URLs
    picUrl = url + b.get('src')
    fPath = os.path.join(fileDir, picName)
    print(fPath)
    result = requests.get(url=picUrl)
    with open(fPath, 'wb') as f:  # write each image to disk
        f.write(result.content)
    j = j + 1

print('success')
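The loop above builds each image URL by string-concatenating the page URL with the img tag's src attribute, which produces a broken URL whenever src is already absolute (e.g. hosted on a CDN) or protocol-relative. A safer way to resolve the src is the standard library's urllib.parse.urljoin; this is a minimal sketch, and the sample paths here are made up for illustration:

```python
from urllib.parse import urljoin

base = 'http://www.cntour.cn/'

# A relative src is resolved against the base URL
print(urljoin(base, 'images/banner.jpg'))    # http://www.cntour.cn/images/banner.jpg
print(urljoin(base, '/images/banner.jpg'))   # http://www.cntour.cn/images/banner.jpg

# An absolute src is returned unchanged instead of being mangled
print(urljoin(base, 'http://cdn.example.com/pic.jpg'))  # http://cdn.example.com/pic.jpg

# A protocol-relative src inherits the base URL's scheme
print(urljoin(base, '//cdn.example.com/pic.jpg'))       # http://cdn.example.com/pic.jpg
```

In the loop this would replace the concatenation with `picUrl = urljoin(url, b.get('src'))`.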


Original: https://www.cnblogs.com/my85016629/p/13686326.html
