Scraping the Sogou Popular Games Ranking

Date: 2020-03-21 16:55:20

1. Open the site http://top.sogou.com/game/quanbu_1.html (the Sogou popular games ranking).


2. Open the page source and locate the content to scrape.

3. Import the required libraries and fetch the information with code.

import requests
from bs4 import BeautifulSoup
import pandas as pd

url = 'http://top.sogou.com/game/quanbu_1.html'
# Send a browser-like User-Agent so the request is not flagged as a crawler
headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.102 Safari/537.36 Edge/18.18362'}
r = requests.get(url, headers=headers)
# Let requests guess the encoding from the response body
r.encoding = r.apparent_encoding
t = r.text
soup = BeautifulSoup(t, 'lxml')

title = []  # game titles
index = []  # search index values
for m in soup.find_all(class_="pub-list renwu"):
    title.append(m.get_text().strip())
for n in soup.find_all(class_="num"):
    index.append(n.get_text().strip())

data = [title, index]
print(data)
# One row per list; transpose so each game becomes a row
s = pd.DataFrame(data, index=["Title", "Search index"])
print(s.T)
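The class-based extraction above can be tried offline without touching the live site. The following is only a minimal sketch: it swaps BeautifulSoup for the standard library's `html.parser`, and both the `SAMPLE_HTML` markup and the `name` class are hypothetical stand-ins, since the real page structure may differ. It collects the text of every element with a given class and pairs titles with their search index values.

```python
from html.parser import HTMLParser

# Hypothetical markup for illustration; the real Sogou page may differ.
SAMPLE_HTML = """
<ul class="pub-list">
  <li><p class="name">Game A</p><span class="num">12345</span></li>
  <li><p class="name">Game B</p><span class="num">6789</span></li>
</ul>
"""


class ClassTextCollector(HTMLParser):
    """Collect the text of every element carrying a given class.

    Limitation: unclosed void tags such as <br> inside a matching
    element are not handled; the sample markup contains none.
    """

    def __init__(self, wanted_class):
        super().__init__()
        self.wanted_class = wanted_class
        self.depth = 0      # > 0 while inside a matching element
        self.buffer = []    # text pieces of the current match
        self.results = []   # finished, stripped texts

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if self.depth:
            self.depth += 1             # nested tag inside a match
        elif self.wanted_class in classes:
            self.depth = 1              # a new match starts here
            self.buffer = []

    def handle_endtag(self, tag):
        if self.depth:
            self.depth -= 1
            if self.depth == 0:         # the matching element closed
                self.results.append("".join(self.buffer).strip())

    def handle_data(self, data):
        if self.depth:
            self.buffer.append(data)


def collect(html, wanted_class):
    """Return the text of all elements in html that have wanted_class."""
    parser = ClassTextCollector(wanted_class)
    parser.feed(html)
    return parser.results


titles = collect(SAMPLE_HTML, "name")
indexes = collect(SAMPLE_HTML, "num")
# zip stops at the shorter list, so a missing value cannot misalign rows
rows = list(zip(titles, indexes))
print(rows)
```

Pairing the two lists with `zip` yields one (title, index) tuple per game, which can be passed straight to `pd.DataFrame(rows, columns=[...])` if a table is wanted.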

4. Get the data.



Original post: https://www.cnblogs.com/somde/p/12540041.html
