【Pyton】【小甲鱼】爬虫

时间：2017-04-05 20:07:11 阅读：283 评论：0 收藏：0 [点我收藏+]

一、什么是爬虫？

可以理解为一只蜘蛛，在不同的网页上爬来爬去，获取我们需要的资源

二、Python如何访问互联网

urllib（一个包）=url（网页地址）+lib（）

技术分享

第一部分：protocol：//

第二部分：网址

第三部分：具体资源目录

三、一个例子爬出网页中的前端代码

1 #爬出网页中的内容
2 >>> import urllib.request
3 >>> response=urllib.request.urlopen("http://www.fishc.com")
4 >>> html=response.read()
5 >>> print(html)
6 #打印粗来的是二进制的一堆代码，那么如果想打印出同网页一样的规范代码，那么就需要解码。下面一行代码就可以了。
7 >>> html=html.decode(‘utf-8‘)
8 >>> print(html)

【Pyton】【小甲鱼】爬虫

原文：http://www.cnblogs.com/zhuzhubaoya/p/6670250.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年09月23日 (328)
2021年09月24日 (313)
2021年09月17日 (191)
2021年09月15日 (369)
2021年09月16日 (411)
2021年09月13日 (439)
2021年09月11日 (398)
2021年09月12日 (393)
2021年09月10日 (160)
2021年09月08日 (222)