文件编码解读

时间：2017-05-08 21:35:25 阅读：265 评论：0 收藏：0 [点我收藏+]

 1 lines (8 sloc)  333 Bytes
 2 from urllib.request import urlopen
 3 from bs4 import BeautifulSoup
 4 
 5 html = urlopen("http://en.wikipedia.org/wiki/Python_(programming_language)")
 6 bsObj = BeautifulSoup(html, "html.parser")
 7 content = bsObj.find("div", {"id":"mw-content-text"}).get_text()
 8 content = bytes(content, "UTF-8")
 9 content = content.decode("UTF-8")
10 print(content)

1 from urllib.request import urlopen
2 
3 textPage = urlopen("http://www.pythonscraping.com/pages/warandpeace/chapter1.txt")
4 print(str(textPage.read(),‘utf-8‘))用字符串转换编码

文件编码解读

原文：http://www.cnblogs.com/caojunjie/p/6827793.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年09月23日 (328)
2021年09月24日 (313)
2021年09月17日 (191)
2021年09月15日 (369)
2021年09月16日 (411)
2021年09月13日 (439)
2021年09月11日 (398)
2021年09月12日 (393)
2021年09月10日 (160)
2021年09月08日 (222)