搜狗音乐爬虫下载python

时间：2018-06-26 22:20:46 阅读：230 评论：0 收藏：0 [点我收藏+]

import requests
import re

session = requests.Session()
r = session.get(‘http://www.kugou.com/yy/rank/home/1-8888.html?from=homepage‘)
html = r.text
pattern = r‘<a href="(.+?)" data-active="playDwn" data-index="\d+" class="pc_temp_songname" title="(.+?)" hidefocus="true">.+?</a>‘
m = re.findall(pattern, html)
if m:
    for line in m:
        # print line
        mp3name = line[1]
        r = session.get(line[0])
        html = r.text
        m = re.search(r‘\[\{"hash":"(.+?)".+"album_id":(\d*)\}\]‘, html)
        if m:
            hash,album_id = m.group(1),m.group(2)
            url = ‘http://www.kugou.com/yy/index.php?r=play/getdata&hash=%s&album_id=%s&_=1508983920130‘ % (hash, album_id)
            print(url)
            r = session.get(url)
            d = r.json()
            if d["status"] == 1:
                mp3url = d["data"]["play_url"]
                r = session.get(mp3url, stream=True)
                with open(r‘d:\mp3\%s.mp3‘ % mp3name, "wb") as f:
                        for chunk in r.iter_content(chunk_size=512):
                            if chunk:
                                f.write(chunk)

搜狗音乐爬虫下载python

原文：https://www.cnblogs.com/chif/p/9231433.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年09月23日 (328)
2021年09月24日 (313)
2021年09月17日 (191)
2021年09月15日 (369)
2021年09月16日 (411)
2021年09月13日 (439)
2021年09月11日 (398)
2021年09月12日 (393)
2021年09月10日 (160)
2021年09月08日 (222)