python爬虫：由B站UID获得用户投稿视频封面

0.前言

博主可能要阳了，现在感觉有点头晕，可能有点发热。

本来是想学一下Java的网络库的，就随便想了一个案例，然后Java爬虫有点烦，就打算先用python试试水。除了踩点分析花了点时间，代码还是很简单的。这Java版本要是写完了我会另写一篇新的传上来，要是这几天不更新就说明鼠鼠我阳了TAT。

1.全部代码

这么简单就不写注释了，应该也能看懂。

				
					import requests
import os
import re


class BiliCoverSpider:
    def __init__(self, id):
        self.id = id
        self.page = 1
        self.user_url_format = 'https://api.bilibili.com/x/space/wbi/arc/search?mid={ID}&ps=30&tid=0&pn={PAGE}&keyword=&order=pubdate&order_avoided=true&w_rid=64a17313d0ab4fe3a74503517fe017b4&wts=1671862074'
        self.data = []

    def get_url(self, url):
        headers = {
            'user-agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/86.0.4240.198 Safari/537.36'
        }
        res = requests.get(url, headers=headers)
        return res

    def hundle_name(self, name):
        name = name.replace(":", '')
        name = name.replace("?", '')
        name = name.replace("/", '')
        name = name.replace("\\", '')
        name = name.replace("|", '')
        name = name.replace("<", '')
        name = name.replace(">", '')
        name = name.replace("*", '')
        name = name.replace('"', '')
        return name

    def saveImageWithBv(self, url):
        html = self.get_url(url)
        r = re.search(r'//i\d.hdslb.com/bfs/archive/\w+.jpg', html.text)
        imgurl = "https:" + html.text[r.span()[0]:r.span()[1]]
        img = self.get_url(imgurl)
        with open('1.jpg', 'wb') as f:
            f.write(img.content)

    def saveImage(self, url, title):
        img = self.get_url(url)
        with open(self.hundle_name(title) + '.jpg', 'wb') as f:
            f.write(img.content)

    def renewData(self):
        user_url = self.user_url_format.format(ID=self.id, PAGE=self.page)
        html = self.get_url(user_url)
        self.data = html.json()['data']['list']['vlist']

    def getAllCover(self):
        self.renewData()
        if len(self.data) == 0:
            return
        try:
            os.mkdir(self.data[0]['author'])
            os.chdir(self.data[0]['author'])
        except:
            os.chdir(self.data[0]['author'])
        while (True):
            if len(self.data) == 0:
                return
            for i in self.data:
                print(i['pic'], i['title'])
                self.saveImage(i['pic'], i['title'])
            self.page += 1
            self.renewData()

if __name__ == '__main__':
    id = int(input("输入UID："))
    spy = BiliCoverSpider(id)
    spy.getAllCover()

2.效果演示

以用户昵称创建一个文件夹，将视频封面都存在里面。

3.碎碎念

没想到快到年末了，居然顶不住了，还以为我能躲过一劫，没想到啊没想到。

要是没有阳回头会在这篇后面更新一下。阳了就开一篇我与新冠的斗争。

订阅评论

1 评论

最旧

柒氿

python爬虫：由B站UID获得用户投稿视频封面

0.前言

1.全部代码

2.效果演示

3.碎碎念

《python爬虫：由B站UID获得用户投稿视频封面》