• 欢迎光临~

实验二:解析4k图片下载

开发技术 开发技术 2022-11-16 次浏览
import requests
from lxml import html
url = 'https://pic.netbian.com/4kyingshi/'
#UA伪装: 让爬虫对应的请求载体标识伪装成一款浏览器,将对应的User-Agent封装到字典
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/106.0.0.0 Safari/537.36'
}
response = requests.get(url,headers=headers)
response.encoding = 'gbk'
page_text = response.text
# print(page_text)
etree = html.etree
tree = etree.HTML(page_text)
li_list = tree.xpath('//div[@class="slist"]/ul/li')
# print(li_list)
for li in li_list:
    img = 'https://pic.netbian.com' + li.xpath('./a/img/@src')[0]
    print(img)
    img_name = li.xpath('./a/b/text()')[0] + '.jpg'
    print(img_name)
    #爬取每张图片,是二进制的数据,用content取
    img_data = requests.get(img).content
    #保存图片
    with open('./pic/'+img_name,'wb') as fp:
        fp.write(img_data)
        print('下载成功')

需要提前创建一个跟代码同级的文件夹,名字为pic

程序员灯塔
转载请注明原文链接:实验二:解析4k图片下载
喜欢 (0)
违法和不良信息举报电话:022-22558618 举报邮箱:dljd@tidljd.com