python爬取妹子图片

程序员文章站 2022-05-02 23:42:15

`# -*- coding:utf-8 -*- import urllib import urllib2 from bs4 import BeautifulSoup u...

`# -*- coding:utf-8 -*-
import urllib
import urllib2
from bs4 import BeautifulSoup
url="https://www.mzitu.com/all"
request=urllib2.Request(url)
response=urllib2.urlopen(request)
start_html=response.read()
soup=BeautifulSoup(start_html,'lxml')
all_a=soup.find('p',class_='all').find_all('a')
for a in all_a:
    href=a['href']
    #print href
    html=urllib.urlopen(href)
    html_soup=BeautifulSoup(html,'lxml')
    max_span=html_soup.find_all('span')[10].string
    for page in range(1,int(max_span)+1):
        page_url=href+'/'+str(page)
        img_html=urllib.urlopen(page_url)
        img_soup=BeautifulSoup(img_html,'lxml')
        #find('p',claa_="main_img"):是在所有的p中查找class属性为main-img的p,下面的意思在已经找到的那个p中再查找img标签
        img_url=img_soup.find('p',class_='main-image').find('img')['src']    #find是只查找一个结果之后就找了，因此不是一个列表的类型，可以直接索引
        print img_url
        x=img_url[-6:-4]
        urllib.urlretrieve(img_url,"g:\\img\\%s.jpg"%x)
[python官方文档](https://python.usyiyi.cn/translate/python_278/library/index.html)

上一篇：说话的时候不认真,沉默的时候太用心

下一篇：智慧里面的珍珠

python爬取妹子图片

python爬取网页内容转换为PDF文件

Python使用Selenium爬取淘宝异步加载的数据方法

网易云歌单信息爬取及数据分析（python爬虫）

python爬取个性签名的方法

python如何爬取个性签名

Python实现爬取马云的微博功能示例

Python爬取Coursera课程资源的详细过程

Python实现爬取知乎神回复简单爬虫代码分享

Python 爬取陈都灵百度图片

Python探索之爬取电商售卖信息代码示例