python 爬虫爬取大众点评11月之星

程序员文章站 2022-05-02 22:02:53

...

import requests
from bs4 import BeautifulSoup
import re

def getHTMLText(url):
    try:
        r = requests.get(url)
        r.raise_for_status()
        r.encoding = r.apparent_encoding
        return r.text
    except:
        return ""

def getStockList(lst, stockURL,city_lst):
    html = getHTMLText(stockURL)
    soup = BeautifulSoup(html, 'html.parser') 
    a = soup.find_all(href=re.compile("/memberlist/star/1711/\d*?"))
    for i in a:
        try:
            href = i.attrs['href']
            city = i.text          
            lst.append(href)
            city_lst.append(city)
        except:
            continue

def getStockInfo(lst, fpath,cst):
    count=0
    for stock in lst:
        url = "http://www.dianping.com" + stock  
        html = getHTMLText(url)

        try:
            if html=="":
                continue
            infoDict = {}
            soup = BeautifulSoup(html, 'html.parser')
            stockInfo = soup.find_all('h4')
            for n in stockInfo:

                name = n.find('a').text
                number = int(eval((n.find('span').text)))
                infoDict[name]=number


            with open(fpath, 'a', encoding='utf-8') as f:
                f.write(str(cst[count])+":"+str(infoDict.items()) + '\n' )
                count = count + 1
                print("\r当前速度:{:.2f}%".format(count*100/len(lst)),end="")
        except:
            count = count + 1
            print("\r当前速度:{:.2f}%".format(count*100/len(lst)),end="")
            continue
    print len(lst)

def main():
    stock_list_url = 'http://www.dianping.com/memberlist/star/1711/2'

    output_file = 'E:/Dazhongdianping.txt'
    slist=[]
    clist=[]
    getStockList(slist, stock_list_url,clist)

    getStockInfo(slist, output_file,clist)
main()

相关标签： python 爬虫大众点评

上一篇：爬取深圳证券交易所官网中“万科”相关公告的标题和网址，批量下载深圳证券交易所官网中“万科”相关公告的PDF文件。

下一篇： Python网络爬虫爬取新浪新闻

python 爬虫爬取大众点评11月之星

python爬虫实列（爬取大众点评评论）

【Python3爬虫】大众点评爬虫（破解CSS反爬）

python爬虫之通过pyquery爬取大众点评评论信息

Python 爬取大众点评店铺评论

python 爬虫爬取大众点评11月之星

python爬虫爬取大众点评中所有行政区内的商户将获取信息存于excle中

python爬大众点评评论（爬虫），scrapy爬虫

python爬虫——按城市及店铺面爬取大众点评分类

python爬虫实列（爬取大众点评评论）

【Python3爬虫】大众点评爬虫（破解CSS反爬）

python 爬虫 爬取大众点评11月之星

python爬虫实列（爬取大众点评评论）

【Python3爬虫】大众点评爬虫（破解CSS反爬）

python爬虫之通过pyquery爬取大众点评评论信息

Python 爬取大众点评店铺评论

python 爬虫 爬取大众点评11月之星

python爬虫 爬取大众点评中所有行政区内的商户 将获取信息存于excle中

python爬大众点评评论（爬虫），scrapy爬虫

python爬虫——按城市及店铺面爬取大众点评分类

python爬虫实列（爬取大众点评评论）

【Python3爬虫】大众点评爬虫（破解CSS反爬）

python 爬虫爬取大众点评11月之星

python 爬虫爬取大众点评11月之星

python爬虫爬取大众点评中所有行政区内的商户将获取信息存于excle中