Python crawler: Dianping (大众点评) food rankings

程序员文章站 2022-05-02 22:15:05

...

import requests
from bs4 import BeautifulSoup
import re

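def getHTMLText(url):
    """Fetch a page and return its text, or "" if the request fails."""
    try:
        r = requests.get(url, timeout=30)
        r.raise_for_status()
        # Guess the encoding from the body so Chinese text decodes correctly.
        r.encoding = r.apparent_encoding
        return r.text
    except requests.RequestException:
        return ""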

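def getStockList(lst, stockURL, city_lst, infodict):
    """Collect shop names and addresses from one listing page."""
    html = getHTMLText(stockURL)
    soup = BeautifulSoup(html, 'html.parser')
    a = soup.find_all("h4")
    # Raw string with escaped dots, so "\d" is a digit class and "." is literal.
    b = soup.find_all("a", href=re.compile(r"http://www\.dianping\.com/search/category/33/0/r\d{4}"))

    # lst also holds names from earlier pages; this page's names start here.
    start = len(lst)
    # Skip the first two and last two <h4> elements, which are not shop names.
    for i in a[2:-2]:
        name = i.text
        lst.append(name)
        print(name)

    for count, j in enumerate(b):
        parts = j.text.split()
        # Skip links with fewer than two text tokens or with no matching name.
        if len(parts) < 2 or start + count >= len(lst):
            continue
        address = parts[0] + parts[1]
        city_lst.append(address)
        infodict[lst[start + count]] = address
        print(address)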



def main():
    stock_list_url = 'http://www.dianping.com/search/category/33/10/r3300'
    output_file = 'E:/dzdpmspm.txt'
    slist = []
    clist = []
    infoDict = {}
    last_page = 50
    getStockList(slist, stock_list_url, clist, infoDict)
    for n in range(2, last_page + 1):
        stock_list_url = "http://www.dianping.com/search/category/33/10/r3300p" + str(n) + "?aid=91959818%2C93071129"
        getStockList(slist, stock_list_url, clist, infoDict)
        print("\rProgress: {:.2f}%".format(n * 100 / last_page), end="")

    # Write the results once, after all pages are fetched, one shop per line.
    with open(output_file, 'w', encoding='utf-8') as f:
        for name, address in infoDict.items():
            f.write(name + '\t' + address + '\n')


if __name__ == "__main__":
    main()
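
The paging and link-matching logic used in main() and getStockList can be tried out without touching the network. The block below is only a sketch under a few assumptions: `build_page_urls` and `is_area_link` are hypothetical helper names that do not appear in the original script, and the `?aid=...` query string from the original page URLs is left out on the assumption that it is not needed to address a page.

```python
import re

# Base listing URL taken from main(); later pages append "p<n>" to it.
BASE_URL = "http://www.dianping.com/search/category/33/10/r3300"

# Same link pattern as in getStockList, as a raw string with escaped dots.
AREA_LINK = re.compile(r"http://www\.dianping\.com/search/category/33/0/r\d{4}")


def build_page_urls(base_url, last_page):
    """Return listing URLs for pages 1..last_page (hypothetical helper)."""
    urls = [base_url]
    for n in range(2, last_page + 1):
        urls.append(f"{base_url}p{n}")
    return urls


def is_area_link(href):
    """Return True if href starts with the area-link pattern (hypothetical helper)."""
    return AREA_LINK.match(href) is not None
```

Calling `build_page_urls(BASE_URL, 50)` yields 50 URLs, the first being the bare listing URL and the rest ending in `p2` through `p50`, which matches the range that main() walks.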
