爬虫，百度搜索热点排行

程序员文章站 2022-04-09 18:16:11

百度时候总能看到热搜排行，以上代码就是爬虫获取排行 ......

#!/usr/bin/env python
# -*- coding:utf-8 -*-

#爬虫，搜索热点排行
import urllib.request
import urllib
import re
import json
import xlwt
import os

#获取网站首页全部内容
cnt = 50 #只能1-50
url = 'https://zhidao.baidu.com/question/api/hotword?rn='+cnt.__str__()+'&t=1535421904906'
print(url)
user_agent = 'mozilla/5.0 (windows; u; windows nt 6.1; en-us; rv:1.9.1.6) gecko/20091201 firefox/3.5.6'
req = urllib.request.request(url, headers={'user-agent': user_agent})
response = urllib.request.urlopen(req)
content = response.read().decode('utf-8')
#print(content)

workbook = xlwt.workbook()
sheet1 = workbook.add_sheet('sheet1',cell_overwrite_ok=true)

sheet1.write(0,0,'排名')
sheet1.write(0,1,'新闻名称')
sheet1.write(0,2,'搜索人数')
sheet1.write(0,3,'变化数量')
sheet1.write(0,4,'新的新闻')
sheet1.write(0,5,'热度上升')

datalist = json.loads(content)['data']
j = 1
for data in datalist:
    print(data)
    sheet1.write(j, 0,j)
    sheet1.write(j, 1,data['keyword'])
    sheet1.write(j, 2, data['searches'])
    sheet1.write(j, 3, data['changerate'])
    isnew = data['isnew'];
    if isnew==0:
        isnew = '否'
    elif isnew==1:
        isnew = '是'
    sheet1.write(j, 4, isnew.__str__())
    trend = data['trend']
    style5 = xlwt.xfstyle()
    font = xlwt.font()
    style5.font = font
    if trend == 'fall':
        font.colour_index = 3
        trend = '下降'
    elif trend == 'rise':
        font.colour_index = 2
        trend = '上升'
    sheet1.write(j, 5, trend,style5)
    j = j + 1

#保存该excel文件,有同名文件时直接覆盖
path = 'd:\\python'
if not os.path.isdir(path):
    os.makedirs(path)
paths = path + '\\'
filename = 'test1'
workbook.save('{}{}.xls'.format(paths,filename))
print('创建excel文件完成！')

　　百度时候总能看到热搜排行，以上代码就是爬虫获取排行

爬虫，百度搜索热点排行

上一篇： python元组和字典的简单学习

下一篇： Python之父重回决策层，社区未来如何发展？

爬虫，百度搜索热点排行

百度搜索2018年度排行榜出炉

百度2019年度搜索排行榜发布

百度2018年度搜索排行榜发布

使用BeautifulSoup爬虫程序获取百度搜索结果的标题和url示例

百度、360、搜狗、神马搜索份额多少？2018中国搜索引擎排行

爬虫，百度搜索热点排行

百度2017年度搜索排行榜发布

百度2020年度搜索排行榜发布

python实现的一只从百度开始不断搜索的小爬虫

python实现的一只从百度开始不断搜索的小爬虫