
[Technical Discussion] Python crawler for pneumonia outbreak status

Posted on 2020-1-28 09:10:46
Here is a version that uses re (regular expressions), for your reference:
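import json
import re
import time

import requests


def test():
    """Fetch the DXY pneumonia page and return the embedded area statistics as parsed JSON."""
    url = 'http://3g.dxy.cn/newh5/view/pneumonia'
    headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/63.0.3239.132 Safari/537.36'}
    try:
        # A timeout keeps the script from hanging if the server never answers.
        response = requests.get(url, headers=headers, timeout=10)
        response.raise_for_status()
        response.encoding = "utf-8"
        resp = response.text
        # The regex expects the page source to contain: getAreaStat = <JSON>}catch
        patt = re.compile(r'getAreaStat = (.*?)\}catch')
        getAreaStat = patt.findall(resp)
        # Raises IndexError (caught below) when the pattern is not found.
        getAreaStat = getAreaStat[0]
        getAreaStat_json = json.loads(getAreaStat)
        return getAreaStat_json

    except Exception as e:
        print("Fetch failed!", e)
        return None


if __name__ == '__main__':
    time_start = time.time()
    test_json = test()
    # test() returns None on failure, so check before indexing.
    if test_json:
        print(test_json[0])
    time_end = time.time()
    timecost = time_end - time_start
    print("Elapsed:", "%.2f" % timecost, "seconds")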
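
If you want to check the regex step without hitting the site, here is a minimal offline sketch. The sample string and its field names (name, count) are made up for illustration and are not the real page content; extract_area_stat is a hypothetical helper name, not something from the script above. It applies the same pattern and returns None instead of raising when the pattern is missing or the captured text is not valid JSON.

```python
import json
import re

# Hypothetical stand-in for the page source; the real page may look different.
SAMPLE_HTML = (
    '<script>try { window.getAreaStat = '
    '[{"name": "A", "count": 3}, {"name": "B", "count": 1}]'
    '}catch(e){}</script>'
)

# Same pattern as in the post; re.S is added here (an assumption) so the
# match still works if the JSON spans several lines.
PATTERN = re.compile(r'getAreaStat = (.*?)\}catch', re.S)


def extract_area_stat(html):
    """Return the parsed getAreaStat payload, or None if absent or invalid."""
    match = PATTERN.search(html)
    if match is None:
        return None
    try:
        return json.loads(match.group(1))
    except json.JSONDecodeError:
        return None


if __name__ == '__main__':
    print(extract_area_stat(SAMPLE_HTML))
    print(extract_area_stat('<html>no data here</html>'))
```

The non-greedy (.*?) stops at the first "}catch" after the assignment, so braces inside the JSON itself do not end the match early.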