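Resource overview
Source code for a simple Python crawler that scrapes the National Bureau of Statistics administrative division codes and urban-rural classification codes, together with the SQL file of the crawl results. This resource is intended for learning and exchange only and does not target any website, software, or individual.
Code snippet and file information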
#!/usr/bin/env python3
# -*- coding: utf-8 -*-
import urllib.request
import urllib.error
import time


class HtmlDownloader(object):
    def download(self, url):
        # Refuse to run without a target URL.
        if url is None:
            raise Exception('url is None')
        print(url)
        # Send a cookie and a browser-like User-Agent with the request.
        request = urllib.request.Request(url, None, {
            'Cookie': 'AD_RS_COOKIE=20083363',
            'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) '
                          'AppleWebKit/537.36 (KHTML, like Gecko) '
                          'Chrome/58.0.3029.110 Safari/537.36'})
        try:
            with urllib.request.urlopen(request) as response:
                print(response.getcode())
                if response.g
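The excerpt above breaks off inside the response check, so the rest of the original `download` method is not shown here. As a rough illustration only, a downloader of this general shape could look like the sketch below. The function names `build_request` and `download`, the `opener` and `encoding` parameters, and the choice to return `None` on any error or non-200 status are all assumptions made for this sketch and are not taken from the archive.

```python
import urllib.error
import urllib.request


def build_request(url, user_agent="Mozilla/5.0"):
    """Build a Request carrying a browser-like User-Agent (hypothetical helper)."""
    if url is None:
        raise ValueError("url is None")
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def download(url, opener=urllib.request.urlopen, encoding="utf-8"):
    """Return the decoded page body, or None on a non-200 status or network error.

    `opener` is injectable so the function can be exercised without network
    access; `encoding` is an assumption the caller should adjust to the
    target site's actual charset.
    """
    request = build_request(url)
    try:
        with opener(request) as response:
            if response.getcode() != 200:
                return None
            return response.read().decode(encoding, errors="replace")
    except urllib.error.URLError:
        # HTTPError is a subclass of URLError, so both are handled here.
        return None
```

Passing the opener as a parameter keeps the sketch testable offline; in normal use the default `urllib.request.urlopen` would be left in place.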
Attribute       Size  Date        Time   Name
---------  ---------  ----------  -----  ----
Directory          0  2017-05-26  16:34  CodeSpider\
Directory          0  2017-05-26  16:35  CodeSpider\.idea\
File              10  2017-05-25  19:26  CodeSpider\.idea\.name
File             398  2017-05-25  19:26  CodeSpider\.idea\CodeSpider.iml
File             159  2017-05-25  19:26  CodeSpider\.idea\encodings.xm
File             689  2017-05-25  19:26  CodeSpider\.idea\misc.xm
File             272  2017-05-25  19:26  CodeSpider\.idea\modules.xm
File           28862  2017-05-26  16:35  CodeSpider\.idea\workspace.xm
File            1100  2017-05-26  14:30  CodeSpider\html_downloader.py
File            2702  2017-05-26  15:23  CodeSpider\html_parser.py
File            1356  2017-05-26  16:34  CodeSpider\mysql_handler.py
File            2911  2017-05-26  14:24  CodeSpider\spider_main.py
File               0  2017-05-25  19:10  CodeSpider\__init__.py
Directory          0  2017-05-26  15:23  CodeSpider\__pycache__\
File            1210  2017-05-26  14:31  CodeSpider\__pycache__\html_downloader.cpython-35.pyc
File            3173  2017-05-26  15:23  CodeSpider\__pycache__\html_parser.cpython-35.pyc
File            1609  2017-05-26  11:29  CodeSpider\__pycache__\mysql_handler.cpython-35.pyc
File         3598221  2017-05-26  16:10  code_spider.sql