資源簡介
使用Python編寫的爬取百度百科詞條信息的Demo源代碼,具體看博客:http://blog.csdn.net/tianmaxingkong_/article/details/52959784
代碼片段和文件信息
#?coding=utf-8
import?urllib2
class?HtmlDownloader(object):
????def?download(self?url):
????????if?url?is?None:
????????????return
????????response?=?urllib2.urlopen(url)
????????if?response.getcode()?!=?200:
????????????return?None
????????return?response.read().decode(‘utf-8‘)
?屬性????????????大小?????日期????時間???名稱
-----------?---------??----------?-----??----
?????目錄???????????0??2016-10-28?10:27??test_spider\
?????文件????????2297??2016-10-28?10:20??test_spider\spider_main.py
?????文件?????????822??2016-10-28?10:27??test_spider\html_outputer.py
?????文件????????1831??2016-10-28?10:14??test_spider\html_parser.pyc
?????文件?????????783??2016-10-28?10:14??test_spider\html_downloader.pyc
?????文件????????1618??2016-10-28?10:14??test_spider\url_manager.pyc
?????文件???????29684??2016-10-28?10:27??test_spider\output.html
?????文件????????1598??2016-10-28?10:27??test_spider\html_outputer.pyc
?????文件?????????285??2016-10-28?09:42??test_spider\html_downloader.py
?????文件?????????656??2016-10-28?09:39??test_spider\url_manager.py
?????文件????????1685??2016-10-28?10:06??test_spider\html_parser.py
?????文件???????????0??2016-10-28?08:19??test_spider\__init__.py
- 上一篇:tcpudp;端口掃描器
- 下一篇:攜程機票python爬取腳本優(yōu)化版本
評論
共有 條評論