Resource description
A fairly basic Python web crawler, suitable for beginners. The initial URL in Main.py can be replaced with the site you want to crawl.
Code snippet and file information
import urllib2  # Python 2 standard library; Python 3 uses urllib.request instead


class HtmlDownloader(object):
    def download(self, url):
        # Nothing to fetch without a URL.
        if url is None:
            return None
        response = urllib2.urlopen(url)
        # Treat any non-200 status as a failed download.
        if response.getcode() != 200:
            return None
        # Return the raw page body.
        return response.read()
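
The snippet above targets Python 2, where urllib2 is available. For readers on Python 3, a minimal sketch of the same downloader might look like the following. The `timeout` parameter and the error handling are assumptions added for illustration and are not part of the original archive.

```python
import urllib.request


class HtmlDownloader:
    """Python 3 sketch of the downloader; not the archive's original code."""

    def download(self, url, timeout=10):
        # `timeout` (in seconds) is an illustrative addition, not in the original.
        if url is None:
            return None
        try:
            with urllib.request.urlopen(url, timeout=timeout) as response:
                # Treat any non-200 status as a failed download.
                if response.getcode() != 200:
                    return None
                # Return the raw page body as bytes.
                return response.read()
        except (OSError, ValueError):
            # OSError covers URLError, HTTPError, and socket timeouts;
            # ValueError is raised for strings that are not valid URLs.
            return None
```

Unlike the original, this version returns None on network errors instead of letting the exception propagate, so a crawl loop can skip a failing page and continue. Whether that behaviour suits the rest of the crawler depends on how Main.py handles a None result.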
Attribute       Size  Date        Time   Name
---------  ---------  ----------  -----  ----
Directory          0  2017-10-06  15:17  python爬蟲\
File             272  2017-10-05  00:22  python爬蟲\html_downloader.py
File             678  2017-10-05  00:22  python爬蟲\html_downloader.pyc
File             784  2017-10-06  11:28  python爬蟲\html_outputer.py
File            1371  2017-10-06  11:28  python爬蟲\html_outputer.pyc
File            1235  2017-10-06  11:48  python爬蟲\html_parser.py
File            1714  2017-10-06  11:49  python爬蟲\html_parser.pyc
File            1173  2017-10-06  11:54  python爬蟲\Main.py
File          475449  2017-10-06  12:06  python爬蟲\output.html
File             724  2017-10-04  23:52  python爬蟲\url_manager.py
File            1497  2017-10-04  23:53  python爬蟲\url_manager.pyc