Python crawler for scraping novels from 136書屋 (www.136book.com): beautifulsoup4.py

Size: 2KB

File type: .py

Published: 2021-01-04
Language: Python
Tags: bs??

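Resource description

beautifulsoup4.py is a Python crawler that scrapes novels from 136書屋 (www.136book.com). It parses HTML and XML with the beautifulsoup4 package and opens URLs with urllib. Install beautifulsoup4 before use; urllib is part of the Python standard library and needs no separate installation. This example targets Python 2.7.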
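To make the parsing step concrete, here is a minimal sketch of how beautifulsoup4 can collect titled links from a page. It runs on an inline HTML string rather than the live site. The sample markup, the title text, and the helper name `collect_titled_links` are assumptions made for illustration; only the site prefix and the `zetianji` URL come from the script itself.

```python
from bs4 import BeautifulSoup

SITE_PREFIX = "http://www.136book.com/"  # site root used by the script

# Hypothetical sample markup, invented for illustration only.
SAMPLE_HTML = """
<html><body>
  <a href="http://www.136book.com/zetianji/" title="Ze Tian Ji">Ze Tian Ji</a>
  <a href="http://www.136book.com/untitled/">no title attribute</a>
  <a href="http://example.com/other/" title="Off-site">elsewhere</a>
  <a name="top">anchor without href</a>
</body></html>
"""


def collect_titled_links(html, prefix):
    """Map href -> title for every <a> whose href contains `prefix` and has a title."""
    soup = BeautifulSoup(html, "html.parser")
    links = {}
    for anchor in soup.find_all("a"):
        href = anchor.get("href")    # None when the attribute is missing
        title = anchor.get("title")  # None when the attribute is missing
        if href and prefix in href and title:
            links[href] = title
    return links


book_links = collect_titled_links(SAMPLE_HTML, SITE_PREFIX)
print(book_links)
```

Checking `href` for `None` before the substring test matters because `Tag.get` returns `None` for a missing attribute, and `in` raises a `TypeError` on `None`.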


Code snippet and file information

#coding=utf-8

from urllib import URLopener
from bs4 import BeautifulSoup as BS
import os
import sys

if __name__ == '__main__':
    # Local base folder (path as given in the original script).
    Bfolder = r"D:\LILUO\6.MyTools\12.beautifulsoup4\books"

    # Fetch the site's home page and parse it.
    url = "http://www.136book.com/"
    html = URLopener().open(url)
    soup = BS(html.read(), "html.parser")

    # Map every on-site link that carries a title attribute to that title.
    a = soup.find_all(name='a')
    BookDict = {}
    for each in a:
        href = each.get('href')  # None when the tag has no href
        if href and "http://www.136book.com/" in href:
            if each.get('title'):
                BookDict[href] = each.get('title')
    html.close()

    # Open each collected book page and gather its links.
    for burl in BookDict:
        #burl = "http://www.136book.com/zetianji/"
        bhtml = URLopener().open(burl)
        bsoup = BS(bhtml.read(), "html.parser")
        ba = bsoup.find_all(name='a')
        path = Bfol
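
The script imports `URLopener` from `urllib`, an import path that exists only in Python 2. For readers on Python 3, a rough equivalent of the open-and-parse step might look like the sketch below. This is a port written under assumptions, not part of the original file: the helper name `open_soup` and the 10-second timeout are hypothetical, and `urllib.request.urlopen` is swapped in for `URLopener`.

```python
from urllib.request import urlopen

from bs4 import BeautifulSoup


def open_soup(url, timeout=10):
    """Fetch `url` and return its content parsed with html.parser.

    The response is closed automatically when the `with` block exits,
    which replaces the explicit html.close() call in the Python 2 script.
    """
    with urlopen(url, timeout=timeout) as response:
        return BeautifulSoup(response.read(), "html.parser")
```

A call such as `open_soup("http://www.136book.com/")` would then stand in for the `URLopener().open(url)` and `BS(html.read(), "html.parser")` pair; whether the site still responds at that address is not something this sketch can confirm.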
