Multithreaded Website Images

Size: 1.99 KB

File type: .py

Coins: 1

Downloads: 0

Published: 2021-01-30
Language: Python
Tags: multithreading, threads, download, web page

Resource description

Downloads images from a website using multiple threads.

Code snippet and file information

# -*- coding: utf-8 -*-
# Python 2.7: building a multithreaded web crawler
# Libraries: threading (standard library); install lxml, requests, and bs4 (BeautifulSoup 4)
import threading  # multithreading

import requests
from lxml import etree  # parsing
from bs4 import BeautifulSoup


# Fetch the page source
def get_html(url):
	# url = 'https://www.doutula.com/article/list/?page=1'
	headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/53.0.2785.104 Safari/537.36 Core/1.53.2595.400 QQBrowser/9.6.10872.400'}
	request = requests.get(url=url, headers=headers)
	response = request.content  # raw page source (bytes)
	# print response
	return response


# Find the hyperlinks to the image pages and fetch their source
# Parse the outer (listing) page and collect the hyperlinks
def get_img_html(html):
	soup = BeautifulSoup(html, 'lxml')  # parse with lxml; html.parser is the built-in alternative
	all_a = soup.find_all('a', class_='list-group-item')  # find the matching <a> tags
	for i in all_a:  # i is a link tag
		img_html = get_html(i['href'])  # fetch the source of the linked page
		g
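
The snippet above is cut off before any threading code appears. As a rough illustration only, and not the author's implementation, the following sketch shows one common way to spread downloads over worker threads using the standard `threading` and `queue` modules. The names `download_all`, `fetch`, and `num_threads` are hypothetical; `fetch` stands for any callable that takes a URL and returns its content, such as `get_html`.

```python
import threading

try:
    import queue  # Python 3
except ImportError:
    import Queue as queue  # Python 2.7


def download_all(urls, fetch, num_threads=4):
    """Fetch every URL in `urls` with `num_threads` worker threads.

    `fetch` is a hypothetical callable: it takes a URL and returns its
    content. Returns a dict mapping each URL to the fetched content, or
    to the exception raised while fetching it.
    """
    tasks = queue.Queue()
    for url in urls:
        tasks.put(url)

    results = {}
    lock = threading.Lock()  # guards writes to the shared results dict

    def worker():
        while True:
            try:
                url = tasks.get_nowait()  # take the next pending URL
            except queue.Empty:
                return  # nothing left to do, so this thread exits
            try:
                content = fetch(url)
            except Exception as exc:  # keep one failure from killing the worker
                content = exc
            with lock:
                results[url] = content

    threads = [threading.Thread(target=worker) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()  # wait until every worker has finished
    return results
```

For example, `download_all(page_urls, get_html, num_threads=8)` would return a dict mapping each URL to its page source, where `page_urls` is a list of URLs you supply. A queue is used here so that each URL is handed to exactly one thread without manual index bookkeeping.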
