91av视频/亚洲h视频/操亚洲美女/外国一级黄色毛片 - 国产三级三级三级三级

<tbody id="9mne5"></tbody>

<optgroup id="9mne5"><ruby id="9mne5"><dfn id="9mne5"></dfn></ruby></optgroup>

<tbody id="9mne5"></tbody>

源碼之巔峰

python寫的小型搜索引擎

收藏(0)

大小: 203KB

文件類型: .zip

金幣: 2

下載: 1 次

發布日期: 2021-06-05
語言: Python
標簽: python??搜索引擎??網絡抓包??網頁下載??

高速下載

資源簡介

自己利用空閑時間寫的一款再dos下運行的簡單搜索引擎，可以再自己給定的網頁范圍內查找信息，并且下載指定網頁上的內容。內中包含簡單的工程文檔，代碼還算規范，所以不需要太多注釋就基本能看懂。學習python沒多久的同學可以看一下，對于學習python能夠給出一定的啟發

資源截圖

小圖大圖

代碼片段和文件信息

#-*-?coding:utf-8?-*-

import?urllib
import?os
import?re
import?HtmlToText
import?SearchEngineLog

class?SearchEngine:
	def?__init__（self）:
		self.lstSearchedItems?	=?[]
		self.lstKeywords?		=?[]
		self.strKeywords?		=?‘‘
		self.iItemsEachPage?	=?10
		self.iCurrentPage???????=?1
		self.strConfigFile?		=?‘./config.ini‘
		self.log?				=?SearchEngineLog.SearchEngineLog（）
		self.initCommand（）
		self.readConfigFile（）
		self.headWidth			=?80
		self.strCurrentSite		=?‘‘
		
	def?initCommand（self）:
		self.cmdCommand?		=?‘command‘
		self.cmdKeywords?		=?‘keywords‘
		self.cmdQuit?			=?‘q‘
		self.cmdBack?			=?‘b‘
		self.cmdNextPage?		=?‘n‘
		self.cmdPrevPage?		=?‘l‘
		self.cmdRefresh?		=?‘r‘
		self.cmdSavePage?		=?‘s‘

	def?work（self）:
		while?True:
			self.mainSurface（‘‘）
			self.useKeywordsInput（）
			
	def?mainSurface（selfinfo）:
		self.strSurface?=?‘MAIN_SURFACE‘
		os.system（‘cls‘）
		print?‘=‘?*?self.headWidth
		print?‘?‘?*?（（?self.headWidth?-?len（‘SEARCH?ENGINE‘）?）/2）?‘SEARCH?ENGINE‘
		print?‘?‘?*?（（?self.headWidth?-?len（info）?）/2）?info
		print?‘=‘?*?self.headWidth
		
	def?searchSurface（selfinfo）:
		self.strSurface?=?‘SEARCH_SURFACE‘
		os.system（‘cls‘）
		print?‘=‘?*?self.headWidth
		print?‘The?search?result?of?:??‘??self.strKeywords
		print?‘-‘?*?self.headWidth
		if?len（self.lstSearchedItems）?==?0:
			print?‘Cannot?find?“%s“!‘?%?self.strKeywords
		else?:
			iCount?=?0
			for?item?in?self.lstSearchedItems:
				if?item[0]?>?（self.iCurrentPage-1）*self.iItemsEachPage?and?item[0]?<=?self.iItemsEachPage?*?self.iCurrentPage:
					iCount?=?iCount?+?1
					print?‘%d??%s‘?%?（iCountitem[1]）
					print?‘??‘item[2]
					print?‘‘
		print?‘=‘?*?self.headWidth
		print?‘Current?page:?%d/%d‘?%?（self.iCurrentPagelen（self.lstSearchedItems）/self.iItemsEachPage?+?1）
		
	def?downWebsite（selfurlpath）:
		print?urlpath?‘is?downloading...‘
		####創建存放網頁內容的文件夾
		regex?=?r‘（.*//www.）（.*）（.com|.cn|.net）‘
		res?=?re.match（regexurlpath）
		saveFolder?=?res.group（2）
		cmd?=?‘md?‘?+?saveFolder
		os.system（cmd）
		textFilePath?=?saveFolder+?‘/‘?+res.group（2）+‘.html‘
		####下載文本網頁
		print?‘downloading?the?html?file...‘
		webContex?=?‘‘
		try:
			ul?=?urllib.urlopen（urlpath）
			webContext?=?ul.read（）
			ul.close（）
		except?Exceptionerr:
			print?‘Cannot?open?%splease?check?your?network!‘?%?urlpath
			self.log.errorLog（‘download?website?“%s“?fail‘?%?urlpath）
			exit（-1）
		try:
			file?=?open（textFilePath‘w‘）
			file.write（webContext）
			file.close（）
		except?Exceptionerr:
			print?‘Create?file?“%s“?fail!‘?%?textFilePath
			self.log.errorLog（‘create?file?“%s“?fail‘?%?textFilePath）
		####下載圖片
		print?‘downloading?pictures...‘
		regex?=?r‘（http:.+?\.png|http:.+?\.jpg|http:.+?\.jpeg|http:.+?\.gif|http:.+?\.bmp）‘
		lstPictures?=?re.findall（regexwebContext）
		for?picPath?in?lstPictures:
			regex?=?r‘（.*）（<|>|“）（.*）‘
			if?re.match（regexpicPath）:?continue
			regex?=?r‘（.*）（/.*）‘
			picName

?屬性????????????大小?????日期????時間???名稱
-----------?---------??----------?-----??----
?????目錄???????????0??2016-12-28?17:25??SearchEngine\
?????文件???????49374??2016-12-28?17:12??SearchEngine\Capture.PNG
?????文件???????45274??2016-12-28?17:12??SearchEngine\Capture1.PNG
?????文件???????61643??2016-12-28?17:13??SearchEngine\Capture2.PNG
?????文件???????34963??2016-12-28?17:15??SearchEngine\Capture3.PNG
?????文件???????33687??2016-12-28?17:20??SearchEngine\Capture4.PNG
?????文件?????????105??2016-12-28?17:14??SearchEngine\config.ini
?????文件????????3873??2016-12-28?16:58??SearchEngine\document.txt
?????文件????????2673??2016-12-28?17:20??SearchEngine\SearchEngine.log
?????文件????????8351??2016-12-28?17:19??SearchEngine\SearchEngine.py
?????文件???????????0??2016-12-28?16:54??SearchEngine\SearchEngine.pyc
?????文件????????1757??2016-12-28?15:35??SearchEngine\SearchEngineLog.py
?????文件????????3525??2016-12-28?15:36??SearchEngine\SearchEngineLog.pyc

上一篇：自制驗證碼數據集生成程序
下一篇：python3_爬取網上資源存入數據庫中

評論

共有條評論

相關資源

二級考試python試題12套（包括選擇題和
pywin32_python3.6_64位
python+ selenium教程
PycURL（Windows7/Win32）Python2.7安裝包 P
英文原版-Scientific Computing with Python
7.圖像風格遷移基于深度學習 pyt
基于Python的學生管理系統
A Byte of Python（簡明Python教程）（第
Python實例174946
Python 人臉識別
Python 人事管理系統
基于python-flask的個人博客系統
計算機視覺應用開發流程
python 調用sftp斷點續傳文件
python socket游戲
基于Python爬蟲爬取天氣預報信息
python函數編程和講解
Python開發的個人博客
基于python的三層神經網絡模型搭建
python實現自動操作windows應用
python人臉識別（opencv）
python 繪圖（方形、線條、圓形）
python疫情卡UN管控
python 連連看小游戲源碼
基于PyQt5的視頻播放器設計
一個簡單的python爬蟲
csv文件行列轉換python實現代碼
Python操作Mysql教程手冊
Python Machine Learning Case Studies
python獲取硬件信息

<listing id="0hqam"><b id="0hqam"><rp id="0hqam"></rp></b></listing>

<tbody id="0hqam"><acronym id="0hqam"><rp id="0hqam"></rp></acronym></tbody>

<dl id="0hqam"><source id="0hqam"></source></dl>

<dl id="0hqam"><source id="0hqam"></source></dl>