資源簡(jiǎn)介
requests庫(kù)和beautifulsoup庫(kù)爬取國(guó)內(nèi)大學(xué)排名
代碼片段和文件信息
import?requests
from?bs4?import?BeautifulSoup
import?bs4
def?getHTMLText(url):
????try:
????????r?=?requests.get(url?timeout?=?30)
????????r.raise_for_status()
????????r.encoding?=?r.apparent_encoding
????????return?r.text
????except:
????????return?““
def?fillUnivList(ulist?html):
????soup??=?BeautifulSoup(html?“html.parser“)
????for?tr?in?soup.find(‘tbody‘).children:
????????if?isinstance(tr?bs4.element.Tag):
????????????tds?=?tr(‘td‘)
????????????ulist.append([t
評(píng)論
共有 條評(píng)論