資源簡介
Python關(guān)于豆瓣電影信息的爬蟲,抓起1w條電影數(shù)據(jù)只要一分鐘左右,
代碼片段和文件信息
import?json
from?multiprocessing?import?Pool
import?pymongo
import?requests
#?電影數(shù)據(jù)爬蟲
#?電影ID?電影title電影Genders
#?directors導(dǎo)演?rate評分?cover_x?star?title?url?casts主演?cover海報?id
headers?=?{
????“Accept“:?“text/htmlapplication/xhtml+xmlapplication/xml;q=0.9image/webpimage/apng*/*;q=0.8“
????“Accept-Encoding“:?“gzipdeflatebr“
????“Accept-Language“:?“zh-CNzh;q=0.9“
????“Cache-Control“:?“no-cache“
????“Connection“:?“keep-alive“
????#?“Cookie“:?“bid=imNup50_JnI“
????“Host“:?“movie.douban.com“
????“Pragma“:?“no-cache“
????“Upgrade-Insecure-Requests“:?“1“
????“User-Agent“:?“Mozilla/5.0?(Windows?NT?10.0;?Win64;?x64)?AppleWebKit/537.36?(KHTML?like?Gecko)?“
??????????????????“Chrome/71.0.3578.98?Safari/537.36?“
}
url?=?“https://movie.douban.com/j/new_search_subjects?sort=T&range=010&tags=&start={}“
#?聲名數(shù)據(jù)庫對象
clien
評論
共有 條評論