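Resource Description
This is the source code for all projects in the Coursera course Hadoop Platform and Application Framework. All of it passed the course's graded tests, so it should be correct. The course covers Hadoop and Spark, and anyone interested is welcome to use this code as a reference, though I recommend writing the programs yourself, since that does more to build your skills. The lecture notes and the installers for the development environment are not included, because they can be downloaded for free from the official site; only the code that the official site does not provide is attached here.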

Code Snippet and File Information
#!/usr/bin/env python
# The line above just indicates that python should be used to interpret this file
# ---------------------------------------------------------------
# This mapper code will input a line of text and output
#   a key/value pair for each word in it
# ---------------------------------------------------------------
import sys             # a python module with system functions for this OS
# ------------------------------------------------------------
#  this 'for loop' will set 'line' to an input line from system
#    standard input file
# ------------------------------------------------------------
for line in sys.stdin:
    # -----------------------------------
    # sys.stdin calls 'sys' to read a line from standard input
    # note that 'line' is a string object, i.e. a variable, and it has methods that you can apply to it
    # as in the next line
    # ---------------------------------
    line = line.strip()  # strip is a method, i.e. a function associated
                         #   with a string variable; it will strip leading and
                         #   trailing whitespace, including the newline (by default)
    keys = line.split()  # split line at blanks (by default)
                         #   and return a list of keys
    for key in keys:     # a for loop through the list of keys
        value = 1
        print('{0}\t{1}'.format(key, value))  # the {} is replaced by the 0th, 1st items in the format list
                            # also note that the Hadoop default is that a 'tab' separates the key from the value
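The mapper above emits one tab-separated `key<TAB>1` line per word. The archive's own wordcount_reducer.py is not shown on this page, so the following is only a minimal sketch of how a matching reducer could sum those counts, assuming its input arrives sorted by key (as Hadoop streaming delivers it). The names `map_lines`, `parse` and `reduce_counts` are hypothetical and are not taken from the course code.

```python
from itertools import groupby


def map_lines(lines):
    """Mirror the mapper above: yield one 'key<TAB>1' string per word."""
    for line in lines:
        for key in line.strip().split():
            yield '{0}\t{1}'.format(key, 1)


def parse(lines):
    """Turn 'key<TAB>count' strings into (key, count) tuples."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        key, _, value = line.partition('\t')
        yield key, int(value)


def reduce_counts(lines):
    """Sum the counts of consecutive lines that share a key.

    Assumption: the lines are already sorted by key, so equal keys
    are adjacent and groupby sees each key exactly once.
    """
    for key, group in groupby(parse(lines), key=lambda pair: pair[0]):
        yield key, sum(count for _, count in group)


# Local simulation of map -> sort -> reduce on made-up sample text.
sample = ['hello world', 'hello hadoop']
for key, total in reduce_counts(sorted(map_lines(sample))):
    print('{0}\t{1}'.format(key, total))
```

To use the same logic as a streaming reducer script, pass `sys.stdin` to `reduce_counts` in place of the sorted sample list. The `sorted` call here stands in for the shuffle-and-sort step that sits between the map and reduce phases.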
  Attribute      Size  Date       Time   Name
----------- ---------  ---------- -----  ----
  Directory         0  2015-12-30 14:30  code\
  Directory         0  2015-12-30 03:15  code\041 assignment wordcount\
       File      8080  2015-12-29 21:40  code\041 assignment wordcount\041 readme.txt
       File      3749  2015-12-29 22:12  code\041 assignment wordcount\A note on debugging mapreduce programs.txt
       File        41  2015-12-29 21:22  code\041 assignment wordcount\testfile1
       File        29  2015-12-29 21:22  code\041 assignment wordcount\testfile2
       File      1460  2015-12-29 21:17  code\041 assignment wordcount\wordcount_mapper.py
       File       100  2015-12-29 21:38  code\041 assignment wordcount\wordcount_num0_output.txt
       File      1963  2015-12-29 21:20  code\041 assignment wordcount\wordcount_reducer.py
  Directory         0  2015-12-30 03:15  code\042 assignment join1\
       File      1363  2015-12-30 02:16  code\042 assignment join1\042 readme.txt
       File        37  2015-12-29 23:08  code\042 assignment join1\join1_FileA.txt
       File       122  2015-12-29 23:08  code\042 assignment join1\join1_FileB.txt
       File      1580  2015-12-29 22:36  code\042 assignment join1\join1_mapper.py
       File      3953  2015-12-29 22:39  code\042 assignment join1\join1_reducer.py
  Directory         0  2015-12-30 03:16  code\043 assignment join2\
       File      1368  2015-12-30 02:16  code\043 assignment join2\043 readme.txt
       File      1714  2015-12-29 23:22  code\043 assignment join2\join2_genchanA.txt
       File      3430  2015-12-29 23:22  code\043 assignment join2\join2_genchanB.txt
       File      5152  2015-12-29 23:22  code\043 assignment join2\join2_genchanC.txt
       File     17114  2015-12-29 23:22  code\043 assignment join2\join2_gennumA.txt
       File     34245  2015-12-29 23:22  code\043 assignment join2\join2_gennumB.txt
       File     51400  2015-12-29 23:22  code\043 assignment join2\join2_gennumC.txt
       File      1216  2015-12-30 01:11  code\043 assignment join2\join2_mapper.py
       File       370  2015-12-30 03:02  code\043 assignment join2\join2_output.txt
       File      1017  2015-12-30 02:59  code\043 assignment join2\join2_reducer.py
       File       333  2015-12-29 23:20  code\043 assignment join2\make_data_join2.txt
       File      1188  2015-12-29 23:22  code\043 assignment join2\make_join2data.py
       File     14739  2015-12-30 01:39  code\043 assignment join2\original command aide.txt
  Directory         0  2015-12-30 04:31  code\051 assignment spark\
       File       513  2015-12-30 03:50  code\051 assignment spark\051 Install IPython.txt
............ 5 file entries omitted here