91av视频/亚洲h视频/操亚洲美女/外国一级黄色毛片 - 国产三级三级三级三级

資源簡介

matlab編寫的文本分類的程序,可以對已經分好詞的文本進行分類,先自己導入數據,用libsvm中的svm進行分類和預測,特征用tfidf算法,還利用卡方檢驗進行了特征選擇,可自行設定閾值。

資源截圖

代碼片段和文件信息

%----wordtrain為每條視頻分詞后的標題
%----每次運行時,要導入wordtrain.txt文本
text=textread(‘wordtrain.txt‘‘%s‘);???????????%提取文本中的單詞
stopword=textread(‘stopwordchinese.txt‘‘%s‘);???%提取stopword中的單詞
[a]=worddictionary(textstopword);%a(13).word為所有出現過的并除去信用詞詞條
[counttfidfweight]=tfidf(wordtrain‘a(13).word);
model=svmtrain(wordtrain_labelweight‘-c?1?-g?0.07‘);

?屬性????????????大小?????日期????時間???名稱
-----------?---------??----------?-----??----
?????文件?????????281??2013-10-24?00:10??text?classification\datatest.txt
?????文件?????????337??2013-10-24?00:15??text?classification\datatrain.txt
?????文件?????????404??2013-10-24?17:08??text?classification\extractwords.m
?????文件??????????70??2013-10-23?22:46??text?classification\inputchinese1.txt
?????文件????????9904??2013-10-23?20:52??text?classification\porterStemmer.m
?????文件????????6364??2009-11-23?15:29??text?classification\stopwordchinese.txt
?????文件?????????239??2013-10-23?23:57??text?classification\test.mat
?????文件????????2723??2013-10-23?17:21??text?classification\tfidf.m
?????文件????????1547??2013-10-23?22:55??text?classification\worddictionary.m
?????文件?????????525??2013-10-24?21:08??text?classification\wordpredict.asv
?????文件?????????525??2013-10-24?09:20??text?classification\wordpredict.m
?????文件?????????358??2013-10-24?00:19??text?classification\wordtest.txt
?????文件?????????441??2013-10-24?00:22??text?classification\wordtrain.txt
?????文件?????????195??2013-10-24?08:51??text?classification\wordtrain_label.mat
?????文件???????26967??2013-10-24?16:55??text?classification\文本特征詞提取步驟.docx
?????文件?????????717??2013-10-23?23:30??text?classification\新建?文本文檔.txt

評論

共有 條評論