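Resource Overview
A Chinese web page auto-classifier built on the kNN algorithm. It covers web page preprocessing, Chinese word segmentation with ICTCLAS, TF-IDF-based text feature representation, DF-based (document frequency) feature selection, and kNN-based classification, and is finally published as a web application through the Struts2 framework.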
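
The core of the pipeline described above (DF-based feature selection, TF-IDF weighting, kNN classification) can be sketched roughly as follows. This is a minimal illustration, not the code shipped in the package: the class name `KnnTextSketch`, its method names, the `minDf` threshold, the use of cosine similarity, the unsmoothed `ln(N / df)` IDF variant, and the toy documents are all assumptions. The input is assumed to be already segmented into tokens, which is the step ICTCLAS performs in the described system.

```java
import java.util.*;

// Hypothetical sketch: DF feature selection + TF-IDF + kNN on pre-segmented documents.
class KnnTextSketch {

    // Document frequency: number of documents that contain each term at least once.
    static Map<String, Integer> documentFrequency(List<List<String>> docs) {
        Map<String, Integer> df = new HashMap<>();
        for (List<String> doc : docs) {
            for (String term : new HashSet<>(doc)) {
                df.merge(term, 1, Integer::sum);
            }
        }
        return df;
    }

    // DF-based feature selection: keep terms that occur in at least minDf documents.
    static List<String> selectFeatures(Map<String, Integer> df, int minDf) {
        List<String> vocab = new ArrayList<>();
        for (Map.Entry<String, Integer> e : df.entrySet()) {
            if (e.getValue() >= minDf) vocab.add(e.getKey());
        }
        Collections.sort(vocab); // fixed order so vector positions are stable
        return vocab;
    }

    // TF-IDF vector over the selected vocabulary: raw term count times ln(N / df).
    static double[] tfidf(List<String> doc, List<String> vocab,
                          Map<String, Integer> df, int numDocs) {
        double[] v = new double[vocab.size()];
        for (int i = 0; i < vocab.size(); i++) {
            String term = vocab.get(i);
            int tf = Collections.frequency(doc, term);
            v[i] = tf * Math.log((double) numDocs / df.get(term));
        }
        return v;
    }

    // Cosine similarity; returns 0 when either vector is all zeros.
    static double cosine(double[] a, double[] b) {
        double dot = 0, na = 0, nb = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            na += a[i] * a[i];
            nb += b[i] * b[i];
        }
        return (na == 0 || nb == 0) ? 0 : dot / Math.sqrt(na * nb);
    }

    // kNN: majority vote among the k training vectors most similar to the query.
    static String classify(double[] query, List<double[]> train,
                           List<String> labels, int k) {
        Integer[] idx = new Integer[train.size()];
        for (int i = 0; i < idx.length; i++) idx[i] = i;
        // Sort training indices by descending similarity to the query.
        Arrays.sort(idx, (x, y) -> Double.compare(
                cosine(query, train.get(y)), cosine(query, train.get(x))));
        Map<String, Integer> votes = new HashMap<>();
        String best = null;
        for (int i = 0; i < Math.min(k, idx.length); i++) {
            String label = labels.get(idx[i]);
            int count = votes.merge(label, 1, Integer::sum);
            if (best == null || count > votes.get(best)) best = label;
        }
        return best;
    }

    public static void main(String[] args) {
        // Toy training set (made up for illustration), already tokenized.
        List<List<String>> docs = Arrays.asList(
                Arrays.asList("football", "match", "team"),
                Arrays.asList("team", "match", "goal"),
                Arrays.asList("stock", "market", "price"),
                Arrays.asList("market", "price", "bank"));
        List<String> labels = Arrays.asList("sports", "sports", "finance", "finance");

        Map<String, Integer> df = documentFrequency(docs);
        List<String> vocab = selectFeatures(df, 2);

        List<double[]> train = new ArrayList<>();
        for (List<String> doc : docs) train.add(tfidf(doc, vocab, df, docs.size()));

        double[] query = tfidf(Arrays.asList("match", "team", "score"), vocab, df, docs.size());
        System.out.println(classify(query, train, labels, 3)); // prints "sports"
    }
}
```

With `minDf = 2`, terms that appear in only one training document are dropped before vectors are built, which keeps the feature space small. A real system would likely tune this threshold and `k` on held-out data.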

Code Snippet and File Information
/*
  WVTool - Word Vector Tool
  Copyright (C) 2001-2007
    Michael Wurst
  web:   http://wvtool.sourceforge.net
  This program is free software; you can redistribute it and/or
  modify it under the terms of the GNU General Public License as
  published by the Free Software Foundation; either version 2 of the
  License, or (at your option) any later version.
  This program is distributed in the hope that it will be useful, but
  WITHOUT ANY WARRANTY; without even the implied warranty of
  MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  General Public License for more details.
  You should have received a copy of the GNU General Public License
  along with this program; if not, write to the Free Software
  Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307
  USA.
*/
package edu.udo.cs.wvtool.config;

import edu.udo.cs.wvtool.util.WVToolException;

/**
 * Exception that is thrown if a configuration problem is encountered.
 *
 * @author Michael Wurst
 * @version $Id$
 *
 */
public class WVTConfigException extends WVToolException {
    // This is just a dummy class
}
  Attribute      Size  Date       Time   Name
----------- ---------  ---------- -----  ----
       File       105  2007-10-05 20:20  src\alternate.properties
       File       236  2007-10-30 11:39  src\edu\udo\cs\wvtool\config\package.html
       File      1177  2007-10-30 11:39  src\edu\udo\cs\wvtool\config\WVTConfigException.java
       File      6618  2007-10-30 11:39  src\edu\udo\cs\wvtool\config\WVTConfiguration.java
       File      2761  2007-10-30 11:39  src\edu\udo\cs\wvtool\config\WVTConfigurationFact.java
       File      1736  2007-10-30 11:39  src\edu\udo\cs\wvtool\config\WVTConfigurationRule.java
       File      2551  2007-10-30 11:39  src\edu\udo\cs\wvtool\crawler\CrawledInputList.java
       File       246  2007-10-30 11:39  src\edu\udo\cs\wvtool\crawler\package.html
       File      2144  2007-10-30 11:39  src\edu\udo\cs\wvtool\crawler\WVToolCrawler.java
       File      9406  2007-10-30 11:39  src\edu\udo\cs\wvtool\external\GermanStemmer.java
       File     31879  2007-10-30 11:39  src\edu\udo\cs\wvtool\external\LovinsStemmer.java
       File       285  2007-10-30 11:39  src\edu\udo\cs\wvtool\external\package.html
       File     16726  2007-10-30 11:39  src\edu\udo\cs\wvtool\external\Stemmer.java
       File     21557  2007-10-30 11:39  src\edu\udo\cs\wvtool\external\Stopwords.java
       File      1943  2007-10-30 11:39  src\edu\udo\cs\wvtool\external\StopwordsCzech.java
       File      3099  2007-10-30 11:39  src\edu\udo\cs\wvtool\external\StopwordsFrench.java
       File      5284  2007-10-30 11:39  src\edu\udo\cs\wvtool\external\StopWordsGerman.java
       File     16530  2007-10-30 11:39  src\edu\udo\cs\wvtool\external\xm
       File      1355  2007-10-30 11:39  src\edu\udo\cs\wvtool\generic\charmapper\DummyCharConverter.java
       File       225  2007-10-30 11:39  src\edu\udo\cs\wvtool\generic\charmapper\package.html
       File      1777  2007-10-30 11:39  src\edu\udo\cs\wvtool\generic\charmapper\WVTCharConverter.java
       File       225  2007-10-30 11:39  src\edu\udo\cs\wvtool\generic\inputfilter\package.html
       File      1965  2007-10-30 11:39  src\edu\udo\cs\wvtool\generic\inputfilter\PDFInputFilter.java
       File      2879  2007-10-30 11:39  src\edu\udo\cs\wvtool\generic\inputfilter\SelectingInputFilter.java
       File      4470  2007-11-08 15:05  src\edu\udo\cs\wvtool\generic\inputfilter\SimpleTagIgnoringReader.java
       File      3244  2007-10-30 11:39  src\edu\udo\cs\wvtool\generic\inputfilter\TagIgnoringReader.java
       File      1975  2007-10-30 11:39  src\edu\udo\cs\wvtool\generic\inputfilter\TextInputFilter.java
       File      1861  2007-10-30 11:39  src\edu\udo\cs\wvtool\generic\inputfilter\WVTInputFilter.java
       File      2549  2007-10-30 11:39  src\edu\udo\cs\wvtool\generic\inputfilter\xm
       File       219  2007-10-30 11:39  src\edu\udo\cs\wvtool\generic\loader\package.html
............ (115 more file entries omitted)