資源簡介
該程序調用IKAnalyzer3.2.3.jar提供的接口實現了對漢字詞的簡單分詞,目前尚不支持對帶有標點符號的段落的解析。

代碼片段和文件信息
import?java.io.IOException;
import?org.wltea.analyzer.*;
import?org.wltea.analyzer.dic.Dictionary;
import?java.io.*;
import?java.util.ArrayList;
import?java.util.List;
public?class?IKAnalyzerDemo?{
public?static?void?main(String?args[])?throws?IOException{
String?str=“管理和服務必須做到最好這是一種態度所以我們應該好好學習同學們聽明白了嗎“?;
//String?str=“如今預付費式的會員卡已經成為都市人的一種時尚“;
List?list=new?ArrayList();
list.add(“必須做到“);
Dictionary.loadExtendWords(list);
list.clear();
StringReader?in?=?new?StringReader(str);
IKSegmentation?ik=new?IKSegmentation(intrue);
String?out=““;
int?i=0;
while(true){
Lexeme?token=new?Lexeme(0?0?0?0);
token=ik.next();
if(token==null)
break;
str=token.getLexemeText();
int?pos=token.getBegin();
if(pos str=““;
else{
str=str+“/“;
i++;
}
out=out+str;
}
System.out.println(out);
}
}
?屬性????????????大小?????日期????時間???名稱
-----------?---------??----------?-----??----
?????文件??????19968??2011-10-31?09:25??說明.doc
?????文件????????413??2011-10-31?09:22??Ngram\.classpath
?????文件????????381??2011-10-24?10:08??Ngram\.project
?????文件????????629??2011-10-24?10:08??Ngram\.settings\org.eclipse.jdt.core.prefs
?????文件???????2078??2011-10-31?09:22??Ngram\bin\IKAnalyzerDemo.class
?????文件????????161??2011-10-31?09:18??Ngram\src\ext_stopword.dic
?????文件????????479??2009-09-22?11:37??Ngram\src\IKAnalyzer.cfg.xm
?????文件????????984??2011-10-31?09:21??Ngram\src\IKAnalyzerDemo.java
?????目錄??????????0??2011-10-31?09:36??Ngram\.settings
?????目錄??????????0??2011-10-31?09:36??Ngram\bin
?????目錄??????????0??2011-10-31?09:36??Ngram\src
?????目錄??????????0??2011-10-31?09:36??Ngram
-----------?---------??----------?-----??----
????????????????25093????????????????????12
評論
共有 條評論