資源簡(jiǎn)介
別人寫的C4.5的matlab程序,自己加了詳細(xì)的注釋,希望對(duì)大家有用
代碼片段和文件信息
function?D?=?C4_5(train_features?train_targets?inc_node?region)
%?Classify?using?Quinlan‘s?C4.5?algorithm
%?Inputs:
%?features ??-?Train?features
% targets ????-?Train?targets
% inc_node????-?Percentage?of?incorrectly?assigned?samples?at?a?node
%???inc_node參數(shù)用于作為迭代結(jié)束的條件,我覺(jué)得就是葉子節(jié)點(diǎn)可以包含的最大的樣本數(shù)
%???inc_node參數(shù)太大的話會(huì)導(dǎo)致分類準(zhǔn)確率下降,太小的話可能會(huì)導(dǎo)致過(guò)擬合
% region ????-?Decision?region?vector:?[-x?x?-y?y?number_of_points]
%
%?Outputs
% D -?Decision?sufrace
%NOTE:?In?this?implementation?it?is?assumed?that?a?feature?vector?with?fewer?than?10?unique?values?(the?parameter?Nu)
%is?discrete?and?will?be?treated?as?such.?Other?vectors?will?be?treated?as?continuous
[Ni?M] =?size(train_features);%M是樣本數(shù),Ni是樣本維數(shù)
inc_node????=?inc_node*M/100;
Nu??????????=?10;
%For?the?decision?region
N??????
?屬性????????????大小?????日期????時(shí)間???名稱
-----------?---------??----------?-----??----
?????文件???????6771??2010-04-05?23:28??C4_5.m
-----------?---------??----------?-----??----
?????????????????6771????????????????????1
評(píng)論
共有 條評(píng)論