資源簡介
Apriori算法Python實現
代碼片段和文件信息
#!/usr/bin/python
#?-*-?coding:utf-8?-*-
from?itertools?import?combinations
def?load_data_set():
????‘‘‘
????Returns:
????--------
????like?this:?[[1?3?4]?[2?3?5]?[1?2?3?5]?[2?5]]
????‘‘‘
????with?open(“order2016-08-31“)?as?fp:
????????itemid_list?=?[]
????????for?line?in?fp:
????????????order_id?order_lst?=?line.rstrip(‘\n‘).split(‘\t‘)
????????????order_lst?=?eval(order_lst)
????????????if?len(order_lst)?>?1:
????????????????order_lst?=?[int(itemid)?for?itemid?in?order_lst]
????????????????itemid_list.append(order_lst)
????return?itemid_list
def?subtract_item_set(pre_discard_itemset?candidate_set):
????‘‘‘
????首先去除候選集中不符合非頻繁項集的那些元素,
????在當前候選集中去掉上一輪刪除的項集,
????比如{2?3}是非頻繁項集,那么就將刪除candidate_set中的{2?3?x}這些項集
????Parameters:
????-----------
評論
共有 條評論