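Resource overview
Fuzzy reinforcement learning requires the fuzzy rules to be defined in advance. This code walks through the reinforcement learning process on the Cartpole task; it is worth a look if you are interested in the approach.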
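To make "defining the fuzzy rules in advance" concrete, here is a minimal sketch of triangular and trapezoidal membership functions and of how they combine into rule firing strengths. The script on this page builds its inputs from FuzzySet.Triangles and FuzzySet.Trapeziums, but the contents of FuzzySet.py and FIS.py are not shown, so the function names below (triangle, trapezium, rule_strengths) are hypothetical, and using the product to combine membership degrees is an assumption rather than the archive's confirmed behaviour.

```python
from itertools import product
from math import prod


def triangle(x, left, peak, right):
    """Membership degree of x in a triangular fuzzy set (0 outside, 1 at the peak)."""
    if x <= left or x >= right:
        return 0.0
    if x <= peak:
        return (x - left) / (peak - left)
    return (right - x) / (right - peak)


def trapezium(x, left, left_top, right_top, right):
    """Membership degree of x in a trapezoidal fuzzy set (1 on the flat top)."""
    if x <= left or x >= right:
        return 0.0
    if x < left_top:
        return (x - left) / (left_top - left)
    if x <= right_top:
        return 1.0
    return (right - x) / (right - right_top)


def rule_strengths(degrees_per_input):
    """One rule per combination of fuzzy sets, one set taken from each input.

    Assumption: a rule's firing strength is the product of its membership degrees.
    """
    return [prod(combo) for combo in product(*degrees_per_input)]


# Two inputs: the first has two fuzzy sets, the second has two as well.
first = [triangle(0.75, -1.5, 0, 1.5), triangle(0.75, 0, 1.5, 3.0)]
second = [trapezium(0.0, -1, -0.5, 0.5, 1), trapezium(0.0, 0.5, 1, 2, 2.4)]
strengths = rule_strengths([first, second])  # four rules in total
```

If every combination of sets is turned into a rule in this way, the four inputs in the script below (3, 2, 3 and 3 fuzzy sets) would give 3 x 2 x 3 x 3 = 54 rules.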

Code snippet and file information
import FuzzySet
import StateVariable
import FQL
import FIS
from Environment import Environment
import matplotlib.pyplot as plt

# Create FIS: one set of fuzzy membership functions per state input
x1 = StateVariable.InputStateVariable(
    FuzzySet.Trapeziums(-2.4, -2, -1, -0.5),
    FuzzySet.Trapeziums(-1, -0.5, 0.5, 1),
    FuzzySet.Trapeziums(0.5, 1, 2, 2.4))
x2 = StateVariable.InputStateVariable(
    FuzzySet.Triangles(-2.4, -0.5, 1),
    FuzzySet.Triangles(-0.5, 1, 2.4))
x3 = StateVariable.InputStateVariable(
    FuzzySet.Triangles(-3.14159, -1.5, 0),
    FuzzySet.Triangles(-1.5, 0, 1.5),
    FuzzySet.Triangles(0, 1.5, 3.14159))
x4 = StateVariable.InputStateVariable(
    FuzzySet.Triangles(-3.14159, -1.5, 0),
    FuzzySet.Triangles(-1.5, 0, 1.5),
    FuzzySet.Triangles(0, 1.5, 3.14159))
fis = FIS.Build(x1, x2, x3, x4)

# Create Model
angle_list = []
model = FQL.Model(gamma=0.9, alpha=0.1, ee_rate=0.999, q_initial_value='random',
                  action_set_length=21, fis=fis)
env = Environment()

for iteration in range(0, 5000):
    # Restart every 100 iterations or after a failure (reward == -1).
    # On iteration 0 the first condition is true, so reward is not read before it is set.
    if iteration % 100 == 0 or reward == -1:
        env.__init__()  # reset the environment to its initial state
        action = model.get_initial_action(env.state)
        reward, state_value = env.apply_action(action)
    action = model.run(state_value, reward)
    reward, state_value = env.apply_action(action)
    # Record the pole angle only for steps that did not fail
    if reward != -1:
        angle_list.append(state_value[2])

plt.figure(figsize=(14, 3))
plt.plot(angle_list)
plt.ylabel('Pole Angle')
plt.show()
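The script passes gamma, alpha, ee_rate and an action set of length 21 to FQL.Model, but FQL.py itself is not reproduced on this page. As a rough guide to what such a model typically does, the following is a minimal sketch of the commonly described fuzzy Q-learning update: each rule keeps its own row of Q-values, picks one action per rule, and the temporal-difference error is shared among the rules in proportion to their firing strengths. All function names and the epsilon-greedy selection are assumptions for illustration, not the archive's actual implementation.

```python
import random


def select_rule_actions(q_table, epsilon, rng=random):
    """Pick one action index per rule: random with probability epsilon, else greedy."""
    chosen = []
    for q_row in q_table:
        if rng.random() < epsilon:
            chosen.append(rng.randrange(len(q_row)))
        else:
            chosen.append(max(range(len(q_row)), key=q_row.__getitem__))
    return chosen


def normalise(strengths):
    """Scale firing strengths so they sum to 1 (all zeros if no rule fires)."""
    total = sum(strengths)
    if total <= 0:
        return [0.0] * len(strengths)
    return [s / total for s in strengths]


def q_value(q_table, strengths, chosen):
    """Q of the current state and the chosen per-rule actions."""
    weights = normalise(strengths)
    return sum(w * q_table[i][a] for i, (w, a) in enumerate(zip(weights, chosen)))


def state_value(q_table, strengths):
    """Value of a state: strength-weighted best Q-value of each rule."""
    weights = normalise(strengths)
    return sum(w * max(row) for w, row in zip(weights, q_table))


def fql_update(q_table, strengths, chosen, reward, next_strengths,
               alpha=0.1, gamma=0.9):
    """Apply one fuzzy Q-learning step in place and return the TD error."""
    td_error = (reward
                + gamma * state_value(q_table, next_strengths)
                - q_value(q_table, strengths, chosen))
    weights = normalise(strengths)
    for i, (w, a) in enumerate(zip(weights, chosen)):
        q_table[i][a] += alpha * td_error * w
    return td_error


# Two rules, two actions each; both rules fire equally in the current state.
q_table = [[0.0, 1.0], [2.0, 0.0]]
chosen = select_rule_actions(q_table, epsilon=0.0)  # purely greedy here
td = fql_update(q_table, [1.0, 1.0], chosen, reward=-1, next_strengths=[1.0, 0.0])
```

The alpha and gamma defaults mirror the values passed to FQL.Model above. In the continuous-action form of the method, the per-rule actions would also be blended into a single control output using the same normalised strengths.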
 Attribute        Size  Date        Time   Name
-----------  ---------  ----------  -----  ----
 File             1408  2018-01-25  19:07  模糊強化學習\Cartpole.py
 File             3116  2018-01-25  19:08  模糊強化學習\Environment.py
 File             2912  2018-01-25  19:38  模糊強化學習\Environment.pyc
 File              572  2018-01-25  19:03  模糊強化學習\FIS.py
 File             1012  2018-01-25  19:38  模糊強化學習\FIS.pyc
 File             3840  2018-01-25  18:39  模糊強化學習\FQL.py
 File             4343  2018-01-25  19:38  模糊強化學習\FQL.pyc
 File             1495  2018-01-25  19:11  模糊強化學習\FuzzySet.py
 File             1583  2018-01-25  19:38  模糊強化學習\FuzzySet.pyc
 File              201  2018-01-25  19:12  模糊強化學習\StateVariable.py
 File              642  2018-01-25  19:38  模糊強化學習\StateVariable.pyc
 Directory           0  2018-01-25  19:38  模糊強化學習
-----------  ---------  ----------  -----  ----
                 21124                     12 items