sarsa.doc
sarsa, n.: sarsaparilla; a medicine extracted from sarsaparilla root. English definition: SARSA (State-Action-Reward-State-Action) is an algorithm for learning a Markov decision process policy, used in the reinforcement learning area of machine learning. It was introduced in a technical note Online Q-Learning using
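The name spells out the quantities used in one learning step: the current state and action, the reward received, and the next state and action. The standard tabular update is Q(s, a) ← Q(s, a) + α [r + γ Q(s', a') − Q(s, a)]. The following is a minimal sketch of that update, not code from the source. The function names, the five-state chain task, and the default values of `alpha`, `gamma`, and `epsilon` are all hypothetical choices made for illustration.

```python
import random
from collections import defaultdict


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.5, gamma=0.9, done=False):
    """Apply one SARSA update to Q in place and return the new Q[(s, a)]."""
    # On a terminal transition there is no next action to bootstrap from.
    target = r if done else r + gamma * Q[(s_next, a_next)]
    Q[(s, a)] += alpha * (target - Q[(s, a)])
    return Q[(s, a)]


def epsilon_greedy(Q, s, actions, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    best = max(Q[(s, a)] for a in actions)
    # Break ties at random so the untrained agent does not always pick one side.
    return rng.choice([a for a in actions if Q[(s, a)] == best])


def run_chain(episodes=200, n_states=5, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Hypothetical toy task: states 0..n_states-1, reward 1 at the right end."""
    rng = random.Random(seed)
    actions = [-1, +1]          # step left or step right
    Q = defaultdict(float)      # unseen (state, action) pairs start at 0.0
    for _ in range(episodes):
        s = 0
        a = epsilon_greedy(Q, s, actions, epsilon, rng)
        while True:
            s_next = min(max(s + a, 0), n_states - 1)   # walls at both ends
            done = s_next == n_states - 1
            r = 1.0 if done else 0.0
            # On-policy: the next action is chosen by the same policy
            # and is the one used in the update target.
            a_next = epsilon_greedy(Q, s_next, actions, epsilon, rng)
            sarsa_update(Q, s, a, r, s_next, a_next, alpha, gamma, done)
            if done:
                break
            s, a = s_next, a_next
    return Q
```

Calling `run_chain()` returns the learned table. In this toy task the value of stepping right from the state next to the goal ends up higher than the value of stepping left. The update target uses the action the policy actually takes next, which is what makes the method on-policy.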