当前位置:
首页
资源下载

搜索资源 - Markov Decision Process
搜索资源列表
-
0下载:
一个很有用的Markov Decision Process matlab程序-a very useful Markov Decision Process procedures Matlab
-
-
0下载:
The MDP toolbox proposes functions related to the resolution of discrete-time Markov Decision Process : finite horizon, value iteration, policy iteration, linear programming algorithms with some variants.
The functions (m-functions) were developpe
-
-
0下载:
Markov Decision Process (MDP) Toolbox
-
-
4下载:
马氏过程的应用很广, 机器人路径计划, 自动飞行器导航,多目标跟踪, 电梯计划, 网络交换和路由, 银行客户保有等等。-Application of Markov process is broad, robot path plan, automatic vehicle navigation, multi-target tracking, lift plans, network switching and routing, bank customers to maintain and so on.
-
-
1下载:
马尔科夫决策过程值迭代算法value iteration,策略迭代等函数代码,从国外网站下载,非常详细,有用。-Markov decision process value iteration algorithm value iteration, policy iteration and so the function code, from the foreign website, very detailed and useful.
-
-
0下载:
the book <<Simulation and Monte Carlo With applications in finance and MCMC >> about MONte carlo method applying to finance problem and markov chain and markov decision process.
-
-
5下载:
马尔科夫决策过程的Matlab程序,包括一些例程-Markov Decision Process
-
-
0下载:
求解POMDP问题的一个重要方法,对策略空间进行简化-We propose a new approach to the problem
of searching a space of policies for a Markov
decision process (MDP) or a partially observable
Markov decision process (POMDP), given
a model.
-
-
0下载:
Markov Decision Process in java
-
-
0下载:
markov decision process (MDP) for Matlab
-
-
0下载:
Paper on the optimality of a non-stationary value iteration adaptive policy for a Partially Observed Markov Decision Proce-Paper on the optimality of a non-stationary value iteration adaptive policy for a Partially Observed Markov Decision Process
-
-
0下载:
马氏过程的应用很广, 机器人路径计划, 自动飞行器导航,多目标跟踪, 电梯计划, 网络交换和路由, 银行客户保有等等。-Application of Markov process is broad, robot path plan, automatic vehicle navigation, multi-target tracking, lift plans, network switching and routing, bank customers to maintain and so on.
-
-
0下载:
如何用matlab实现MDP中的值迭代算法或者策略迭代法-Markov decision process value iteration algorithm value iteration
-
-
0下载:
Analytical solution for 2-dimensional partially observable Markov Decision Processes.
-
-
0下载:
markov decision process in grid world game
-
-
0下载:
A Markov Decision Process (MDP) which decides what a trader should rationally and optimally do when s/he is in any state according to the descr iption with the source code.
-
-
2下载:
MDP代码,动态规划,基于马尔可夫决策过程,可以实现全局最优。-MDP code, dynamic programming, Markov decision process, we can achieve the global optimum
-
-
0下载:
清洁机器人根据马尔科夫决策过程确定移动轨迹,选择最优决策,确定移动取清洁还是去充电。-Cleaning robot is determined based on Markov decision process moving track, choose the best decisions about cleaning or moving to take charge.
-
-
0下载:
马尔卡夫决策过程理论定义了一个数学模型,可用于随机动态系统的最优决策过程。
强化学习利用这个数学模型将一个现实中的问题变成一个数学问题。
强化学习就是:追求最大回报G
追求最大回报G就是:找到最优的策略π?。
策略π?告诉在状态s,应该执行什么行动a。
最优策略可以由最优价值方法v?(s)或者q?(s,a)决定(The Markov decision process theory defines a mathematical model that can be used for the
-
-
1下载:
马尔可夫决策过程的例程,使用matlab实现(The example of Markov's decision-making process is implemented using MATLAB)
-