中国大学MOOC: There are two optimal policies for Dynamic Programming, one is ______________, and the other is policy iteration.动态规划有两种优化策略，一个是___________，而另一种是策略迭代。

公告：维护QQ群：833371870，欢迎加入！
公告：维护QQ群：833371870，欢迎加入！
公告：维护QQ群：833371870，欢迎加入！

2021-04-14

中国大学MOOC: There are two optimal policies for Dynamic Programming, one is ____, and the other is policy iteration.动态规划有两种优化策略，一个是_，而另一种是策略迭代。

答案：

查看

举一反三

There are two optimal policies..._________，而另一种是策略迭代。
动态规划（dynamic programming）是运筹学的一个分支，是解决（）最优化问题的数学方法。
中国大学MOOC: There are two approaches to searching for a plan, one is ________________ search, and the other is backward relevant-states search.有两种搜索计划的方式，一个是_____________搜索，而另一个是后向状态空间搜索。
动态规划的最优化原理是指最优策略的任意一个子策略也是最优的。（）
通常使用两种消极管理策略：一种是指数策略，另一种是免疫策略。()