Abstract: This paper presents bias-policy iteration, a modified adaptive dynamic programming method for the optimal control design of discrete-time nonlinear systems. First, the formulation ...
Abstract: Reinforcement learning (RL) has roots in dynamic programming and is called adaptive/approximate dynamic programming (ADP) within the control community. This paper reviews recent ...