Abstract
Adaptive critic methods have common roots as generalizations of dynamic programming for neural reinforcement learning approaches. Since they approximate the dynamic programming solutions, they are potentially suitable for learning in noisy, nonlinear and nonstationary environments. In this study, a novel probabilistic dual heuristic programming (DHP) based adaptive critic controller is proposed. Distinct to current approaches, the proposed probabilistic (DHP) adaptive critic method takes uncertainties of forward model and inverse controller into consideration. Therefore, it is suitable for deterministic and stochastic control problems characterized by functional uncertainty. Theoretical development of the proposed method is validated by analytically evaluating the correct value of the cost function which satisfies the Bellman equation in a linear quadratic control problem. The target value of the critic network is then calculated and shown to be equal to the analytically derived correct value.
Original language | English |
---|---|
Title of host publication | Proceedings of the fifth international conference on informatics in control, automation and robotics |
Subtitle of host publication | Funchal, Madeira, May 11 - 15, 2008. Robotics and automation |
Editors | Filipe Joaquim |
Place of Publication | (PT) |
Pages | 281-288 |
Number of pages | 8 |
Publication status | Published - 2008 |
Event | 5th International Conference on Informatics in Control, Automation and Robotics - Madeira, Portugal Duration: 11 May 2008 → 15 May 2008 |
Conference
Conference | 5th International Conference on Informatics in Control, Automation and Robotics |
---|---|
Abbreviated title | ICINCO 2008 |
Country/Territory | Portugal |
City | Madeira |
Period | 11/05/08 → 15/05/08 |
Keywords
- adaptive critic methods
- functional uncertainty
- stochastic control