nr |
titel |
auteur |
tijdschrift |
jaar |
jaarg. |
afl. |
pagina('s) |
type |
1 |
Adaptive dynamic programming for online solution of a zero-sum differential game
|
Vrabie, Draguna |
|
2011 |
9 |
3 |
p. 353-360 |
artikel |
2 |
A model-based approximate λ-policy iteration approach to online evasive path planning and the video game Ms. Pac-Man
|
Foderaro, Greg |
|
2011 |
9 |
3 |
p. 391-399 |
artikel |
3 |
Approximate dynamic programming solutions with a single network adaptive critic for a class of nonlinear systems
|
Ding, Jie |
|
2011 |
9 |
3 |
p. 370-380 |
artikel |
4 |
Approximate policy iteration: a survey and some new methods
|
Bertsekas, Dimitri P. |
|
2011 |
9 |
3 |
p. 310-335 |
artikel |
5 |
A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
|
Powell, Warren B. |
|
2011 |
9 |
3 |
p. 336-352 |
artikel |
6 |
Asymptotic tracking by a reinforcement learning-based adaptive critic controller
|
Bhasin, Shubhendu |
|
2011 |
9 |
3 |
p. 400-409 |
artikel |
7 |
Finite horizon optimal control of discrete-time nonlinear systems with unfixed initial state using adaptive dynamic programming
|
Wei, Qinglai |
|
2011 |
9 |
3 |
p. 381-390 |
artikel |
8 |
Hierarchical state-abstracted and socially augmented Q-Learning for reducing complexity in agent-based learning
|
Sun, Xueqing |
|
2011 |
9 |
3 |
p. 440-450 |
artikel |
9 |
Moving least-squares approximations for linearly-solvable stochastic optimal control problems
|
Zhong, Mingyuan |
|
2011 |
9 |
3 |
p. 451-463 |
artikel |
10 |
Multiresolution state-space discretization for Q-Learning with pseudorandomized discretization
|
Lampton, Amanda |
|
2011 |
9 |
3 |
p. 431-439 |
artikel |
11 |
Online optimal control of nonlinear discrete-time systems using approximate dynamic programming
|
Dierks, Travis |
|
2011 |
9 |
3 |
p. 361-369 |
artikel |
12 |
Semi-Markov adaptive critic heuristics with application to airline revenue management
|
Kulkarni, Ketaki |
|
2011 |
9 |
3 |
p. 421-430 |
artikel |
13 |
Special issue on approximate dynamic programming and reinforcement learning
|
Ferrari, Silvia |
|
2011 |
9 |
3 |
p. 309 |
artikel |
14 |
Stable reinforcement learning with recurrent neural networks
|
Knight, James Nate |
|
2011 |
9 |
3 |
p. 410-420 |
artikel |