Digital Library
14 results found
no. | title | author | journal | year | vol. | issue | page(s) | type
1 | Adaptive dynamic programming for online solution of a zero-sum differential game | Vrabie, Draguna | | 2011 | 9 | 3 | p. 353-360 | article
2 | A model-based approximate λ-policy iteration approach to online evasive path planning and the video game Ms. Pac-Man | Foderaro, Greg | | 2011 | 9 | 3 | p. 391-399 | article
3 | Approximate dynamic programming solutions with a single network adaptive critic for a class of nonlinear systems | Ding, Jie | | 2011 | 9 | 3 | p. 370-380 | article
4 | Approximate policy iteration: a survey and some new methods | Bertsekas, Dimitri P. | | 2011 | 9 | 3 | p. 310-335 | article
5 | A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications | Powell, Warren B. | | 2011 | 9 | 3 | p. 336-352 | article
6 | Asymptotic tracking by a reinforcement learning-based adaptive critic controller | Bhasin, Shubhendu | | 2011 | 9 | 3 | p. 400-409 | article
7 | Finite horizon optimal control of discrete-time nonlinear systems with unfixed initial state using adaptive dynamic programming | Wei, Qinglai | | 2011 | 9 | 3 | p. 381-390 | article
8 | Hierarchical state-abstracted and socially augmented Q-Learning for reducing complexity in agent-based learning | Sun, Xueqing | | 2011 | 9 | 3 | p. 440-450 | article
9 | Moving least-squares approximations for linearly-solvable stochastic optimal control problems | Zhong, Mingyuan | | 2011 | 9 | 3 | p. 451-463 | article
10 | Multiresolution state-space discretization for Q-Learning with pseudorandomized discretization | Lampton, Amanda | | 2011 | 9 | 3 | p. 431-439 | article
11 | Online optimal control of nonlinear discrete-time systems using approximate dynamic programming | Dierks, Travis | | 2011 | 9 | 3 | p. 361-369 | article
12 | Semi-Markov adaptive critic heuristics with application to airline revenue management | Kulkarni, Ketaki | | 2011 | 9 | 3 | p. 421-430 | article
13 | Special issue on approximate dynamic programming and reinforcement learning | Ferrari, Silvia | | 2011 | 9 | 3 | p. 309 | article
14 | Stable reinforcement learning with recurrent neural networks | Knight, James Nate | | 2011 | 9 | 3 | p. 410-420 | article
Koninklijke Bibliotheek - National Library of the Netherlands