nr |
titel |
auteur |
tijdschrift |
jaar |
jaarg. |
afl. |
pagina('s) |
type |
1 |
Approximate Gradient Methods in Policy-Space Optimization of Markov Reward Processes
|
Marbach, Peter |
|
2003 |
13 |
1-2 |
p. 111-148 |
artikel |
2 |
Foreword to the Learning, Optimization, and Decision Making in DEDS
|
Ho, Prof. Y. C. |
|
2003 |
13 |
1-2 |
p. 5 |
artikel |
3 |
From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning
|
Cao, Xi-Ren |
|
2003 |
13 |
1-2 |
p. 9-39 |
artikel |
4 |
Introduction to the Special Issue on Learning, Optimization, and Decision Making in DEDS
|
Cao, Xi-Ren |
|
2003 |
13 |
1-2 |
p. 7-8 |
artikel |
5 |
Least Squares Policy Evaluation Algorithms with Linear Function Approximation
|
NediĆ, A. |
|
2003 |
13 |
1-2 |
p. 79-110 |
artikel |
6 |
Performance Evaluation and Policy Selection in Multiclass Networks
|
Henderson, Shane G. |
|
2003 |
13 |
1-2 |
p. 149-189 |
artikel |
7 |
Recent Advances in Hierarchical Reinforcement Learning
|
Barto, Andrew G. |
|
2003 |
13 |
1-2 |
p. 41-77 |
artikel |