Tài liệu tham khảo |
Loại |
Chi tiết |
4. Dinah Rosenberg, Eilon Solanand, Nicolas Vieille – The annals of statistics Vol.30, No.4 (2002) – Black well optimality in Markov decision processes with partial observation |
Sách, tạp chí |
Tiêu đề: |
optimality in Markov decision processes with partial observation |
Tác giả: |
Dinah Rosenberg, Eilon Solanand, Nicolas Vieille |
Nhà XB: |
The annals of statistics |
Năm: |
2002 |
|
10. Richard S. Sutton, Andrew G. Barto – A Bradford book, MIT Press (1998) - Reinforcement learning: An Introduction |
Sách, tạp chí |
Tiêu đề: |
Reinforcement learning: An Introduction |
Tác giả: |
Richard S. Sutton, Andrew G. Barto |
Nhà XB: |
MIT Press |
Năm: |
1998 |
|
11. Relu Patrascu, Pascal Poupart, Dale Schuurmans, Craig Boutilier, Carlos Guestrin – Eighteenth national conference on Artificial intelligence, Canada (2002) - Greedy linear value-approximation for factored Markov decision processes |
Sách, tạp chí |
Tiêu đề: |
Greedy linear value-approximation for factored Markov decision processes |
Tác giả: |
Relu Patrascu, Pascal Poupart, Dale Schuurmans, Craig Boutilier, Carlos Guestrin |
Nhà XB: |
Eighteenth national conference on Artificial intelligence |
Năm: |
2002 |
|
13. Claude F. Touzet - 1999 International joint conference on neural networks (1999) - Neural Networks and Q-Learning for Robotics |
Sách, tạp chí |
Tiêu đề: |
Neural Networks and Q-Learning for Robotics |
Tác giả: |
Claude F. Touzet |
Nhà XB: |
1999 International joint conference on neural networks |
Năm: |
1999 |
|
15. Carlos Guestrin, Geoffrey Gordon – Proc. of the 18th Conference on uncertainty in artificial intelligence, pp. 197-206, (2002) - Distributed planning in hierarchical factored MDPs |
Sách, tạp chí |
Tiêu đề: |
Distributed planning in hierarchical factored MDPs |
Tác giả: |
Carlos Guestrin, Geoffrey Gordon |
Nhà XB: |
Proc. of the 18th Conference on uncertainty in artificial intelligence |
Năm: |
2002 |
|
18. Barto, A. G., Sutton, R. S., and Brouwer, P. S. - IEEE Transactions on systems, san, and cybernetics, 40:201-211 (1981) - Associative search network: A reinforcement learning associative memory |
Sách, tạp chí |
Tiêu đề: |
Associative search network: A reinforcement learning associative memory |
Tác giả: |
A. G. Barto, R. S. Sutton, P. S. Brouwer |
Nhà XB: |
IEEE Transactions on Systems, Man, and Cybernetics |
Năm: |
1981 |
|
19. Michael Bowling , Alborz Geramifard , David Wingate – Proc. of the 7th international joint conference on Autonomous agents and multiagent systems (2008) - Sigma point policy iteration |
Sách, tạp chí |
Tiêu đề: |
Proc. of the 7th international joint conference on Autonomous agents and multiagent systems |
Tác giả: |
Michael Bowling, Alborz Geramifard, David Wingate |
Nhà XB: |
Sigma |
Năm: |
2008 |
|
20. Daphne Koller, Ronald Parr - Proceedings of the 16th International joint conference on artificial intelligence (1999) - Computing factored value functions for policies in structured MDPs |
Sách, tạp chí |
Tiêu đề: |
Computing factored value functions for policies in structured MDPs |
Tác giả: |
Daphne Koller, Ronald Parr |
Nhà XB: |
Proceedings of the 16th International joint conference on artificial intelligence |
Năm: |
1999 |
|
22. Carlos Guestrin , Daphne Koller, Ronald Parr, shobha Venkataraman – Journal of Artificial intelligence research (2003) - Efficient solution algorithms for factored MDPs |
Sách, tạp chí |
Tiêu đề: |
Efficient solution algorithms for factored MDPs |
Tác giả: |
Carlos Guestrin, Daphne Koller, Ronald Parr, Shobha Venkataraman |
Nhà XB: |
Journal of Artificial Intelligence Research |
Năm: |
2003 |
|
24. Richard Bellman - Information and Control, 1, 228-239 (1958) - Dynamic programming and stochastic control processes |
Sách, tạp chí |
Tiêu đề: |
Dynamic programming and stochastic control processes |
Tác giả: |
Richard Bellman |
Nhà XB: |
Information and Control |
Năm: |
1958 |
|
33. John J. Leonard, Hans Jacob S. Feder - Robotics Research: the Ninth international symposium, Springer-Verlag (2000) - A computationally efficient method for large- scale concurrent mapping and localization |
Sách, tạp chí |
Tiêu đề: |
Robotics Research: the Ninth international symposium |
Tác giả: |
John J. Leonard, Hans Jacob S. Feder |
Nhà XB: |
Springer-Verlag |
Năm: |
2000 |
|
34. Yamauchi B. – Proc. of the second international conference on autonomous agents, 47-53 (1998) - Frontier-based exploration using multiple robots 35. Kazunori Iwata , Nobuhiro Ito , Koichiro Yamauchi , and Naohiro Ishii - |
Sách, tạp chí |
Tiêu đề: |
Frontier-based exploration using multiple robots |
Tác giả: |
Kazunori Iwata, Nobuhiro Ito, Koichiro Yamauchi, Naohiro Ishii |
Nhà XB: |
Proc. of the second international conference on autonomous agents |
Năm: |
1998 |
|
37. Joono Cheong, Wan K Chung, Youngil Youm - IEEE Transactions on robotics and automation vol. 20 (2004) - Inverse kinematics of multilink flexible |
Sách, tạp chí |
Tiêu đề: |
Inverse kinematics of multilink flexible |
Tác giả: |
Joono Cheong, Wan K Chung, Youngil Youm |
Nhà XB: |
IEEE Transactions on Robotics and Automation |
Năm: |
2004 |
|
38. Karan Singh, Eugene Fiume – Proc. of the 25th annual conference on Computer graphics and interactive techniques, 405 - 414 (1998) Wires: A Geometric Deformation Technique |
Sách, tạp chí |
Tiêu đề: |
Wires: A Geometric Deformation Technique |
Tác giả: |
Karan Singh, Eugene Fiume |
Nhà XB: |
Proc. of the 25th annual conference on Computer graphics and interactive techniques |
Năm: |
1998 |
|
1. A. Jasra, C. C. Holmes and D. A. Stephens - Statistical Science, Vol. 20, No. 1 (Feb., 2005), pp. 50-67 - Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modeling |
Khác |
|
2. JesseHoey, Pascal Poupart, Jennifer Boger, Craig Boutilier, Geoff Fernie, Alex Mihailidis – Proc. of the 19th international joint conference on Artificial intelligence (2005) - A decision-theoretic approach to task assistance for persons with dementia |
Khác |
|
3. Craig Boutilier, Relu Patrascu, Pascal Poupart, Dale Schuurmans – Proc. of the 19th international joint conference on Artificial intelligence (2005) - Regret-based utility elicitation in constraint-based decision problems |
Khác |
|
5. Omar Zia Khan, Pascal Poupart, James P. Black – Proc. of the 19th International conference on Automated planning and scheduling (2009) - Minimal sufficient explanations for factored Markov decision processes |
Khác |
|
6. Abhijit Gosavi – INFORMS Journal on Computing Vol. 21 , Issue 2 (2009) - Reinforcement learning: A Tutorial survey and recent advances |
Khác |
|
7. Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore – Journal of artificial intelligence research Vol. 4 (1996) - Reinforcement learning: A survey (1996) |
Khác |
|