Model-based Reinforcement Learning: A Survey (in English)
Aske Plaat
(Author)
·
Thomas M. Moerland
(Author)
·
Joost Broekens
(Author)
·
Now Publishers
· Paperback
Model-based Reinforcement Learning: A Survey (in English) - Moerland, Thomas M. ; Broekens, Joost ; Plaat, Aske
$ 168.190
$ 177.042
You save: $ 8.852
Origin: United States
(Import costs included in the price)
Ships from our warehouse between Monday, August 12 and Monday, August 26.
You will receive it anywhere in Argentina within 1 to 3 business days after shipment.
Review of the book "Model-based Reinforcement Learning: A Survey (in English)"
Sequential decision making, commonly formalized as Markov Decision Process (MDP) optimization, is an important challenge in artificial intelligence. Two key approaches to this problem are reinforcement learning (RL) and planning. This monograph surveys the integration of both fields, better known as model-based reinforcement learning. Model-based RL has two main steps: dynamics model learning and planning-learning integration. In this comprehensive survey of the topic, the authors first cover dynamics model learning, including challenges such as dealing with stochasticity, uncertainty, partial observability, and temporal abstraction. They then present a systematic categorization of planning-learning integration, covering where to start planning, what budgets to allocate to planning and real data collection, how to plan, and how to integrate planning into the learning and acting loop. In conclusion, the authors discuss implicit model-based RL as an end-to-end alternative for model learning and planning, and cover the potential benefits of model-based RL. Along the way, the authors draw connections to several related RL fields, including hierarchical RL and transfer learning. This monograph contains a broad conceptual overview of the combination of planning and learning for Markov Decision Process optimization. It provides a clear and complete introduction to the topic for students and researchers alike.