PILCO policy search framework 0.9

marc deisenroth, andrew mchutchon, joe hall, carl edward rasmussen — Fri, 27 Sep 2013 12:45:12 -0000

This software package allows the user to easily apply the powerful pilco framework to a wide range of continuous-valued RL and control problems, requiring only a small amount of problem-specific extra coding. The package includes five example scenarios as demonstrations of what is possible and to help the user apply the package to their own problems. The high-level steps of the pilco algorithm are the following: Learn a Gaussian process (GP) model of the system dynamics, perform deterministic approximate inference for policy evaluation, update the policy parameters using exact gradient information, apply the learned controller to the system. The software package provides an interface, which allows for setting up novel tasks without the need to be familiar with the intricate details of model learning, policy evaluation and improvement.

Comment by Marc Deisenroth on 2013-10-01 15:48

Marc Deisenroth — Tue, 01 Oct 2013 15:48:14 -0000

added thumbnail

mloss.org PILCO policy search framework

PILCO policy search framework 0.9

Comment by Marc Deisenroth on 2013-10-01 15:48