mloss.org PILCO policy search frameworkhttp://mloss.orgUpdates and additions to PILCO policy search frameworkenTue, 01 Oct 2013 15:48:14 -0000PILCO policy search framework 0.9http://mloss.org/software/view/508/<html><p>This software package allows the user to easily apply the powerful pilco framework to a wide range of continuous-valued RL and control problems, requiring only a small amount of problem-specific extra coding. The package includes five example scenarios as demonstrations of what is possible and to help the user apply the package to their own problems. The high-level steps of the pilco algorithm are the following: Learn a Gaussian process (GP) model of the system dynamics, perform deterministic approximate inference for policy evaluation, update the policy parameters using exact gradient information, apply the learned controller to the system. The software package provides an interface, which allows for setting up novel tasks without the need to be familiar with the intricate details of model learning, policy evaluation and improvement. </p></html>marc deisenroth, andrew mchutchon, joe hall, carl edward rasmussenFri, 27 Sep 2013 12:45:12 -0000http://mloss.org/software/rss/comments/508http://mloss.org/software/view/508/controlreinforcement learningapproximate inferencegaussian processespolicy search<b>Comment by Marc Deisenroth on 2013-10-01 15:48</b>http://mloss.org/comments/cr/14/508/#c695<p>added thumbnail</p> Marc DeisenrothTue, 01 Oct 2013 15:48:14 -0000http://mloss.org/comments/cr/14/508/#c695