- Description:
Over the past decade, contextual bandit algorithms have gained popularity due to their effectiveness and flexibility in solving sequential decision problems, from online advertising and finance to clinical trial design and personalized medicine. At the same time, there are still surprisingly few options that enable researchers and practitioners to simulate and compare the wealth of new and existing bandit algorithms in a standardized way. To help close this gap between analytical research and empirical evaluation, the current paper introduces the object-oriented R package contextual: a user-friendly and, through its object-oriented design, easily extensible framework that facilitates parallelized comparison of contextual and context-free bandit policies through both simulation and offline analysis.
- Changes to previous version:
Major update: the Offline Bandit API has been overhauled and now makes use of R formulae. More demo R scripts added. New contextual bandits and policies. Bug fixes.
- Supported Operating Systems: Agnostic
- Data Formats: Agnostic
- Tags: Reinforcement Learning, Simulation, Data Generator, Context Aware Recommendation, Bandits, Comparisons