Reinforcement learning

Optimism-Based Adaptive Regulation of Linear-Quadratic Systems

The main challenge in adaptive regulation of linear-quadratic systems is the tradeoff between identification and control. An adaptive policy must address both the estimation of the unknown dynamics parameters (exploration) and the regulation …