This course discusses the multi armed bandit problem and various strategies to solve it, introducing the concepts of state-rewards functions in RL and how to solve it using simple strategies. This is part of the Reinforcement Learning Workshop learning path. Complete this path to learn how to use reinforcement learning in a variety of ML and AI problems.
As part of the Reinforcement Learning Workshop path, you will know Python as well as have knowledge of basic AI and machine learning principles, as well as the basics of reinforcement learning. You will also be able to use TensorFlow 2 and OpenAI Gym, and have an understanding of Monte Carlo methods.