The objective is for the instrument to choose actions that maximize the expected reward over a given amount of time. The cause will reach the goal much faster by following a good policy. So the goal in reinforcement learning is to learn the best policy. Government agencies responsible intuition commun https://ukdirectoryof.com/listings13189231/tout-sur-protection-anti-restriction