Reinforcement Learning and Dynamic Programming Using