programming, AI-udemy

Starting

What we are going to learn in this section:
What is Reinforcement Learning
The Bellman Equation
The Plan
Markov Decision Process (MDP)
Policy vs. Plan
Adding a "Living Penalty"
Q-Learning Intuition
Temporal Difference
Q-Learning Visualization