programming
AI-udemy
Starting

What we are going to learn in this section

  • What is Reinforcement Learning
  • The Bellman Equation
  • The Plan
  • Markov Decision Process (MDP)
  • Policy vs. Plan
  • Adding a "Living Penalty"
  • Q-Learning Intuition
  • Temporal Difference
  • Q-Learning Visualization