Reinforcement Learning Snake Game

Learning Process

The AI uses Q-learning, a model-free reinforcement learning algorithm. On each step it learns by:

1. Observing the current state (snake position, food position, body positions, danger directions)
2. Choosing an action (forward, left, or right) with an ε-greedy policy
3. Receiving a reward (eating food sooner earns a higher reward)
4. Updating its Q-table of state-action values
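
The loop above can be sketched in a few lines of Python. This is only an illustration of standard tabular Q-learning, not the game's actual code: the names choose_action and update, the learning rate ALPHA, and the discount factor GAMMA are assumptions, and the state is treated as any hashable value.

```python
import random
from collections import defaultdict

ACTIONS = ("forward", "left", "right")  # action names taken from the text

# Hypothetical hyperparameters; the page does not state the values it uses.
ALPHA = 0.1  # learning rate
GAMMA = 0.9  # discount factor

# Q-table: maps a hashable state to one value per action, all starting at 0.
q_table = defaultdict(lambda: {a: 0.0 for a in ACTIONS})

def choose_action(state, epsilon, rng=random):
    """ε-greedy: explore with probability epsilon, otherwise exploit."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    values = q_table[state]
    return max(ACTIONS, key=lambda a: values[a])

def update(state, action, reward, next_state, done=False):
    """Standard Q-learning update; returns the new Q(state, action)."""
    # A terminal step (e.g. a collision) has no future value to bootstrap from.
    best_next = 0.0 if done else max(q_table[next_state].values())
    target = reward + GAMMA * best_next
    q_table[state][action] += ALPHA * (target - q_table[state][action])
    return q_table[state][action]
```

With these assumed values, a single reward of 10 for moving forward from a fresh state raises that Q-value from 0 to 1.0, and a greedy choice (epsilon = 0) in the same state then picks "forward".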

Watch as the Q-values update and the exploration rate decreases over time!
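
One common way to make the exploration rate decrease is exponential decay with a floor. The sketch below assumes that schedule; the page does not say which one it uses, and the constants and the function name decayed_epsilon are hypothetical.

```python
# Hypothetical schedule constants; the page does not state its own values.
EPSILON_START = 1.0    # begin fully exploratory
EPSILON_MIN = 0.01     # never stop exploring entirely
EPSILON_DECAY = 0.995  # multiplicative decay applied once per episode

def decayed_epsilon(episode):
    """Exploration rate after `episode` completed episodes."""
    return max(EPSILON_MIN, EPSILON_START * EPSILON_DECAY ** episode)
```

Early episodes are then mostly random moves, while later ones increasingly follow the highest Q-values learned so far.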


Snake Body Map