We explored how a deep RL agent uses human-interpretable concepts to play Connect Four.
Building on the paper 'Acquisition of Chess Knowledge in AlphaZero' by DeepMind and Google Brain, we used TCAV (Testing with Concept Activation Vectors) to explore concept detection in an RL agent for Connect Four.
Our agent's architecture was inspired by AlphaZero, and the agent was trained using DeepMind's OpenSpiel library.
Our novelty lies in the decision to study Connect Four, which was solved with a knowledge-based approach in 1988. This means that, to some extent, we understand this game better than chess!
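
As a rough illustration of the TCAV idea mentioned above (not the project's actual code), the sketch below builds a concept activation vector (CAV) from layer activations and computes a TCAV score as the fraction of inputs whose output increases when the activation is nudged along the CAV. Everything here is an assumption made for illustration: the activations are synthetic 2-D vectors, `toy_value_head` is a hypothetical stand-in for the part of the network above the probed layer, the CAV is taken as the difference of class means rather than the normal of a trained linear classifier, and the directional derivative is approximated by finite differences instead of backpropagated gradients.

```python
import math
import random


def mean_vector(rows):
    """Column-wise mean of a list of equal-length vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def concept_activation_vector(concept_acts, random_acts):
    """Unit vector pointing from random activations toward concept activations.

    Simplification: difference of class means, used here in place of the
    normal vector of a trained linear classifier.
    """
    mu_c = mean_vector(concept_acts)
    mu_r = mean_vector(random_acts)
    diff = [c - r for c, r in zip(mu_c, mu_r)]
    norm = math.sqrt(sum(d * d for d in diff))
    return [d / norm for d in diff]


def directional_derivative(head, act, direction, eps=1e-4):
    """Finite-difference estimate of how `head` changes along `direction`."""
    shifted = [a + eps * d for a, d in zip(act, direction)]
    return (head(shifted) - head(act)) / eps


def tcav_score(head, acts, cav):
    """Fraction of activations with a positive derivative along the CAV."""
    positive = sum(1 for a in acts if directional_derivative(head, a, cav) > 0)
    return positive / len(acts)


def toy_value_head(act):
    """Hypothetical stand-in for the layers above the probed one."""
    return math.tanh(0.8 * act[0] - 0.1 * act[1])


# Synthetic activations: positions showing the concept shift the first unit.
rng = random.Random(0)
concept_acts = [[rng.gauss(2.0, 0.5), rng.gauss(0.0, 0.5)] for _ in range(50)]
random_acts = [[rng.gauss(0.0, 0.5), rng.gauss(0.0, 0.5)] for _ in range(50)]

cav = concept_activation_vector(concept_acts, random_acts)
score = tcav_score(toy_value_head, concept_acts + random_acts, cav)
print("CAV:", cav)
print("TCAV score:", score)
```

In a real setting the activations would come from a chosen layer of the trained agent on Connect Four positions labelled with or without a concept, and a score near 1 would suggest the output is consistently sensitive to that concept direction. The toy head is monotone in its input, so it yields the same sign of derivative for every activation; a real network would typically give a score strictly between 0 and 1.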