15 entries were submitted between 2023-01-20 16:00:00 and 2023-01-23 03:15:00. 52 ratings were given to 15 entries (100.0%) between 2023-01-23 03:15:00 and 2023-01-25 14:00:00. The average number of ratings per game was 3.5 and the median was .
| Criteria | Rank | Score* | Raw Score |
| Judge's choice | #1 | n/a | n/a |
| Reproducibility | #1 | 4.400 | 4.400 |
| Mechanistic interpretability | #2 | 4.400 | 4.400 |
| Novelty | #3 | 4.200 | 4.200 |
| Generality | #11 | 2.800 | 2.800 |
| ML Safety | #11 | 2.800 | 2.800 |
| Criteria | Rank | Score* | Raw Score |
| Mechanistic interpretability | #1 | 4.571 | 4.571 |
| Judge's choice | #2 | n/a | n/a |
| Generality | #4 | 3.286 | 3.286 |
| ML Safety | #4 | 3.429 | 3.429 |
| Reproducibility | #4 | 4.143 | 4.143 |
| Novelty | #8 | 3.000 | 3.000 |
| Criteria | Rank | Score* | Raw Score |
| Judge's choice | #3 | n/a | n/a |
| Reproducibility | #3 | 4.250 | 4.250 |
| Mechanistic interpretability | #5 | 4.250 | 4.250 |
| Generality | #5 | 3.000 | 3.000 |
| Novelty | #8 | 3.000 | 3.000 |
| ML Safety | #12 | 2.750 | 2.750 |
| Criteria | Rank | Score* | Raw Score |
| Judge's choice | #4 | n/a | n/a |
| Novelty | #4 | 3.750 | 3.750 |
| Reproducibility | #5 | 4.000 | 4.000 |
| Generality | #5 | 3.000 | 3.000 |
| ML Safety | #6 | 3.250 | 3.250 |
| Mechanistic interpretability | #9 | 3.750 | 3.750 |