25 entries were submitted between 2022-11-11 17:00:00 and 2022-11-13 13:15:00. 217 ratings were given to 24 entries (96.0%) between 2022-11-13 13:15:00 and 2022-11-13 16:30:00. The average number of ratings per entry was 8.7.

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Generality | #1 | 3.444 | 3.444 |
| ML Safety | #4 | 3.111 | 3.111 |
| Novelty | #10 | 3.111 | 3.111 |
| Interpretability | #10 | 3.222 | 3.222 |
| Reproducibility | #10 | 3.778 | 3.778 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Judge's choice | #1 | n/a | n/a |
| Generality | #1 | 3.444 | 3.444 |
| Interpretability | #3 | 3.778 | 3.778 |
| ML Safety | #4 | 3.111 | 3.111 |
| Novelty | #9 | 3.222 | 3.222 |
| Reproducibility | #17 | 3.222 | 3.222 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| ML Safety | #1 | 3.778 | 3.778 |
| Interpretability | #1 | 4.222 | 4.222 |
| Generality | #1 | 3.444 | 3.444 |
| Novelty | #2 | 3.778 | 3.778 |
| Reproducibility | #8 | 3.889 | 3.889 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Novelty | #1 | 3.889 | 3.889 |
| ML Safety | #2 | 3.222 | 3.222 |
| Generality | #4 | 3.222 | 3.222 |
| Interpretability | #5 | 3.556 | 3.556 |
| Reproducibility | #11 | 3.667 | 3.667 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Judge's choice | #3 | n/a | n/a |
| Generality | #5 | 3.064 | 3.250 |
| Reproducibility | #6 | 4.125 | 4.375 |
| Novelty | #6 | 3.300 | 3.500 |
| Interpretability | #6 | 3.536 | 3.750 |
| ML Safety | #8 | 2.946 | 3.125 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Novelty | #4 | 3.545 | 3.545 |
| Generality | #6 | 3.000 | 3.000 |
| Reproducibility | #9 | 3.818 | 3.818 |
| Interpretability | #13 | 3.000 | 3.000 |
| ML Safety | #13 | 2.545 | 2.545 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Novelty | #6 | 3.300 | 3.500 |
| Generality | #7 | 2.946 | 3.125 |
| Interpretability | #8 | 3.300 | 3.500 |
| ML Safety | #11 | 2.593 | 2.750 |
| Reproducibility | #19 | 3.182 | 3.375 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Generality | #8 | 2.898 | 3.286 |
| Interpretability | #15 | 2.772 | 3.143 |
| ML Safety | #18 | 2.142 | 2.429 |
| Novelty | #19 | 2.520 | 2.857 |
| Reproducibility | #20 | 3.024 | 3.429 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Reproducibility | #4 | 4.243 | 4.500 |
| Generality | #9 | 2.828 | 3.000 |
| Interpretability | #11 | 3.182 | 3.375 |
| Novelty | #14 | 2.711 | 2.875 |
| ML Safety | #16 | 2.239 | 2.375 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Interpretability | #1 | 4.222 | 4.222 |
| Reproducibility | #1 | 4.556 | 4.556 |
| Judge's choice | #2 | n/a | n/a |
| Generality | #10 | 2.778 | 2.778 |
| ML Safety | #10 | 2.778 | 2.778 |
| Novelty | #11 | 2.889 | 2.889 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| ML Safety | #11 | 2.593 | 2.750 |
| Generality | #11 | 2.711 | 2.875 |
| Reproducibility | #13 | 3.536 | 3.750 |
| Interpretability | #17 | 2.593 | 2.750 |
| Novelty | #22 | 2.239 | 2.375 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Interpretability | #7 | 3.300 | 3.300 |
| Generality | #12 | 2.600 | 2.600 |
| ML Safety | #17 | 2.200 | 2.200 |
| Novelty | #20 | 2.500 | 2.500 |
| Reproducibility | #24 | 2.400 | 2.400 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Interpretability | #8 | 3.300 | 3.500 |
| Reproducibility | #12 | 3.653 | 3.875 |
| Generality | #13 | 2.593 | 2.750 |
| ML Safety | #14 | 2.475 | 2.625 |
| Novelty | #17 | 2.593 | 2.750 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Reproducibility | #2 | 4.364 | 4.364 |
| Interpretability | #4 | 3.636 | 3.636 |
| ML Safety | #6 | 3.000 | 3.000 |
| Novelty | #8 | 3.273 | 3.273 |
| Generality | #14 | 2.545 | 2.545 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Novelty | #5 | 3.364 | 3.364 |
| Reproducibility | #7 | 4.091 | 4.091 |
| Generality | #14 | 2.545 | 2.545 |
| ML Safety | #15 | 2.273 | 2.273 |
| Interpretability | #16 | 2.727 | 2.727 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Reproducibility | #3 | 4.300 | 4.300 |
| ML Safety | #9 | 2.900 | 2.900 |
| Generality | #16 | 2.500 | 2.500 |
| Novelty | #16 | 2.700 | 2.700 |
| Interpretability | #20 | 2.200 | 2.200 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Interpretability | #11 | 3.182 | 3.375 |
| Reproducibility | #13 | 3.536 | 3.750 |
| Novelty | #14 | 2.711 | 2.875 |
| Generality | #17 | 2.475 | 2.625 |
| ML Safety | #22 | 1.650 | 1.750 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Reproducibility | #17 | 3.222 | 3.222 |
| Generality | #18 | 2.444 | 2.444 |
| Novelty | #18 | 2.556 | 2.556 |
| ML Safety | #21 | 1.889 | 1.889 |
| Interpretability | #23 | 2.000 | 2.000 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| Novelty | #3 | 3.600 | 3.600 |
| ML Safety | #6 | 3.000 | 3.000 |
| Generality | #19 | 2.300 | 2.300 |
| Reproducibility | #21 | 3.000 | 3.000 |
| Interpretability | #24 | 1.900 | 1.900 |

| Criteria | Rank | Score* | Raw Score |
| --- | --- | --- | --- |
| ML Safety | #3 | 3.214 | 3.214 |
| Judge's choice | #4 | n/a | n/a |
| Reproducibility | #5 | 4.214 | 4.214 |
| Novelty | #13 | 2.857 | 2.857 |
| Interpretability | #14 | 2.929 | 2.929 |
| Generality | #20 | 2.286 | 2.286 |
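
The tables above do not say how the Score* column is derived from Raw Score. The sketch below tests one possible explanation and should be read as an assumption, not as the jam's documented method: that the raw score is scaled by `sqrt(min(n, m) / m)`, where `n` is the number of ratings an entry received and `m` is a reference count. The function name `adjusted_score`, the reference count of 9, and the per-entry rating counts (7 or 8, guessed from the raw-score fractions) are all hypothetical. Under those assumptions, the computed values agree with the tabulated Score* values to three decimals.

```python
import math


def adjusted_score(raw: float, n_ratings: int, reference: int) -> float:
    """Hypothetical adjustment: scale raw by sqrt(min(n, reference) / reference)."""
    capped = min(n_ratings, reference)
    return raw * math.sqrt(capped / reference)


# Assumption: reference rating count; not stated anywhere in the source.
ASSUMED_REFERENCE = 9

# (raw score, tabulated Score*, ASSUMED number of ratings) for a few rows above.
# Raw and Score* values are copied from the tables; rating counts are guesses.
ROWS = [
    (3.250, 3.064, 8),
    (4.375, 4.125, 8),
    (4.500, 4.243, 8),
    (3.286, 2.898, 7),
    (3.429, 3.024, 7),
    (3.444, 3.444, 9),  # rows where Score* equals Raw Score need n >= reference
]

for raw, tabulated, n in ROWS:
    computed = adjusted_score(raw, n, ASSUMED_REFERENCE)
    matches = abs(computed - tabulated) < 1e-3
    print(f"raw={raw:.3f} n={n} computed={computed:.3f} tabulated={tabulated:.3f} match={matches}")
```

If the assumed formula is right, entries with at least the reference number of ratings keep their raw score, which would explain why Score* and Raw Score are identical in most tables and differ only in some.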