This Is Fine(-tuning): A benchmark testing LLMs robustness against bad fine-tuning data
Results
Criteria | Rank | Score* | Raw Score |
Safety | #1 | 3.667 | 3.667 |
Benchmark | #2 | 3.333 | 3.333 |
Generality | #4 | 2.333 | 2.333 |
Novelty | #4 | 2.667 | 2.667 |
Reproducibility | #5 | 2.667 | 2.667 |
Ranked from 3 ratings. Score is adjusted from raw score by the median number of ratings per game in the jam.
Judge feedback
Judge feedback is anonymous.
- An interesting approach to measuring adversarial data impacts! It’s probably hard to generalize this without creating a new benchmark per task, but thinking more about the general direction of performance falloff is strongly encouraged.
Where did you participate?
Delft
What are the full names of the participants?
Jan Wehner, Joep Storm, Tijmen van Graft, Jaouad Hidayat
What is your team name?
Alignment Avengers