Indie game storeFree gamesFun gamesHorror games
Game developmentAssetsComics
SalesBundles
Jobs
Tags

gmukobi published Automated Sandwiching: Efficient Self-Evaluations of Conversation-Based Scalable Oversight Techniques

gmukobi published a tool 1 year ago
A downloadable tool.
Our project furthers the progress of Scale Oversight through automation of the sandwiching paradigm. In the Bowman et al. (2022) paper, the question is presented of how humans can effectively prompt unreliable, superhuman AIs to answer ques...