franciscoabenza published a project 1 year ago
A downloadable project.
It recently became public that ChatGPT could be induced to break its own rules if, under an alter ego, it is threatened with death (CNBN 2023). This made us wonder under which circumstances GPT-3 is capable of keeping a secret, and to what exte...
gmukobi published a tool 1 year ago
A downloadable tool.
Our project furthers progress on scalable oversight by automating the sandwiching paradigm. Bowman et al. (2022) pose the question of how humans can effectively prompt unreliable, superhuman AIs to answer ques...
SamuelKnoche published a project 1 year ago
A downloadable project.
Evaluation of Large Language Models in Cooperative Language Games. Samuel Knoche, Independent. Abstract: This report investigates the potential of cooperative language games as an evaluation tool for language models. Specifically, the investigat...
adamkhoja1 published a project 1 year ago
A downloadable project.
One aspect of scalable oversight is automated oversight. There have been some examples of using models to evaluate questions and model outputs; we would like to build an instantiation of this, specifically using factored cognition. We’d like an a...
aicam published a project 1 year ago
A downloadable project.
Machine learning, and deep learning in particular, is now used in many fields. One research area it has influenced considerably is physical modeling and computation. Laboratory experiments take days and months to...
Zaina Shaik published a tool 1 year ago
A downloadable tool.
Problem: Current language learning models cannot determine whether fashion brands are sustainable. Solution: Train a language learning model to accurately determine whether fashion brands are sustainable. AI Safety Topics: Moral Decision Ma...
Cudon published a project 1 year ago
A downloadable project.
A benchmark that tests the ability of models to reverse given strings.