General: Very simple design with clear outputs. I like the 2x2x2 factorial design. The graphs are clearly explained.
Alignment: Shows that the framing of a question has a large effect on the model’s opinion. Leading questions are often part of normal interaction, and GPT-3 is clearly biased by them.
AI Psychology: Showcases a clear cognitive psychology experiment, newly applied to GPT-3. A very nice application of the theme of the jam.
Novelty: I have not seen this specific experiment before, though I doubt the result will surprise anyone much.
Generality: The dataset seems to represent the different cases fairly well through its verb-noun combinations.
Reproducibility: Very clear instructions on GitHub for replicating the experiment! I expect it to replicate, given the generality.