I'm not sure I understand the situation being simulated here. One of the submissions is explanatory text from the student about how they used AI. Why would we "flag" that for misconduct, even if it's AI-generated? Or is it supposed to be from a student essay, despite the content?
Also, the "student profile" includes information such as "they ran the essay through an AI tool to make it sound more academic." How can we already know that? And if we already know the diction and grammar are from an LLM, what are we trying to decide? Whether the content is original to the student or not?