Does the image generation try to create something based off of the current things happening in the chat, or just of the AI character?
It gets the Character and the last Message as Input, but there will be more settings for that later. We know, that it doesn't pick up the current situation much.