Pardon me if this has already been addressed elsewhere.
I'm using Release 1.6.6b with Gemma Sparse.
Is there a way to reduce the context size to time out requests less often?
I'd like to go from 4096 to 3072.
Thanks again for this great game and for all your maintenance of it, including the community involvement.