I started using OpenRouter and everything has been fine for me. My only question is whether OpenRouter is paid or not, because some credits are being used up while I use it.
Check your logs on OpenRouter; they'll tell you how much you were charged per request. IIRC there are free models, but I'm not sure whether OpenRouter charges you to use those models through them.
Thank you. I just started using this model, and it doesn't seem to be very expensive so far: only $0.25 for half a day of playing. Yes, sometimes it freezes, but it's playable.
What's the response time like? It usually takes 30-60 seconds for messages to generate with my specs, so if it's faster than that, I'll definitely start using it.
It shouldn't take more than a few seconds with your specs. Which model are you using? Is your GPU AMD or Nvidia? Are you using the new version 1.6.6b with optimized memory usage?
I imagine Gemma-4-Sparse should be very fast on any Vulkan GPU. If you're using Qwen 3.5 with a version before 1.6.6b, it might be slow because of that.