Hi, me again. So I've bee running the Github versions and in 0.0.228 I'm honestly not seeing any changes between runing default ollama, and ollama with the following environmental parameters:
OLLAMA_FLASH_ATTENTION=1,OLLAMA_KV_CACHE_TYPE=q8_0,OLLAMA_NO_KV_OFFLOAD=1
or
OLLAMA_FLASH_ATTENTION=1,OLLAMA_KV_CACHE_TYPE=q4_0,OLLAMA_NO_KV_OFFLOAD=1
that normal behavior or just ollama being it's usual shitty self...
Also, SD WebUI forge is kinda dead. WebUI is still in active development as is ComfyUI but the original forge is no longer maintained. I think there's a fork but yeah... for all the improvements it brought, the guy that made it moved on. I think he's the guy that made controllnet and is now working on video.