Yeah, okay. My NVIDIA 5080 is currently using all 16 GB of its vram when I try to host an LLM locally. Not sure what I'm doing wrong, but it's painfully slow and borderline unplayable because dialogues usually take 5+ minutes to generate per response
LtFuzz
18
Posts
2
Following
A member registered Jul 31, 2020
Recent community posts
(WIP) Hailey's Treasure Adventure comments · Replied to Crimson Bird in (WIP) Hailey's Treasure Adventure comments