Skip to main content

Indie game storeFree gamesFun gamesHorror games
Game developmentAssetsComics
SalesBundles
Jobs
TagsGame Engines

Question about performance before I buy this. I have a 5070ti, 16GB vram, (also a 7950x and 96GB dram), should be plenty to run the normal 8GB model. But in the demo at least it takes a little under a minute to locally gen per conversation reply, and that's if nothing else happens like moving around or giving items. If it's trying to do anything like that then it's much longer. Is that normal you think, or might it be because I'm running it on Linux and it's losing performance somewhere? I could get used to it, but it's not the most fun thing to wait around.

(+1)

For Nemo, it should take no more than 2-3 seconds on a 5070 ti, but I actually worked on porting the game over to Linux today (which also uses a linux native backend for the LLM), so try the native build.

Thanks for working on a native version! Unfortunately, it fails on loading the model. Maybe it's trying to load koboldcpp.exe rather than koboldcpp-linux? Or it could be that you're using windows-based variables for the command line. I can't see any error so I checked diagnostic.bat but that doesn't exactly work. 

Instead, I edited diagnostic.bat to turn it into a linux shell file, the contents of which I've pasted below. Linux shell scripts don't have anything like a goto, really, so I just put the questions into while true loops. This does load koboldcpp so I think it's related to the command line arguments you're using.


#! /bin/bash

echo "Welcome to the diagnostic tool. This tool will launch Nemo via koboldcpp-linux with the same arguments as the game, while letting you view the error message in case of a crash."

#GPU_TYPE

while true

do

    read -p "Are you using an NVIDIA or AMD GPU? (Type \"NVIDIA\" or \"AMD\"): " GPU_TYPE

    if [ $GPU_TYPE == "NVIDIA" -o $GPU_TYPE == "AMD" ]

    then

    break

    else

    echo "Invalid input. Please type \"NVIDIA\" or \"AMD\"."

    fi

done

#VRAM_QUESTION

while true

do

    read -p "Do you have 6, 8, or more than 10 GB of VRAM? (Type \"6\", \"8\", or \"10+\"): " VRAM

    if [ $VRAM == "6" ]

    then

    GPULAYERS=17

    break

    elif [ $VRAM == "8" ]

    then

    GPULAYERS=27

    break

    elif [ $VRAM == "10+" ]

    then

    GPULAYERS=43

    break

    else

    echo "Invalid input. Please type \"6\", \"8\", or \"10+\"."

    fi

done

#LAUNCH_KOBOLDCPP

echo "Launching KoboldCPP with the specified settings..."

if [ $GPU_TYPE == "NVIDIA" ]

then

GPU_ARG="--usecublas"

elif [ $GPU_TYPE == "AMD" ]

then

GPU_ARG="--usevulkan"

fi

./koboldcpp-linux --model "Nemo.gguf" $GPU_ARG --gpulayers $GPULAYERS --quiet --multiuser 100 --contextsize 4096 --skiplauncher

(1 edit)

Well it definitely works on Linux Mint 22.2, but the game uses Linux's popen() function instead of the system() function (which i believe the terminal uses) to spawn the process, so I'll add that as a fallback tomorrow. What distro are you using?

Diagnostic.bat is included by mistake.

I'm on arch so it's likely I don't have a lot of common things installed lol

After trying this on Arch, it turned out to be a simple problem caused by Arch being more strict with file permissions.

It can be solved via chmod +x koboldcpp-linux, but I've also uploaded a new version where this happens automatically.

works great, thank you! You got a customer :)