Skip to main content

Indie game storeFree gamesFun gamesHorror games
Game developmentAssetsComics
SalesBundles
Jobs
TagsGame Engines
(6 edits)

3.5x speed upgrade with Nvidia .dll replacement working for InvokeAI  Standalone 3.2.0 and a 2x speed increase for standalone 2.3.2 and the regular install of 2.3.4. 

I've replaced the  CUDA  files for invokeAI 2.3.0 and it makes it 3.5x faster with my Nvidia 4090. I'm running InvokeAI 2.3.0 at 28 it/s just by replacing the CUDA .dll files alone. However, I cant seem to get the latest to be as fast by replacing the .dll files but at least it's still 2x faster on InvokeAI 2.3.2 and also the normal install of the latest 2.3.4 with .dll replacement. Not sure why I cant get it going as fast as 2.3.0 at 3.5x the speed. For in-painting I'm still using Invoke 2.3.0 with the upgraded .dll's and loving the 3.5x speed. Any idea why the versions after 2.3.0 are not performing as fast as InvokeAI 2.3.0 with the latest CUDA files? anyone know how to get more speed out of  Invoke 2.3.2-2.3.4?

I Downloaded the Nvidia zip file here and used the .dll files found in the bin folder:

https://developer.download.nvidia.com/compute/redist/cudnn/v8.7.0/local_installers/11.8/

Then I replace the old .dll files with the new ones into the following: InvokeAI\venv\Lib\site-packages\torch\lib

I'm testing this for sure. Where did you find version 2.3.0 

The improvement i'm seeing is by 1.5 seconds. Sometimes slower depending on the CUDA dlls version. 

I figured it out. The improvement you are able to see is because nvidia fixed the driver compatibility issue with CUDA 11.8 on the 4k series of cards. Everyone using a previous generation is already running at maximum performance.