Post by Naitariss in General Discussion

Hello, how were you able to make the AI respond faster and speed up its response generation?

By streaming the AI response as it is generated, instead of waiting for the AI to finish typing out everything.

Could you write a small tutorial on how to do this with a local AI? I would be very grateful.
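
For anyone wondering what streaming looks like with a local model, here is a minimal sketch, not the original poster's actual code. It assumes the local AI is served by Ollama on its default address (http://localhost:11434) and that a model called "llama3" has been pulled; both are assumptions, so swap in whatever backend and model you really use. With streaming enabled, the server sends the reply as one small JSON object per line, and each piece is printed as soon as it arrives. The function names `iter_tokens` and `stream_reply` are made up for this example.

```python
import json
import urllib.request
from typing import Iterable, Iterator

# Assumption: a local Ollama server on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def iter_tokens(lines: Iterable[bytes]) -> Iterator[str]:
    """Yield text fragments from a newline-delimited JSON stream."""
    for raw in lines:
        raw = raw.strip()
        if not raw:
            continue  # skip blank keep-alive lines
        chunk = json.loads(raw)
        text = chunk.get("response", "")
        if text:
            yield text
        if chunk.get("done"):
            break  # the server marks the final chunk with "done": true


def stream_reply(prompt: str, model: str = "llama3") -> str:
    """Print the reply piece by piece and return the full text.

    The model name "llama3" is a placeholder, not something from the thread.
    """
    payload = json.dumps(
        {"model": model, "prompt": prompt, "stream": True}
    ).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    parts = []
    with urllib.request.urlopen(request) as response:
        # Iterating over the response yields one line at a time as it arrives.
        for token in iter_tokens(response):
            print(token, end="", flush=True)  # show text immediately
            parts.append(token)
    print()
    return "".join(parts)
```

Calling `stream_reply("Say hello in one sentence.")` with the server running should make the text appear gradually instead of all at once. The model does not generate any faster, but the user starts reading after the first fragment, which is why it feels much more responsive.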