Hello, how were you able to increase the speed of responsiveness and response generation from AI?
By streaming the AI response instead of waiting for the AI to type out everything
Can you write a small tutorial? How to do this with local AI? I will be very grateful to you.