Double checked, it still makes calls to unregistered port that does not have a backend to check for heartbeat of inference server, and still times out - do you not use streaming responce?
And there is a lot of errors...
Okay i'll look at it sometime this week. Thanks for the heads up! Btw most of those errors are benign (mostly debug calls accidentally left in)