Thank you! I actually spent quite a bit of time trying to implement Qwen3-TTS (which is genuinely impressive), but it needs a high-end NVIDIA GPU to reach acceptable latency for real-time use. On CPU, it takes around 30 seconds to generate just 3 seconds of audio, roughly 10x slower than real time. I'll hold off on adding it until a DirectML streaming solution becomes available.