That worked, 6-8 min audio is not garbled, with either Owen3 or OmniVoice (haven't tried the others yet.) Now, all we need is 48k :)
I'd like to see it (but of course not at the cost of more pronunciation errors.) If and when it can be done seamlessly, that would be great. The trade-off of longer processing times, would be acceptable. I suspect that 48khz would fit better with many users' workflow (DAW templates, sharing with collaborators, etc.) Thanks for considering it!