Post by skhanna94 in How long can be a text to speak?

Viewing post in How long can be a text to speak?

That worked, 6-8 min audio is not garbled, with either Owen3 or OmniVoice (haven't tried the others yet.) Now, all we need is 48k :)

Mortar Tribe71 days ago

Awesome! My record is generating an 11 hour long audiobook without major issues.

I did briefly experiment with a higher quality 48khz output for qwen3 but it introduced pronunciation errors so I removed it. Might take a look again if you think it's worth it.

skhanna9471 days ago

I'd like to see it (but of course not at the cost of more pronunciation errors.) If and when it can be done seamlessly, that would be great. The trade-off of longer processing times, would be acceptable. I suspect that 48khz would fit better with many users' workflow (DAW templates, sharing with collaborators, etc.) Thanks for considering it!

Mortar Tribe71 days ago

Good to know, I'll add it to my list

itch.io

Viewing post in How long can be a text to speak?