Is the text to speech using Whisper?
Edit: NM, it is. So I've tested this and while I think it has great potential, there's an issue where it will not transcribe an entire sentence in Japanese on one line. You end up getting each character from the sentence on separate lines. If this could be fixed, I think this would be a very useful app.