Posted May 24, 2025 by Godnoken
G'day fellas!
0.4.2 will be a massive improvement for text capture in many scenarios, returning more accurate text, more text overall and better algorithms to sort out paragraph splitting and such.
Some results of these changes are displayed below;
Paragraph splitting first! These changes are extremely important to accurately translate. As you can see, the text was often split up even when no new paragraph or sentence started. This makes the text incomprehensible for the translators.
Next up, improvements to padding have been made. This has in some cases massively increased the amount of letters returning at the start & end of text.
And lastly, HUGE speed improvement for the RapidOCR captures! Text captures are nearly x3 (!) as fast.
Before (1.54 seconds~)
After (580 milliseconds~)
Before (1.4 seconds~)
After (495 milliseconds~)