Skip to main content

Indie game storeFree gamesFun gamesHorror games
Game developmentAssetsComics
SalesBundles
Jobs
TagsGame Engines

0.5.07 will be out soon. I'll let you know what was improved tomorrow, but it should be a substantial difference! 'Strean friendly' will still be broken though.

Thank you. I'll test it tomorrow if I don't forget. :)

(6 edits)

Hi,

I gave the newest version a try, using several different settings, and I made some videos with WGC, since you mentioned that Stream Friendly is still broken.

Here are the test results for the RapidOCR configurations:

  1. PP-OCRv5+Confidence Threshold 60+Box Threshold 0.3+Text Removal Algorithm 1+Deepl free+WGC
  2. PP-OCRv5+Confidence Threshold 60+Box Threshold 0.3+Text Removal Algorithm 2+Deepl free+WGC
  3. PP-OCRv5+Confidence Threshold 10+Box Threshold 0.1+Text Removal Algorithm 1+Deepl free+WGC
  4. PP-OCRv5+Confidence Threshold 10+Box Threshold 0.1+Text Removal Algorithm 2+Deepl free+WGC
  5. en_PP-OCRv3+Confidence Threshold 60+Box Threshold 0.3+Text Removal Algorithm 1+Deepl free+WGC
  6. en_PP-OCRv3+Confidence Threshold 60+Box Threshold 0.3+Text Removal Algorithm 2+Deepl free+WGC
  7. en_PP-OCRv3+Confidence Threshold 10+Box Threshold 0.1+Text Removal Algorithm 1+Deepl free+WGC
  8. en_PP-OCRv3+Confidence Threshold 10+Box Threshold 0.1+Text Removal Algorithm 2+Deepl free+WGC

I also ran a quick test with Tesseract OCR, but it was really bad.

  1. TesseractOCR+Confidence Threshold 50 +Text Removal Algorithm 1+Deepl free+WGC
  2. TesseractOCR +Confidence Threshold 10+Text Removal Algorithm 2+Deepl free+WGC

There are still some bugs, but some of the combinations are already pretty good and enjoyable. As for the translation quality, whenever there's no bug in the bubble, the translation itself is totally fine, aside from a couple of minor DeepL mistakes.

Hey man,

That's awesome! Looks like the en_PP_OCRv3 detection model + all these changes changed things from barely usable to almost flawless. There are still a tiny bit of spacing issues here and there with the new recognition model, and the detection model struggles to capture the lone "I" with this font, but other than that - it looks really really good. The text removal algorithm was improved a lot in 0.5.07. I've only had any issue with it when the font is extremely large (due to the text box detection not being good enough).

I'm very happy with these results. There's not a whole lot more adjusting I can do from this point, in this specific case. Pray for better open source models, maybe..

Thank you so much for your rigorous testing mate, it is good to see the application improving fast with good feedback! ๐Ÿš€๐Ÿš€

I have a few questions about the Stream Friendly mode;

  1. Do you have multiple monitors connected? If so, how many, and which one does GameTranslate run on?
  2. Does the same bug happen in Attached mode?
  3. Could you please go to General - Scroll Down - Click 'Crashdump folder' - Go one folder up to 'GameTranslate' - Click 'Data' folder - And at this point, run the app with Stream Friendly mode, select a very specific area (like one word), and see if the screenshot image in the 'Data' folder lines up perfectly with the area you selected.

Thanks! :)

Yep, that en_PP_OCRv3 detection model was really good.

I'm satisfied with it overall. Thanks for that.

It's working well now, but I'm still praying for some even better open-source models ๐Ÿ˜‚

Thanks ๐Ÿ˜Š

  1. No, there's just one.
  2. Yes.
  3. Yes, I think they line up well for a single word, a whole bubble, and even a full two-page image. I hope I tested that right!

Results:

One word:

A whole bubble:

A full two-page image:

en_PP_OCRv3:

PP_OCRv5:


Haha, yes, we'll cross our fingers for that ๐Ÿคž

Alright, thank you! And sorry, about 3. - I meant the screenshot.png specifically in that folder. Let's say you select a word almost pixel perfect - does the screenshot.png resemble the exact area you selected?

By the way - are you on Windows 11 or 10?

(2 edits)

Oh, okay, I think I get it now. And the answer is no. Even after carefully selecting the word "SAY" with a tight frame, the resulting screenshot is misaligned with the area I selected. I got the exact same result with both of the detection models. (My bad if I'm still not getting it lol.)


I'm using Windows 10.

Okay, nice! That's what I wanted to see.

In this case it doesn't matter which model you use, it's not a model issue, only a screen capture misalignment issue.

However, I am now extremely confused as to why the last screenshot with the box is completely misaligned when the previous screenshots you showed me had pixel perfect detection boxes..? This doesn't make any sense, haha. Just to confirm, the last picture is the individual_boxes_before_merge.png image?

Thanks! This gives me a bit more to go on. I might upload a specific version for you to see if any changes I make work. Probably won't be today though.

(1 edit)

Hi,

I finally managed to understand lol.

The last three images I uploaded were from the very end of my test run. Before those, I did a bunch of other tests and never once saw that blue box. I didn't even notice it in the final images until you pointed it out lol. If I make a really tight selection on "SAY," the blue box never appears. I just re-tested that, and it was gone again.

I also tried making a slightly wider selection, and in that case, the blue box always shows up, although the image is still a bit cropped.


Yes, the individual_boxes_before_merge.png image was the last one.

But then, as I mentioned before, if I use the Windows scaling workaround: "1. No, my Windows scaling is at 100%, but here's something interesting: if I start in Desktop mode with a 100-125-150-175 scaling and I make sure to exit and restart the mode with each change, the problem is still there. It doesn't matter what I start with, but if I start in Desktop mode with 100% and then switch the scaling to something like 125 while it's running, the issue is completely gone. At that point, I can set it to anything, even 175, and it works. I can even switch back to 100 and it's still fine."

I got these results with that method:

Tight selection:







Slightly wider selection:





Could there be some Windows bug on my end?

Oh, okay thank you. 

(1 edit)

Hey pal, hope you had a good weekend!

Ahaha, no worries!

Thank you very much for these screenshots. There is clearly a misalignment issue happening here, and while it may seem like the changing of windows scaling completely fixes the bug, it is only masking the problem better. There's still some cropping on both the left, bottom and right.. Giving me a bit of headache this, haha. I'll have to dive deep into it and see what I can do with this information.

I mean, there COULD be an issue on your end, but I would honestly not have any clue what it could be. Until I know if anyone else has the same issue, it'll be impossible for me to figure out if that's the case.

Oh and about the Custom API/MangaOCR lock bug. Could you try doing that again? I've added telemetry to the app now, so any errors that do not cause crashes can also be sent to my backend. It doesn't collect any personal information at all and can be disabled in General configuration if you don't want that to run.

(1 edit)

Hey, thanks for asking, my weekend was great :)

Just take your time, no need to hurry, it's not urgent for me :)

The bug is fixed now, thanks. 
I did see that, but don't worry, I won't turn it off, I think it's a good idea. I have no reason to doubt you (โŠ™ _ โŠ™ )

Keep up the good work ๐Ÿ’ช