Oh, okay, I think I get it now. And the answer is no. Even after carefully selecting the word "SAY" with a tight frame, the resulting screenshot is misaligned with the area I selected. I got the exact same result with both detection models. (My bad if I'm still not getting it lol.)

I'm using Windows 10.