A Pixel Matching Problem

Roedy Green · Nov 19, 2005

this problem is like a miniature OCR problem.

Let's say you have some images that contain text, or screen snapshots
obtained by the Robot class. How can you extract the text from them?

You have the advantage that you probably have access to the fonts used
to create the text.

How might you go about creating an OCR for images? I was thinking of
writing this up as a student project.

The advantage you have over true OCR is all matches will be exact.

The complications are a variety of colours for foreground/background,
antialiasing and painting over muticoloured backgrounds. For JPGs you
may not have the fonts used. The edges of the text may be tweaked in
various ways, e.g. 3D, blurred, warped.

I thought you might proceed like this:

1. find rectangular regions containing only two colours.

2. draw an e in 10 font sizes for each font as your search templates.

3. look for a hole the right shape. When you find one, check all the
e's you have with that size/shape hole to see if you have a match on
the entire letter, check the whole rectangle.

4. if you have a match, calculate the baseline and starting point. now
draw a template ea in that same font and compare. If it fails try eb
etc. Work your way both left and right pulling that line.

You might construct a hashMap indexed by a digest of the glyph so you
can more rapidly check for matches. Your digest algorithm might trim
the glyph top/bottom/left/right so you don't need the stringWidth
information by actually drawing the character pair.

5. You carve the rectangle out of the bigger one, and break the
remaining into rectangles.

6. repeat until there are no more rectangles.

7. export the text each labelled with x.y where it was found.

8. In another program allow the user to highlight text, e.g. a column
or box to determine the linear order of the text desired.

Potential uses for such software include:

1. capturing filenames, error messages, crash locations that were
displayed in a non-cut/pasteable way.

2. by people trying to defeat my email munger. See
http://mindprod.com/applets/masker.html

3. by blind people extracting textual information from images.

4. To allow you to copy from any Swing Component.

5. to extract information from a screen snapshot without having to
retype it.

I Need Help with making a function that draws in a canvas using location data.	1	Dec 17, 2021
[QUIZ] Console-based Pixel Editor (#231)	3	Apr 16, 2010
Problem with a login script, SESSION user rights and put this together so it works with the other pages and MySQL. Code examples.	2	May 5, 2023
Idk need help in editing this source code	0	Nov 5, 2022
Help with Visual Lightbox: Scripts	2	May 3, 2023
Need assistance finetuning HTML, CSS, Javascript - sticky header issue	3	Feb 25, 2022
How Do I Set text on an Image and use the image as a border?	7	Mar 16, 2023
Matching Items in a vector	3	Aug 1, 2006

A Pixel Matching Problem

Roedy Green

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads