User815917640 posted
Yes we did.
We used open source OCR library: http://code.google.com/p/tesseract-ocr/
It is very good at redaction as compared to paid ones that I tried. Here's some code with logic as comments
1 Dim OcrOutput As List(Of tessnet2.Word) = Ocr.doOCR(Image1, Rectangle.Empty)
2 'Read words
3 For Each Word As tessnet2.Word In OcrOutput
4 'Read each character in words - so we can look for pattern
5 For Each WordChar As tessnet2.Character In Word.CharList
6 Next
7
8 'Then run regular expression on the words or character combinations to extract out SSN or anthing
9 'And get x,y,height,width for word/char to create rectangles that will be drawn on the image
10 Next
11
Hope this helps