I have developed an application using MODI that runs OCR on scanned documents and writes the result to a text document (text only). I want to carry the images and tables from the scanned document over into the output document along with the text. How can I do this? I am struggling with it, so please help me as soon as possible.
- Moved by Cindy Meister MVP, Moderator Thursday, July 28, 2011 9:19 AM Not Word-related (From: Word for Developers)
Thank you for posting.
Microsoft Office Document Imaging (MODI) is a component of Microsoft Office. As far as I know, it was first introduced in Microsoft Office XP and is included in Office 2007. However, it was removed in Office 2010. For more details, please refer to this article:
If you use Office 2003, you can add a reference to the Microsoft Office Document Imaging 11.0 Type Library to your project; please refer to the sample code in that article:
As for Office 2010, please refer to this KB article:
which explains how to install MODI and describes alternative methods.
I hope this information helps. Feel free to follow up after you have tried it.
Bruce Song [MSFT]
Thanks for your reply.
I do not have any issue with the MODI library; it is working fine. I am using a licensed version of Office 2007. My issue is that I need to extract the tables and images, along with the text, from the scanned document and put them into a Word document together with the text.
I have already managed to extract the text from the scanned document and store it in a text document.
I hope you can provide a sample in .NET.
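Regarding the table part of the question: one possible approach, sketched here under assumptions and not as a MODI or .NET sample, is to take the bounding rectangle of each recognized word and group the words into rows and cells by position. The sketch is in Python for brevity. The `WordBox` type, the function names, and the tolerance values are all hypothetical. The coordinates are assumed to come from whatever per-word rectangles your OCR engine exposes (MODI's layout object model is believed to provide these, but please verify that against its documentation).

```python
from dataclasses import dataclass
from typing import List


@dataclass
class WordBox:
    """One recognized word and its bounding rectangle in pixels (hypothetical type)."""
    text: str
    left: int
    top: int
    right: int
    bottom: int


def group_into_rows(words: List[WordBox], row_tolerance: int = 10) -> List[List[WordBox]]:
    """Group words whose top edge is within row_tolerance of the row's first word."""
    rows: List[List[WordBox]] = []
    for word in sorted(words, key=lambda w: (w.top, w.left)):
        if rows and abs(word.top - rows[-1][0].top) <= row_tolerance:
            rows[-1].append(word)
        else:
            rows.append([word])
    # Order the words of each row from left to right.
    return [sorted(row, key=lambda w: w.left) for row in rows]


def split_into_cells(row: List[WordBox], column_gap: int = 40) -> List[str]:
    """Start a new cell wherever the horizontal gap between neighbours exceeds column_gap."""
    cells = [[row[0]]]
    for prev, word in zip(row, row[1:]):
        if word.left - prev.right > column_gap:
            cells.append([word])
        else:
            cells[-1].append(word)
    return [" ".join(w.text for w in cell) for cell in cells]


def boxes_to_table(words: List[WordBox], row_tolerance: int = 10,
                   column_gap: int = 40) -> List[List[str]]:
    """Turn a flat list of word boxes into a list of rows, each a list of cell strings."""
    return [split_into_cells(row, column_gap)
            for row in group_into_rows(words, row_tolerance)]


# Made-up sample coordinates, for illustration only.
sample = [
    WordBox("Item", 10, 10, 50, 30),
    WordBox("Price", 200, 12, 250, 30),
    WordBox("Green", 10, 50, 60, 70),
    WordBox("tea", 65, 51, 90, 70),
    WordBox("4.50", 200, 50, 240, 70),
]
table = boxes_to_table(sample)
for table_row in table:
    print(" | ".join(table_row))
```

The resulting list of rows could then be written into a Word table cell by cell. The two thresholds depend on the scan resolution and would need tuning. This heuristic only handles simple grids without merged cells, and it does not cover extracting the images.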