none
MODI Tiff Extensions Format RRS feed

  • Question

  • I'm using MODI to OCR Tiff Images. On a different machine (without MODI installed) I need to be able to retrieve the positions of the OCR'ed words (this is so I can highlight words when they have been found by a search process).

    I understand Microsoft has registered several private TIFF tags and I believe I need to read tag 37679 to get the text and 37681 to get the positions. I've found a brief explanation of how tag 37681 is formatted here:

    http://code.msdn.microsoft.com/office/Office-Document-TIFF-5487f6ce#content

    The problem is this is very sparse (and the sample code hard to follow). I also downloaded MSOfficeTIFFFormatGuidance - this again says very little about tag 37681.

    Can anyone give me any more information on how the data in Tag 37681 is formatted. Thanks.


    Cheers, Mike

    Monday, September 23, 2013 11:10 PM