RFP Document Parsing Engine

  • General discussion

  • We are trying to develop an application that parses uploaded RFP (request for proposal) documents in DOCX, DOC or PDF format. These are huge federal government documents with multiple sections meant for certain areas (C, L, M). The app needs to parse all the requirements in a document, based on selection criteria, and save them into an RTM (requirements traceability matrix). We looked into OpenXML and found some limitations.

    Is there any Microsoft native tool, API or library on MS Azure that we can use to build this application? Has anybody developed such a solution to handle a huge volume of documents? We would appreciate any ideas to help us go in the right direction.
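
    To make the goal concrete, a minimal sketch of the extraction step might look like the following. It assumes the document text has already been pulled out as a list of paragraphs, that section headings look like "Section C", and that requirement sentences contain "shall", "must" or "will". The names build_rtm and RtmRow and both patterns are hypothetical, for illustration only, and are not part of any Microsoft API.

```python
import re
from dataclasses import dataclass

# Assumed heading shape, e.g. "Section C - Statement of Work".
SECTION_RE = re.compile(r"^\s*SECTION\s+([A-Z])\b", re.IGNORECASE)
# Assumed selection criterion: sentences with binding keywords.
KEYWORD_RE = re.compile(r"\b(shall|must|will)\b", re.IGNORECASE)


@dataclass
class RtmRow:
    """One row of the requirements traceability matrix (hypothetical layout)."""
    req_id: str
    section: str
    text: str


def build_rtm(paragraphs, wanted_sections=("C", "L", "M")):
    """Collect requirement sentences from the wanted sections."""
    rows = []
    current = None   # letter of the section we are currently inside
    counters = {}    # per-section running requirement number
    for para in paragraphs:
        heading = SECTION_RE.match(para)
        if heading:
            current = heading.group(1).upper()
            continue
        if current not in wanted_sections:
            continue
        # Naive sentence split: break after ., ! or ? followed by whitespace.
        for sentence in re.split(r"(?<=[.!?])\s+", para.strip()):
            if sentence and KEYWORD_RE.search(sentence):
                counters[current] = counters.get(current, 0) + 1
                req_id = f"{current}-{counters[current]:03d}"
                rows.append(RtmRow(req_id, current, sentence))
    return rows
```

    Calling build_rtm on the paragraph list returns one RtmRow per matching sentence, which could then be written to a spreadsheet or database table. Real RFPs would need more robust heading detection and sentence splitting than this.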

    Monday, November 5, 2012 6:33 PM

All replies

  • Hi B2CS,

    Thanks for posting in the MSDN Forum.

    I suppose you want to develop an application that exports information from a specific document according to specific criteria. Is that right?

    I would suggest using VSTO to create an add-in or an automation application to handle this. However, exporting from PDF documents is not supported in the MSDN Forum; please consult the PDF vendor's official website for that.

    Have a good day,

    Tom


    Tom Xu [MSFT]
    MSDN Community Support | Feedback to us

    Wednesday, November 7, 2012 1:55 AM
    Moderator
  • Hi B2CS,

    Any update?

    Have a good day,

    Tom


    Tom Xu [MSFT]
    MSDN Community Support | Feedback to us

    Friday, November 9, 2012 7:21 AM
    Moderator