locked
Pdf Reader in Vb.net RRS feed

  • Question

  • How to read the pdf file in vb.net and convert to word or any other format
    Tuesday, October 20, 2015 8:54 AM

Answers

  • Hi Vinay,

    iTextPdf looks like a good starting point, open source and c# so any examples should be portable to vb.net The c# port http://sourceforge.net/projects/itextsharp/files/

    There is third-party things out there; Thanks for your understanding. The site http://itextpdf.com/ 

    Alternativly take a look at this article for a number of .net alternatives  http://www.codeproject.com/KB/string/pdf2text.aspx

    Best regards,

    Kristin


    We are trying to better understand customer views on social support experience, so your participation in this interview project would be greatly appreciated if you have time. Thanks for helping make community forums a great place.
    Click HERE to participate the survey.

    Wednesday, October 21, 2015 1:43 AM
  • A 3rd party library is necessary. While iTextSharp is not able to give layout, formatting information from the pdf file when converting PDF to Word. It just extracts content and provides some other functionality. So I recommend to test .NET PDF library which provides easy method to process PDF in C#.

    read PDF file:

    PdfDocument doc = new PdfDocument();
    doc.LoadFromFile("sample.pdf");

    choose desired file format from FileFormat enum and save to another format:

    doc.SaveToFile("PDFtoDoc.doc", FileFormat.DOC);



    Wednesday, October 21, 2015 2:38 AM

All replies

  • Hi Vinay,

    iTextPdf looks like a good starting point, open source and c# so any examples should be portable to vb.net The c# port http://sourceforge.net/projects/itextsharp/files/

    There is third-party things out there; Thanks for your understanding. The site http://itextpdf.com/ 

    Alternativly take a look at this article for a number of .net alternatives  http://www.codeproject.com/KB/string/pdf2text.aspx

    Best regards,

    Kristin


    We are trying to better understand customer views on social support experience, so your participation in this interview project would be greatly appreciated if you have time. Thanks for helping make community forums a great place.
    Click HERE to participate the survey.

    Wednesday, October 21, 2015 1:43 AM
  • A 3rd party library is necessary. While iTextSharp is not able to give layout, formatting information from the pdf file when converting PDF to Word. It just extracts content and provides some other functionality. So I recommend to test .NET PDF library which provides easy method to process PDF in C#.

    read PDF file:

    PdfDocument doc = new PdfDocument();
    doc.LoadFromFile("sample.pdf");

    choose desired file format from FileFormat enum and save to another format:

    doc.SaveToFile("PDFtoDoc.doc", FileFormat.DOC);



    Wednesday, October 21, 2015 2:38 AM
  • Here is another alternative that you may want to test out:

    // Load PDF file.
    var document = DocumentModel.Load("Sample.pdf", LoadOptions.PdfDefault);
    // Save DOCX file.
    document.Save("Sample.docx", SaveOptions.DocxDefault);

    It is a word processing library for C# that can read PDF files in .NET and write them as some other format. 

    Wednesday, March 30, 2016 1:45 PM