Searching Arabic PDF documents in SharePoint 2007: result content is not in a readable format (out-of-the-box search)

  • Question

  • Hi Team,

    We have a SharePoint 2007 Enterprise environment (English) with the SharePoint 2007 Arabic language pack installed. To enable PDF search, we installed TET PDF IFilter 4.1p10 for Windows. We are using the out-of-the-box search.

    We uploaded English and Arabic PDF documents and then ran a crawl. If we search for an English PDF document, the results are correct: the document name is shown with some of its content below it. If we search for an Arabic PDF document, the search results list the Arabic PDF documents, but the Arabic content shown in the results is not in a readable format; we get some Unicode characters instead.

    To fix the issue, we ran the Arabic PDF files through Arabic OCR software (such as Sakhr, Readiris, and ABBYY) and uploaded the OCRed PDF files to the SharePoint portal, but we still get the same result (the content in the search results is not readable).

    How can we fix this issue? Do we need to configure any additional search settings?


    • Edited by Jeevanmjr Wednesday, December 26, 2012 12:23 PM
    Wednesday, December 26, 2012 12:09 PM