locked
Search documents in BLOB storage RRS feed

  • Question

  • Is it possible to crawl BLOB storage and index the contents of documents using SharePoint online?

    Are there other products that can crawl BLOB storage and index the contents of documents?

    Wednesday, January 22, 2014 1:32 PM

Answers

  • hi Davido,

    Thanks for posting!

    Base on my experience, you didn't directly crawl blob storage content .Because blob is stored as the file. You could get the blob list and search files from blob list by blob name.

    Or you could use table storage. When you upload a file, you could save the file information and blob URL as a table entity into the table storage.  And then you could search content from table storage and find the related URL to download blob file.

    I suggest you could see those links:

    http://sachintana.blogspot.in/2012/08/azure-blob-storage-for-sharepoint.html

    and http://www.windowsazure.com/en-us/documentation/articles/storage-dotnet-how-to-use-blobs-20/

    http://www.windowsazure.com/en-us/documentation/articles/storage-dotnet-how-to-use-table-storage-20/

    Please try it.

    Regards,

    Will


    We are trying to better understand customer views on social support experience, so your participation in this interview project would be greatly appreciated if you have time. Thanks for helping make community forums a great place.
    Click HERE to participate the survey.

    Thursday, January 23, 2014 7:16 AM
  • hi David,

    >>I can see now how I can store the documents in blob storage and keep a reference to them in SharePoint by putting metadata in a SharePoint List. My next question may belong in a different forum so let me know if you think that is the case. How can I configure SharePoint to index the contents of the documents in the SharePoint List?

    You could refer to this thread (http://www.codeproject.com/Questions/333781/How-to-insert-data-to-sharepoint-lists ) to try to how to store the data in sharepoint list.

    Also this question may be related about SharePoint (http://social.msdn.microsoft.com/Forums/sharepoint/en-US/home?category=sharepoint ), you could post on sharepoint forum for better support.

    >>So if I have a recipes.docx in blob storage I can search for pasta and find that document.

    If your docx file was stored on blob, you didn't find this file by searching pasta. You only get the file name and url on blob. In this scenarios, I guess the sharepoint may be the good choice. Or you could use sql azure, you store all content into sql azure table.

    Regards,

    Will


    We are trying to better understand customer views on social support experience, so your participation in this interview project would be greatly appreciated if you have time. Thanks for helping make community forums a great place.
    Click HERE to participate the survey.

    Tuesday, January 28, 2014 8:08 AM
  • As far as I know SharePoint online does not support Remote Blob Stream(RBS), which is helpful feature to store large documents in a remote blob. It's a feature that comes from SQL server, but in SharePoint Online, the content database is not exposed meaning we can't directly access it.

    An alternative would be a custom search solution with SQL Azure + Blob where you can index in SQL according to requirements and store the actual content in Blob.

    Good Luck!

    Wednesday, January 29, 2014 2:54 PM

All replies

  • hi Davido,

    Thanks for posting!

    Base on my experience, you didn't directly crawl blob storage content .Because blob is stored as the file. You could get the blob list and search files from blob list by blob name.

    Or you could use table storage. When you upload a file, you could save the file information and blob URL as a table entity into the table storage.  And then you could search content from table storage and find the related URL to download blob file.

    I suggest you could see those links:

    http://sachintana.blogspot.in/2012/08/azure-blob-storage-for-sharepoint.html

    and http://www.windowsazure.com/en-us/documentation/articles/storage-dotnet-how-to-use-blobs-20/

    http://www.windowsazure.com/en-us/documentation/articles/storage-dotnet-how-to-use-table-storage-20/

    Please try it.

    Regards,

    Will


    We are trying to better understand customer views on social support experience, so your participation in this interview project would be greatly appreciated if you have time. Thanks for helping make community forums a great place.
    Click HERE to participate the survey.

    Thursday, January 23, 2014 7:16 AM
  • Hi Will,

    Thanks for the reply. I can see now how I can store the documents in blob storage and keep a reference to them in SharePoint by putting metadata in a SharePoint List. My next question may belong in a different forum so let me know if you think that is the case.

    How can I configure SharePoint to index the contents of the documents in the SharePoint List?

    So if I have a recipes.docx in blob storage I can search for pasta and find that document.

    Regards,

    David

    Thursday, January 23, 2014 12:04 PM
  • hi David,

    >>I can see now how I can store the documents in blob storage and keep a reference to them in SharePoint by putting metadata in a SharePoint List. My next question may belong in a different forum so let me know if you think that is the case. How can I configure SharePoint to index the contents of the documents in the SharePoint List?

    You could refer to this thread (http://www.codeproject.com/Questions/333781/How-to-insert-data-to-sharepoint-lists ) to try to how to store the data in sharepoint list.

    Also this question may be related about SharePoint (http://social.msdn.microsoft.com/Forums/sharepoint/en-US/home?category=sharepoint ), you could post on sharepoint forum for better support.

    >>So if I have a recipes.docx in blob storage I can search for pasta and find that document.

    If your docx file was stored on blob, you didn't find this file by searching pasta. You only get the file name and url on blob. In this scenarios, I guess the sharepoint may be the good choice. Or you could use sql azure, you store all content into sql azure table.

    Regards,

    Will


    We are trying to better understand customer views on social support experience, so your participation in this interview project would be greatly appreciated if you have time. Thanks for helping make community forums a great place.
    Click HERE to participate the survey.

    Tuesday, January 28, 2014 8:08 AM
  • As far as I know SharePoint online does not support Remote Blob Stream(RBS), which is helpful feature to store large documents in a remote blob. It's a feature that comes from SQL server, but in SharePoint Online, the content database is not exposed meaning we can't directly access it.

    An alternative would be a custom search solution with SQL Azure + Blob where you can index in SQL according to requirements and store the actual content in Blob.

    Good Luck!

    Wednesday, January 29, 2014 2:54 PM