locked
Hive & Azure Storage - Blob, Table, File ? Which is the best Storage Option and Why for unstructured data RRS feed

  • Question

  • We are considering migrating to Azure.

    We run a Hive\Hadoop on premise instance.

    We process around 2 TB of unstructured data daily and create about 2TB of processed data that is relational.

    We are considering moving to Azure IaaS or PaaS

    When looking at the storage options for Azure Storage, I am trying to make a decision on whether to use 

    Blob, Table or File and the pros\cons of each for Hive processing on Azure storage



    • Edited by Grover Man Thursday, August 10, 2017 3:50 PM
    Thursday, August 10, 2017 3:39 PM

All replies

  • Based on this article I would say Blob storage.  I would think Table would be too limiting as to what you can store and size and file storage is more of a way to replace on-premises file shares.

    https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-hadoop-use-blob-storage


    Gary A. Bushey

    • Proposed as answer by vikranth s Sunday, August 13, 2017 12:10 PM
    Thursday, August 10, 2017 5:43 PM