XML files as data source for Polybase? RRS feed

  • Question

  • I have 50-100GB XML files streamed to Azure Blob every day. I would like to make query to XML data and store aggregated data to Azure Data Warehouse.

    Is XML files supported in polybase?

    If not what is alternative method? Should XML files flattened to HDInsight Hive and polybase read Hive file?

    Is there Azure services scalable enough to store XML contents and allow Azure DW to access data?


    Wednesday, November 23, 2016 2:55 PM

All replies

  • PolyBase does not support XML as a file format. Your workaround would be ideal.

    PolyBase supports Delimited Text, RC, ORC and Parquet.

    If you are already using hive, I would create a ORC output and write it to blob, then use PolyBase to ingest the data.

     Documentation https://msdn.microsoft.com/en-us/library/dn935026.aspx?f=255&MSPPError=-2147217396
    Monday, December 5, 2016 5:27 PM
  • did you find a solution for this?
    Friday, September 28, 2018 2:44 PM