How to load array<string> data type from parquet file to Azure Data Warehouse RRS feed

  • Question

  • Hi Team,

    I am working with parquet files stored on Amazon S3. These files need to extracted and the data needs to be loaded into Azure Data Warehouse. My plan

    Amazon S3 -> Use SAP BODS to move parquet files to Azure Blob -> Create External tables on those parquet files -> Staging -> Fact/ Dim tables

    Now the problem is in one of the parquet files there is a column that is stored as array<string>. I am able to create external table on it using varchar for that column but if I perform any sql query operation (i.e. Select) on that external table then it throws below error.

    Msg 106000, Level 16, State 1, Line 3

    HdfsBridge::recordReaderFillBuffer - Unexpected error encountered filling record reader buffer: ClassCastException: optional group status (LIST) {

      repeated group bag {

        optional binary array_element (UTF8);


    } is not primitive

    I have tried different data types but unable to run select query on that external table.

    Please let me know if there are any other options.



    Friday, July 19, 2019 2:14 AM


All replies