No data slices in this table

  • Question

  • Dear all,

    I am trying to create predictive pipelines using Azure Data Factory and Azure Machine Learning, following this article:

    https://azure.microsoft.com/en-us/documentation/articles/data-factory-create-predictive-pipelines/

    The input table is a static CSV file with a header row, and I have already set the external and externalData settings.

    But Azure Data Factory still shows "No data slices in this table".

    Why does it report "No data slices in this table"?

    Thanks,

    Qi

    Sunday, September 6, 2015 2:52 PM

Answers

  • Hi Qi,

    The combination of the input and output table availability, together with the start and end times for the pipeline, is what really dictates which data slices are created.

    So if your input is static but you say it is available every hour, then it'll be processed every hour; ADF makes no assumption about whether the data has changed.

    It's a bit of an arcane concept. I found the following article useful on this.

    https://azure.microsoft.com/en-us/documentation/articles/data-factory-scheduling-and-execution/

    This is also why, if you set the pipeline to have start and end dates in the past, it'll pre-fill data slices for that period.
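
    As a rough illustration of the idea above, here is a minimal Python sketch of how slice windows fall out of an active period plus an availability interval. It is a simplified model, not ADF's actual scheduler: the helper name enumerate_slices is hypothetical, and it assumes the start time is already aligned to the interval.

```python
from datetime import datetime, timedelta

# Hypothetical helper for illustration only; not part of any Azure SDK.
# Simplified model: slices are back-to-back windows of length `interval`
# that fit entirely inside the active period [start, end).
def enumerate_slices(start, end, interval):
    slices = []
    cursor = start
    while cursor + interval <= end:
        slices.append((cursor, cursor + interval))
        cursor += interval
    return slices

# Active period of six hours with hourly availability: one slice per hour,
# regardless of whether the underlying file ever changes.
start = datetime(2015, 9, 1, 0, 0)
end = datetime(2015, 9, 1, 6, 0)
hourly = enumerate_slices(start, end, timedelta(hours=1))
print(len(hourly))

# An active period with zero length yields no slices at all in this model.
empty = enumerate_slices(start, start, timedelta(hours=1))
print(len(empty))
```

    In this model, a static file declared as available hourly still gets one slice per hour of the active period, and an active period that is empty or shorter than one interval produces none, which is one thing worth checking when the portal shows no slices.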

    If you're able to share some of the pipeline and input/output config, that would help diagnose your issue.

    Thanks,

    Martin

    Friday, September 11, 2015 12:56 AM

All replies

  • Hi Qi,

    Take a look at the last point in the following forum post.

    https://social.msdn.microsoft.com/Forums/windowsserver/en-US/66405ec9-a43b-45e2-b19d-2409d04305b5/no-data-slices-in-this-table?forum=AzureDataFactory

    So basically, if you've set up the external flags correctly, is the active period for the pipeline also configured correctly?

    Thanks,

    Martin

    Monday, September 7, 2015 12:31 AM
  • Hi, Martin

    Thanks so much for your reply. I have checked the active period for the pipeline and it is configured correctly.

    I have two CSV files in my Azure storage. One is modified by Stream Analytics every 20 seconds. This CSV file works well as an input dataset in Data Factory.

    The other is a static CSV file that I want to use to test Azure Machine Learning web services. This CSV file is not sliced correctly.

    Is there any requirement for the input dataset, such as being updated periodically?

    Thanks,

    Qi

    Wednesday, September 9, 2015 7:08 AM