none
DataFrames in Azure Data Lake Storage

    Question

  • 1. I am trying to learn how to read ADLS. But in this link https://docs.azuredatabricks.net/spark/latest/dataframes-datasets/introduction-to-dataframes-python.html, various ways of how to use DataFrame is given. But basic definition of what is DataFrame is not given. Can somebody suggest some link.

    Thanks.

    Tuesday, January 8, 2019 1:27 PM

Answers

  • A DataFrame is a distributed collection of data organized into named columns. It is conceptually equivalent to a table in a relational database or a data frame in R/Python.

    DataFrame API provides a rich set of functions (select columns, filter, join, aggregate, and so on) that allow you to solve common data analysis problems efficiently.

    Check the documentation What is Azure Databricks? For more details.

    Wednesday, January 9, 2019 5:23 AM
    Moderator

All replies

  • A DataFrame is a distributed collection of data organized into named columns. It is conceptually equivalent to a table in a relational database or a data frame in R/Python.

    DataFrame API provides a rich set of functions (select columns, filter, join, aggregate, and so on) that allow you to solve common data analysis problems efficiently.

    Check the documentation What is Azure Databricks? For more details.

    Wednesday, January 9, 2019 5:23 AM
    Moderator
  • The explanation given is good. But this explanation I dont find in the link you shared.
    Sunday, January 13, 2019 7:15 PM