SQL Server 2019 BDC storage pool and data pool

  • Question

  • What is the actual difference between the data pool and the storage pool in the BDC architecture?

    Data pool

    The data pool is used for data persistence and caching. The data pool consists of one or more pods running SQL Server on Linux. It is used to ingest data from SQL queries or Spark jobs. SQL Server big data cluster data marts are persisted in the data pool.

    Storage pool

    The storage pool consists of storage pool pods comprised of SQL Server on Linux, Spark, and HDFS. All the storage nodes in a SQL Server big data cluster are members of an HDFS cluster.

    https://docs.microsoft.com/en-us/sql/big-data-cluster/big-data-cluster-overview?view=sql-server-ver15

    Thursday, July 9, 2020 2:32 AM

All replies

  • The actual difference is that they have different architectures.

    Data pool consists of one or more SQL Server data pool instances.
    Storage pool consists of storage nodes comprised of SQL Server on Linux, Spark, and HDFS.
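
    To illustrate how the two pools look from the master instance, here is a minimal T-SQL sketch. It assumes the documented pool endpoints `sqldatapool://controller-svc/default` and `sqlhdfs://controller-svc/default`; the table names, column lists, file format name, and the HDFS path `/clickstream_data` are hypothetical and used only for illustration.

    ```sql
    -- Data pool: an external data source pointing at the SQL Server data pool instances.
    -- (Assumption: the data source does not already exist in this database.)
    CREATE EXTERNAL DATA SOURCE SqlDataPool
    WITH (LOCATION = 'sqldatapool://controller-svc/default');

    -- Hypothetical data pool table; rows are spread across the data pool instances.
    CREATE EXTERNAL TABLE dbo.clicks_data_pool
    (
        user_id     BIGINT,
        category_id BIGINT,
        clicks      BIGINT
    )
    WITH (DATA_SOURCE = SqlDataPool, DISTRIBUTION = ROUND_ROBIN);

    -- Storage pool: an external data source pointing at HDFS in the storage pool.
    CREATE EXTERNAL DATA SOURCE SqlStoragePool
    WITH (LOCATION = 'sqlhdfs://controller-svc/default');

    -- Hypothetical CSV file format for the files stored in HDFS.
    CREATE EXTERNAL FILE FORMAT csv_file
    WITH (
        FORMAT_TYPE = DELIMITEDTEXT,
        FORMAT_OPTIONS (FIELD_TERMINATOR = ',', STRING_DELIMITER = '"', FIRST_ROW = 2)
    );

    -- Hypothetical external table over files under an HDFS directory.
    CREATE EXTERNAL TABLE dbo.clicks_hdfs
    (
        user_id     BIGINT,
        category_id BIGINT,
        click_date  BIGINT
    )
    WITH (DATA_SOURCE = SqlStoragePool, LOCATION = '/clickstream_data', FILE_FORMAT = csv_file);
    ```

    In this sketch the data pool table holds data that SQL Server itself persists, while the storage pool table is only a schema over files that already live in HDFS.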


    • Marked as answer by sakurai_db Sunday, July 12, 2020 1:25 AM
    Friday, July 10, 2020 3:30 AM
  • May I summarize it as follows:

    Data pool ~ similar to multiple SQL Server instances that host databases (mdf, ldf) to persist structured data (tables).

    Storage pool ~ storage for semi-structured and unstructured data; the master instance uses PolyBase to read that data through the SQL Server instances in the storage pool.

    Is my understanding correct?

    Friday, July 10, 2020 4:03 AM
  • I think you are correct.

    Data pool ~ database;

    Storage pool ~ server;

    • Marked as answer by sakurai_db Sunday, July 12, 2020 1:25 AM
    Friday, July 10, 2020 7:36 AM
  • So whether I access data in the data pool or in the storage pool, does the query still go through the following path?

    Master instance -> PolyBase -> compute node -> data pool / storage pool?

    Sunday, July 12, 2020 1:26 AM