Data Virtualization in SQL Server 2022

ABSTRACT

PolyBase is Microsoft's primary data virtualization technology, allowing you to integrate SQL Server with a variety of external systems such as Oracle, MongoDB, Cosmos DB, Teradata, Apache Spark, and even other SQL Server instances. In this talk, we will walk through the mechanics of PolyBase, showing how we can write T-SQL queries to access data from heterogeneous sources. We will see how data virtualization in SQL Server has evolved over the years and the new capabilities you will find in SQL Server 2022. Finally, we will look at several interesting use cases for data virtualization in the modern data environment.

ADDITIONAL MEDIA

On August 30, 2022, I gave this talk to the DBA Fundamentals group. You can get the recording on the DBA Fundamentals group's YouTube channel.

On August 13, 2019, I delivered the prior version of this talk at the Triangle SQL Server User Group's Advanced DBA special interest group. You can get the recording on my YouTube channel.