The world of analytics and data warehousing has evolved rapidly over the last 10 years, with the data lake becoming the backbone of modern data environments. This video describes what a data lake is and why we build them, shows an example of querying a data lake with Spark SQL in Azure Databricks, and recommends some best practices.
The notebook used for the demo can be downloaded at https://github.com/datakickstart/data...
More from Dustin:
Website: dustinvannoy.com
Twitter: @dustinvannoy
GitHub: https://github.com/datakickstart
Table of Contents:
00:12 - What is a Data Lake?
01:29 - Why Data Lakes?
06:01 - Demo: Data Lake Querying
11:01 - Data Lake Best Practices
12:56 - More Content