2020 @SQLSatLA presents: Data Lakes with Azure Databricks by Dustin Vannoy | @Blackline Room

3 years ago
37

Data Lakes with Azure Databricks by Dustin Vannoy (@dustinvannoy)
The world of analytics and data warehousing has evolved rapidly in the last 10 years with the Data Lake as the backbone of modern data environments. Data Lakes are best built leveraging unique services of the cloud provider to reduce operations complexity. This session will explain why everyone's talking about data lakes, break down the best services in Azure to build a Data Lake, and walk through code for querying and loading with Azure Databricks.

Attendees will leave the session with a firm grasp of why we build data lakes and how Azure Databricks fits in for ETL and querying.

Agenda:
- Why Data Lakes?
- Data Lake best practices
- Reference Architecture implemented with:
○ Azure Databricks
○ Azure Data Lake Storage (Gen 2)
○ Event Hubs for Apache Kafka

Loading comments...