Welcome to the Data Lakehouse!

The Data Lakehouse is a new paradigm in Big Data processing. It aims to unify Data Lakes and Business Intelligence.

This website is a collection of blogs and videos curated by Advancing Analytics.

As topics emerge and change you will find this website updating to reflect the current ideas from data leaders.

Realizing the Vision of the Data Lakehouse | Ali Ghodsi 

Data warehouses have a long history in decision support and business intelligence applications. But, data warehouses were not well suited to dealing with the unstructured, semi-structured, and streaming data common in modern enterprises. This led to organizations building data lakes of raw data about a decade ago. But, they also lacked important capabilities. The need for a better solution has given rise to the data lakehouse, which implements similar data structures and data management features to those in a data warehouse, directly on the kind of low cost storage used for data lakes.

Azure Synapse Analytics - Lambda Patterns with Synapse Link On-Demand

One of the new features of Synapse Analytics is Synapse Link - the ability to query a live analytics store within CosmosDB with only tiny amounts of setup. We've recently seen it rolled out for the SQL On-Demand endpoint, meaning we can write both Spark and SQL directly over this analytics store!

Data Lakehouse Explained in 5 Minutes | Ajay Singh

Presented by Ajay Singh, VP Field and Partner Engineering at Databricks.