Databricks Lakebase - The new era of transactional databases in the Lakehouse
- Juan Diaz
- Aug 30, 2025
- 02 Mins read
- Databricks
A fully managed PostgreSQL database, designed to unify analytics and operations for intelligent applications.
Databricks has launched Lakebase, a product that marks a turning point in handling Online Transaction Processing (OLTP) databases. By integrating a fully managed PostgreSQL database engine within its platform, Lakebase addresses the challenges of fragmented data architectures, offering a solution that combines the familiarity of SQL with the scalability and efficiency of a lakehouse.
What is Lakebase?
At its core, Lakebase is a PostgreSQL instance that functions as a service within the Databricks Data Intelligence Platform. Unlike traditional OLTP databases, its decoupled architecture separates storage from compute, allowing for unprecedented flexibility and scalability. This new product is built on the open-source PostgreSQL standard, ensuring that users can leverage its robust ecosystem of tools and libraries without worrying about vendor lock-in.
Key Features and Advantages
-
Decoupled Architecture: It uses a data lake or object store as the primary storage medium. To ensure low latency for OLTP workloads, it implements an intermediate storage layer that acts as a write-through cache. Data is stored in standard PostgreSQL page formats, maintaining compatibility and openness.
-
Serverless and Scalable: Lakebase is available in an autoscaling version that allows databases to start in less than a second and dynamically adjust their resources based on the workload. This means you only pay for the compute time you actually use, optimizing costs.
-
Branching Capability: Thanks to its copy-on-write architecture, users can instantly create a branch of their database, including data and schema. This functionality is ideal for modern development practices and for engineers who need to test multiple experiments with AI agents without affecting the main database.
-
Native Integration with the Lakehouse: Lakebase is designed to work in harmony with the Databricks lakehouse. It can publish tables for real-time analysis and, in turn, consume historical data from the lakehouse through Unity Catalog. This creates a seamless bridge between transactional and analytical workloads, enabling a more efficient and unified data flow.
-
Enterprise-Ready: As part of the Databricks infrastructure, Lakebase comes with enterprise-grade security, compliance, and governance features, ensuring that transactional data is protected and governed consistently with the rest of the platform.
Conclusion
Lakebase is not just another database; it’s a fundamental pillar in Databricks’ vision of a unified data platform. By bringing the familiarity and power of PostgreSQL to the lakehouse environment.
Databricks offers a solution that not only simplifies data architecture but also opens up new possibilities for application development and AI innovation, all on an open and scalable foundation.
Resources
info
The new Lakebase, powered by Neon technology, brings operational data to the lakehouse (storing data in low-cost lakes) with continuous automatic compute scaling to support agent workloads and unifies operational and analytical data.