Databricks is simplifying the path from raw event data to real-time applications with its Zerobus Ingest and Lakebase offerings. Traditionally, ingesting high-velocity data from sources like IoT devices, clickstream analytics, and application telemetry required complex, multi-hop architectures involving message queues and separate processing jobs. This approach introduced latency, data duplication, and operational overhead. According to Databricks, Zerobus Ingest and Lakebase aim to streamline this entire process.
Zerobus Ingest, part of Lakeflow Connect, provides APIs for pushing event data directly into the Databricks Lakehouse. It eliminates the need for a separate message bus layer, reducing infrastructure complexity and enabling near real-time ingestion at scale, with latencies reportedly as low as 5 seconds. This allows thousands of clients to write data concurrently.