Databricks has launched Zerobus Ingest, a serverless streaming API designed to handle petabyte-scale data pipelines without requiring manual infrastructure setup. This new service promises to ingest massive volumes of time-series data from sources like IoT sensors and autonomous vehicles directly into Delta tables, governed by Unity Catalog.
The system bypasses the need for traditional message queues like Kafka, offering a push-based API that accepts data from any producer and writes it to the lakehouse. According to the Databricks blog post, Zerobus Ingest demonstrated the ability to ingest one petabyte of data in under 24 hours, maintaining a stable throughput of 12 GB/s to a single table during benchmarks.