Databricks is rolling out public preview support for Apache Iceberg v3, signaling a significant push towards unifying data management in the open lakehouse architecture. This update integrates key Iceberg v3 features directly into the Databricks platform, promising enhanced performance and interoperability.
Iceberg v3 introduces Row Lineage and Deletion Vectors, enabling more efficient incremental data processing. These features allow for tracking row changes and applying deletions without immediate file rewrites, potentially speeding up data manipulation by up to 10x compared to traditional copy-on-write methods.