Databricks is rolling out Query Tags, a new feature designed to bring much-needed clarity to the often opaque world of data pipeline costs. This enhancement, detailed in an announcement from Databricks, promises granular usage attribution for dbt pipelines, allowing teams to track precisely where compute resources are being consumed.
Related startups
For too long, understanding the cost implications of complex dbt projects has been a significant challenge. When a warehouse bill doubles, pinpointing the exact models or teams responsible can feel like searching for a needle in a haystack, especially when query histories show little more than generic labels like 'Databricks Dbt.' Query Tags aim to solve this by automatically injecting metadata and enabling custom tagging for every query generated by a dbt pipeline.
Automated Insights, Custom Control
The integration with the dbt on Databricks adapter (version 1.11 and above) offers multiple layers of tagging. Databricks automatically injects tags like the dbt model name, materialization strategy, and adapter versions. This out-of-the-box visibility requires zero configuration.