European e-commerce giant Zalando has built a unified data foundation on the Databricks Platform, tackling the perennial problem of inconsistent metric definitions and enabling AI-driven analytics.
The company grappled with a complex data ecosystem, fueled by a microservices architecture that generated terabytes of event data. This scale presented significant governance challenges and blurred the lines between transactional and analytical data.
Democratizing Data Governance
Zalando shifted from a resource-centric access model to an identity-based governance approach using Databricks Unity Catalog. This allows for reusable policies tied to people and groups, simplifying access management and auditing.
They implemented a dual-catalog pattern to separate data creation from consumption. Private Catalogs offer domain teams autonomy for development, while a Central Shared Catalog enforces strict governance via Dynamic Views for company-wide data sharing.