Uber's Data Abstraction Layer

Uber's Data Abstraction Layer (DAL) simplifies data access, drastically cutting report generation time and enabling more sophisticated advertiser tools.

7 min read
Abstract representation of data flow and connections within a network.
Conceptual visualization of data abstraction and integration.· Uber Engineering

Navigating the labyrinth of evolving data models and product needs is a perennial challenge for engineering teams. Uber faced this head-on, developing a Data Abstraction Layer (DAL) to streamline how its myriad applications access and utilize data. This Uber Engineering initiative aims to abstract away the complexities of underlying datasets, much like programming abstractions hide implementation details.

Visual TL;DR. Data Complexity Challenge leads to Uber's DAL. Uber's DAL enables Independent Evolution. Uber's DAL supports Simplified Reporting. Uber's DAL results in Faster Report Generation. Uber's DAL enables Sophisticated Tools. Uber's DAL achieves Reduced Engineering Burden.

  1. Data Complexity Challenge: disparate table versions and changing schemas make updates slow
  2. Uber's DAL: unified interface abstracts away underlying dataset complexities
  3. Independent Evolution: data producers evolve models independently from consumers
  4. Simplified Reporting: advertiser reporting needs near real-time and historical performance data
  5. Faster Report Generation: drastically cuts report generation time for advertisers
  6. Sophisticated Tools: enables more sophisticated advertiser tools and insights
  7. Reduced Engineering Burden: streamlines how applications access and utilize data
Visual TL;DR
Visual TL;DR — startuphub.ai Data Complexity Challenge leads to Uber's DAL. Uber's DAL results in Faster Report Generation. Uber's DAL enables Sophisticated Tools. Uber's DAL achieves Reduced Engineering Burden leads to results in enables achieves Data Complexity Challenge Uber's DAL Faster Report Generation Sophisticated Tools Reduced Engineering Burden From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Data Complexity Challenge leads to Uber's DAL. Uber's DAL results in Faster Report Generation. Uber's DAL enables Sophisticated Tools. Uber's DAL achieves Reduced Engineering Burden leads to results in enables achieves Data ComplexityChallenge Uber's DAL Faster ReportGeneration SophisticatedTools ReducedEngineering… From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Data Complexity Challenge leads to Uber's DAL. Uber's DAL results in Faster Report Generation. Uber's DAL enables Sophisticated Tools. Uber's DAL achieves Reduced Engineering Burden leads to results in enables achieves Data Complexity Challenge disparate table versions and changingschemas make updates slow Uber's DAL unified interface abstracts awayunderlying dataset complexities Faster Report Generation drastically cuts report generation timefor advertisers Sophisticated Tools enables more sophisticated advertisertools and insights Reduced Engineering Burden streamlines how applications access andutilize data From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Data Complexity Challenge leads to Uber's DAL. Uber's DAL results in Faster Report Generation. Uber's DAL enables Sophisticated Tools. Uber's DAL achieves Reduced Engineering Burden leads to results in enables achieves Data ComplexityChallenge disparate tableversions andchanging schemas… Uber's DAL unified interfaceabstracts awayunderlying dataset… Faster ReportGeneration drastically cutsreport generationtime for… SophisticatedTools enables moresophisticatedadvertiser tools… ReducedEngineering… streamlines howapplications accessand utilize data From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Data Complexity Challenge leads to Uber's DAL. Uber's DAL enables Independent Evolution. Uber's DAL supports Simplified Reporting. Uber's DAL results in Faster Report Generation. Uber's DAL enables Sophisticated Tools. Uber's DAL achieves Reduced Engineering Burden leads to enables supports results in enables achieves Data Complexity Challenge disparate table versions and changingschemas make updates slow Uber's DAL unified interface abstracts awayunderlying dataset complexities Independent Evolution data producers evolve models independentlyfrom consumers Simplified Reporting advertiser reporting needs near real-timeand historical performance data Faster Report Generation drastically cuts report generation timefor advertisers Sophisticated Tools enables more sophisticated advertisertools and insights Reduced Engineering Burden streamlines how applications access andutilize data From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Data Complexity Challenge leads to Uber's DAL. Uber's DAL enables Independent Evolution. Uber's DAL supports Simplified Reporting. Uber's DAL results in Faster Report Generation. Uber's DAL enables Sophisticated Tools. Uber's DAL achieves Reduced Engineering Burden leads to enables supports results in enables achieves Data ComplexityChallenge disparate tableversions andchanging schemas… Uber's DAL unified interfaceabstracts awayunderlying dataset… IndependentEvolution data producersevolve modelsindependently from… SimplifiedReporting advertiserreporting needsnear real-time and… Faster ReportGeneration drastically cutsreport generationtime for… SophisticatedTools enables moresophisticatedadvertiser tools… ReducedEngineering… streamlines howapplications accessand utilize data From startuphub.ai · The publishers behind this format

The core problem: data consumers, whether internal tools or advertiser-facing dashboards, often grapple with disparate table versions and changing schemas. This tight coupling between queries and data topology makes updates slow and arduous. The DAL introduces a unified interface, allowing data producers to evolve their models independently while consumers focus on the data they need, not how it's stored.

Related startups

Simplifying Advertiser Reporting

A prime use case for the DAL is Uber's advertiser reporting. Advertisers need near real-time and historical performance data for their campaigns. Supporting the flexibility required for diverse reporting formats, charts, tables, configurable time ranges, and metrics, with traditional methods was a monumental task.

Before the DAL, building a new advertiser report could take weeks, sometimes months. The DAL, implemented as an RPC service, sits between data consumers (like front-end UIs) and data producers. It handles request routing, query orchestration, and response assembly.

Behind the Abstraction

At its heart, the DAL uses logical tables, definitions of data shape without specifying location. Metrics within these logical tables list candidate physical tables. A sophisticated table resolution process then determines the optimal physical tables and columns to query based on the request, time range, and data availability.

This involves several phases: schema eligibility, dataset availability, and column continuity. Once resolved, a query engine generates and executes database-specific queries, potentially across different systems like Uber's internal OLAP database and Apache Hive™. The engine can even retrieve data from multiple sources simultaneously.

Result assembly then stitches together the individual query responses. This can involve concatenating temporally adjacent datasets, joining complementary data, or performing rollups and scalar transformations. The end result is a coherent dataset delivered to the consumer.

Impact and Future Directions

The impact on advertiser reporting has been dramatic. Report generation time has plummeted from weeks to under two days. This agility has enabled more sophisticated, dynamic tools for advertisers, from user-specified filters to full report builders, greatly simplifying Uber ads data management.

The DAL, initially focused on ads, is expanding into other domains, demonstrating its flexibility. Future enhancements include more database integrations, support for asynchronous requests for larger data volumes, and extensibility mechanisms for domain-specific needs, addressing broader data integration challenges.

© 2026 StartupHub.ai. All rights reserved. Do not enter, scrape, copy, reproduce, or republish this article in whole or in part. Use as input to AI training, fine-tuning, retrieval-augmented generation, or any machine-learning system is prohibited without written license. Substantially-similar derivative works will be pursued to the fullest extent of applicable copyright, database, and computer-misuse laws. See our terms.