Data Agents Need Context Layer

AI data agents are failing due to a lack of context. A new 'context layer' aims to provide the necessary business understanding for agents to function effectively.

4 min read
Abstract visualization of interconnected data points forming a complex network, representing a context layer.
Image credit: Andreessen Horowitz

The promise of AI data agents, capable of automating complex tasks and answering intricate business questions, has hit a significant roadblock: context. Despite advancements in Large Language Models (LLMs) and the maturation of the modern data stack, organizations are finding their agents largely ineffective. This isn't a failure of the AI models themselves, but a fundamental gap in how they understand and interact with enterprise data, as detailed in a recent analysis from Andreessen Horowitz.

The journey began with the modern data stack, aiming to consolidate and clean disparate data sources. The idea was that with organized data, business intelligence would be straightforward. Then came the AI agent frenzy of 2024-2025, fueled by LLM capabilities. Companies rushed to build 'chat with your data' tools and support agents, expecting increased efficiency.

However, reality set in. MIT's 'State of AI in Business 2025' report highlighted that most AI deployments fail due to brittle workflows and a lack of contextual learning. Data agents, in particular, struggled with ambiguous queries, deciphering business definitions, and reasoning across messy, disparate data. The core issue wasn't just translating natural language to SQL; it was understanding the nuances of business terms and data sources.

The Contextual Chasm

Consider a seemingly simple query: "What was revenue growth last quarter?" For an AI agent, this is a minefield. How is 'revenue' defined – ARR or run rate? Are fiscal quarters normalized across the organization? What time window is relevant?

Traditional semantic layers, often built for specific BI tools like Looker, fall short. They might define 'revenue' but are typically hand-coded, quickly become outdated, and fail to capture new product lines or evolving business logic. A hardcoded definition, even if accurate at one point, doesn't account for the dynamic nature of enterprise data.

Furthermore, identifying the correct data sources is a challenge. Finance might use one table, while the data team has materialized views. Agents need access to an up-to-date repository of business definitions and data sources to navigate this complexity. This is where the concept of a 'data agents context layer' emerges as critical.

Enter the Context Layer

A context layer aims to bridge this gap. It acts as an intelligent intermediary, consolidating messy enterprise data and overlaying it with business logic and tribal knowledge. While terms like 'context OS' or 'contextual data layer' are used, the underlying principle is consistent: providing agents with the necessary understanding to function effectively.

This is more than a traditional semantic layer. A modern context layer should encompass canonical entities, identity resolution, and specific instructions for navigating implicit knowledge. It's about creating a multi-dimensional corpus where both code and natural language coexist, guiding agents with the full picture of an organization's data and decision-making processes.

Building such a system involves several steps: ensuring comprehensive data access, automating initial context construction using LLMs, refining this context with human input for implicit knowledge, connecting agents via APIs, and establishing self-updating flows to adapt to changing data landscapes. The goal is to create a living, evolving knowledge base for AI agents.

The market is beginning to respond, with data gravity platforms like Databricks and Snowflake building AI analyst tools and emerging companies focusing on context construction. Solutions like Snowflake's Context Layer for Smarter Agents and others are starting to address this critical need, recognizing that context is not a nice-to-have, but a fundamental requirement for the success of data agents.

The path to truly autonomous AI agents requires solving the data context problem, a blend of technical data infrastructure challenges and the human element of capturing and maintaining institutional knowledge. As the industry grapples with this, the importance of a robust context layers AI approach becomes increasingly clear, enabling the next generation of intelligent automation. The future of modern data stack AI agents hinges on this contextual foundation.