Cloudflare, processing over a billion events per second across its global network, faced a significant data challenge. Information was scattered across dozens of databases, cloud buckets, and streaming platforms, making even simple queries a complex, knowledge-intensive task. This data sprawl hindered effective insight generation.
To combat this, Cloudflare developed two internal tools: "Town Lake," a unified data analytics platform, and "Skipper," an AI data agent built on top of it. Town Lake provides a single SQL interface to all of Cloudflare's data, while Skipper enables employees to ask questions in plain English and receive auditable answers rapidly.
The Data Sprawl Problem
Hyper-growth often leads to data fragmentation. Cloudflare experienced this with too many disparate systems, sampled data unsuitable for critical functions like billing, reliance on external vendors for internal reporting, and a lack of discoverability for data assets.
This situation fostered a culture where data infrastructure was viewed as a secondary function rather than critical technology.
