Cloudflare is rolling out a new feature designed to tackle a growing problem: AI training crawlers that ignore signals indicating outdated content. Dubbed "Redirects for AI Training," the system automatically converts existing canonical tags into HTTP 301 redirects specifically for verified AI training bots.
This addresses the issue where AI models ingest stale information from deprecated documentation or web pages, leading to inaccurate outputs. Cloudflare observed that bots categorized for AI training visited deprecated content on developers.cloudflare.com at the same rate as current content, despite deprecation banners and noindex tags.
Unlike human users or standard search engine crawlers, AI training bots often rely on cached or modeled data, making it difficult to correct outdated training sets once the information is ingested. Blocking these crawlers outright creates a content void, offering no alternative learning path.
