Cloudflare is rolling out a new feature for its Browser Rendering service that allows developers to crawl entire websites with a single API call. The new /crawl endpoint, now in open beta, simplifies the process of gathering website data.
Users submit a starting URL, and the service automatically discovers, renders pages in a headless browser, and returns content in multiple formats, including HTML, Markdown, and structured JSON. This capability is particularly useful for training large language models, extracting data for RAG pipeline data extraction, or conducting site-wide research and monitoring. The service, detailed on Cloudflare Docs, operates asynchronously, providing a job ID upon submission and allowing users to check results as pages are processed.
