Reducto, a startup building the essential plumbing for AI to understand real-world documents, has raised a $75 million Series B led by Andreessen Horowitz (a16z). The new round brings the company’s total funding to $108 million and signals a massive bet on a less glamorous but critical piece of the AI puzzle: data ingestion.
While the industry obsesses over building bigger and better large language models, Reducto is focused on the messy, unstructured data those models need to be useful. Most of the world’s most valuable information is locked away in PDFs, spreadsheets, scans, and slides—formats that are notoriously difficult for AI to parse accurately. Reducto’s API acts as a universal translator, combining computer vision with vision-language models (VLMs) to convert this chaos into clean, LLM-ready data.
