deepinfradeepinfra
deepinfra

deepinfra

Cloud GPU Provider

Active

About

DeepInfra is a scalable AI inference infrastructure platform that enables developers and enterprises to deploy open-source machine learning models through simple APIs. Operating GPU infrastructure across eight U.S. data centers, it supports over 190 open-source models via OpenAI-compatible APIs and processes nearly five trillion tokens per week for production and agentic AI workloads.

Technology stack

detected 2026-06-18
Est. monthly stack spend~$160/mo
CDN
Vercel
EmailGoogle Workspace
Hosting
Vercel
Stack
Next.js
Comments

No comments yet. Be the first to share your take.

Frequently asked

What does deepinfra do?

DeepInfra is a scalable AI inference infrastructure platform that enables developers and enterprises to deploy open-source machine learning models through simple APIs. Operating GPU infrastructure across eight U.S. data centers, it supports over 190 open-source models via OpenAI-compatible APIs and processes nearly five trillion tokens per week for production and agentic AI workloads.

How much funding has deepinfra raised?

deepinfra has raised a total of $107M in funding. The most recent round on record is Series B.

What industry does deepinfra operate in?

deepinfra operates in AI Infrastructure, Machine Learning, Cloud Computing, Developer Tools.