Cumulus Labs

Building a hardware-native inference engine for AI.

2022 · Active
About

Cumulus Labs is developing Ion, a C++ inference engine with custom CUDA kernels designed specifically for NVIDIA GH200 hardware. The company focuses on optimizing AI inference performance, targeting high token-generation speeds and significantly shorter cold-start times.

Technology stack

Detected 2026-06-30
Email: unknown

Frequently asked

What does Cumulus Labs do?

Cumulus Labs is developing Ion, a C++ inference engine with custom CUDA kernels designed specifically for NVIDIA GH200 hardware. The company focuses on optimizing AI inference performance, targeting high token-generation speeds and significantly shorter cold-start times.

How much funding has Cumulus Labs raised?

Cumulus Labs has raised $63M in total funding. Its most recent round on record is a Series A.

Where is Cumulus Labs headquartered?

Cumulus Labs is headquartered in San Francisco, California, USA.

When was Cumulus Labs founded?

Cumulus Labs was founded in 2022.

What industry does Cumulus Labs operate in?

Cumulus Labs operates in the following sectors: Artificial Intelligence, Cloud Computing, Hardware, AI Hardware, AI Chips, and AI Inference Engines.