Multimodal Large Language Models (MLLMs) are now powering robotic navigation systems, and they’re compact enough to run at 10 frames per second on the edge. Vayu Robotics is building one to power autonomous delivery robots, with plans to expand its use across the full gamut of autonomous robots and vehicles.
The advent of LLMs brought a slew of use cases for enterprise and media-related tasks. Vayu Robotics has combined them with sensor arrays, synthetic data, and cutting-edge techniques akin to retrieval-augmented generation (RAG) to fashion a high-powered operating system for robotic perception, reasoning, and navigation.
