Here is a question almost nobody is asking loudly enough: where does the training data come from for the humanoid robot wave everybody is betting on? Figure, Physical Intelligence, 1X, Boston Dynamics — they are all racing to ship general-purpose robots that can cook, clean, and work in unstructured environments. And they all hit the same wall. Their training data is terrible.
Existing robot datasets are almost entirely lab footage. Controlled workspace, controlled task, controlled lighting, controlled everything. Fine for bolting an arm to a factory floor. Useless for a robot that needs to navigate your specific kitchen, adapt to your specific clutter, and handle the thousand small variations that make everyday life chaotic and interesting. The gap between "lab robot" and "home robot" is fundamentally a data gap, and closing it requires data from actual homes, actual people, and actual tasks — at global scale.
Anshul Verma and Lyem Ningthou decided to close it. Their startup, Asimov (YC W26), is building an internet-scale marketplace for robot training data. Their most underrated business move: running a cleaning company on the side to collect organic data while paying workers a real salary.
What They Are Building
Asimov operates on both sides of a data marketplace. On the supply side, individuals worldwide wear a lightweight collection kit — essentially a phone mounted on a headband — and record themselves doing everyday tasks. Cooking. Cleaning. Sorting laundry. Carrying groceries. The contributors get paid. The videos get uploaded to Asimov's pipeline. On the demand side, frontier robotics labs buy access to curated, richly annotated datasets of real human motion in real environments.
The unit of value is not raw video. It is structured, annotated training data: 3D body pose estimates, depth maps, semantic object labels, and activity segmentation — all synchronized and quality-checked. A robotics team at a top lab does not want to spend six months building an annotation pipeline. They want to plug in a dataset and train. Asimov handles everything between "someone wearing a headband in their kitchen" and "ready-to-train tensor."
The cleaning company angle is not a gimmick. Asimov started a managed cleaning service that operates in the Bay Area, currently serving over 100 founders, investors, and students weekly. The cleaners wear collection kits as part of their job. The business covers worker salaries through cleaning fees. Asimov collects densely varied, organic household data — different floorplans, different messes, different objects — without paying separately for data collection. It is a flywheel: the cleaning company generates revenue AND generates data. Two products, one labor cost.
How It Works: The Technical Pipeline
The data collection side is deceptively simple by design. A contributor straps on a lightweight headband rig, opens an app, and presses record. The app guides them through a task protocol — or they just go about their day. Video is captured at egocentric (first-person) perspective, which is exactly the viewpoint a robot needs to learn manipulation tasks. The headband collects synchronized RGB video; higher-end kits add depth sensors for richer spatial data.
