The frontier of AI research is moving beyond human intervention. A new project on GitHub, dubbed 'autoresearch', details an approach where autonomous AI agents conduct LLM training experiments overnight. This system allows agents to modify code, train models for brief periods, and iterate based on performance improvements.
The core idea is to hand over the reins of a simplified LLM training setup, based on nanochat, to AI agents. Instead of researchers manually tweaking Python files, they program the agents via Markdown files. These agents then autonomously experiment with architecture, hyperparameters, and optimizers within a strict 5-minute time budget.
