Researchers at Sakana AI have developed a groundbreaking method that treats AI development like biological evolution, allowing specialized models to compete for resources, select mates based on complementary strengths, and produce increasingly capable offspring. The approach, detailed in a paper presented at GECCO'25 where it was runner-up for best paper, fundamentally challenges the industry's focus on building ever-larger monolithic AI systems.
The Biological Inspiration Behind M2N2
The research introduces M2N2 (Model Merging of Natural Niches), which applies three key evolutionary principles to AI development:
Resource Competition for Specialization: Just as animals compete for limited food sources and develop specialized survival strategies, M2N2 forces AI models to compete for limited training data points. Models that can excel on data points where others struggle gain fitness advantages, naturally promoting specialization and diversity within the population.
Intelligent Mate Selection: In nature, reproduction is expensive, so animals invest heavily in choosing compatible partners. M2N2 introduces an "attraction" mechanism that pairs models with complementary strengths—choosing partners that perform well where the other is weak. This dramatically improves the efficiency of the computationally expensive model merging process.
Dynamic Genetic Boundaries: Unlike traditional model merging that requires manually defining how to split model parameters (like always combining entire layers), M2N2 evolves flexible "split-points" that can divide parameters at any location. This is analogous to genetic recombination, where DNA segments of variable length can be exchanged between chromosomes.
Technical Innovation: Moving Beyond Fixed Boundaries
Previous model merging methods suffered from a critical limitation: researchers had to manually group model parameters into fixed sets before merging, severely restricting the search space for potential combinations. M2N2 eliminates this constraint through its evolutionary approach.
The system maintains an evolving archive of models and uses the following merging formula: