Meta is once again betting big on artificial intelligence, unveiling Muse Spark, its latest foundation model. This launch follows a period of significant internal shifts and external scrutiny, including a talent exodus and a pivot away from its previous metaverse ambitions.
The company positions Muse Spark as a crucial step towards what it terms 'personal superintelligence,' a concept previously outlined by executives. This new model is designed to be natively multimodal, capable of understanding and processing various forms of data, including images.
Muse Spark: Capabilities and Ambitions
Muse Spark supports tool-use, visual chain-of-thought reasoning, and multi-agent orchestration. Meta claims this represents a ground-up overhaul of its AI efforts, with strategic investments in research, training, and infrastructure like the Hyperion data center.
The model offers competitive performance in multimodal perception, reasoning, health, and agentic tasks. Meta acknowledges areas for improvement, such as long-horizon agentic systems and coding.

