Current autonomous agentic systems suffer from a critical inflexibility: once deployed, they remain static, unable to learn from user interactions or fix recurring failures without manual intervention. Existing self-evolving agents, while a step forward, are limited to text-mutable artifacts like prompts and skill files, leaving the core agent harness untouched. This prevents the system from addressing structural failures embedded in code, such as routing or hook ordering. The researchers propose that true self-evolution requires source-level adaptation, a fundamentally more general and robust approach.
Related startups
Beyond Textual Artifacts: The Power of Source-Level Adaptation
This paper introduces MOSS, a system designed for self-rewriting at the source code level within production agentic substrates. Unlike previous methods confined to text-based configurations, MOSS operates on the actual code. This approach is inherently more powerful because source-level adaptation is Turing-complete, a strict superset of any text-mutable scope. It ensures deterministic evolution, unaffected by the long-context drift that can plague base-model compliance, and addresses a class of failures previously inaccessible through text modifications alone.
MOSS: A Deterministic Pipeline for Agentic Evolution
MOSS implements a structured, multi-stage pipeline for its self-rewriting process. Each evolution cycle is triggered by an automatically curated batch of production failure evidence. While a pluggable external coding-agent CLI handles the actual code modification, MOSS maintains control over stage ordering and final verdicts. Candidate code is rigorously verified by replaying the failure batch in ephemeral trial workers. Successful candidates are then deployed through a user-consent-gated, in-place container swap, with a health-probe-gated rollback mechanism ensuring system stability. This meticulous process underpins the reliability of MOSS self-rewriting agents.