MOSS: Source-Level Self-Rewriting for Agents

Current autonomous agentic systems are static once deployed: they cannot learn from user interactions or fix recurring failures without manual intervention. Existing self-evolving agents are a step forward, but they are limited to text-mutable artifacts such as prompts and skill files, leaving the core agent harness untouched. As a result, they cannot address structural failures embedded in code, such as routing or hook ordering. The researchers argue that self-evolution therefore requires source-level adaptation, meaning the ability to rewrite the harness code itself.

Visual TL;DR. Static Agent Inflexibility leads to Text-Based Evolution Limits. Text-Based Evolution Limits are addressed by the MOSS System. The MOSS System uses Source-Level Adaptation. Source-Level Adaptation enables Turing-Complete Power and results in a Performance Leap. The MOSS System enables Overcoming Limitations.

Static Agent Inflexibility: deployed agents cannot learn or fix failures without manual intervention
Text-Based Evolution Limits: only text artifacts like prompts and skills can be evolved
MOSS System: enables source-level self-rewriting for AI agents
Source-Level Adaptation: operates on the actual agent code, not just text configurations
Turing-Complete Power: fundamentally more general and robust approach to agent evolution
Performance Leap: significant gains demonstrated in real-world scenarios
Overcoming Limitations: addresses structural failures in core agent harness code

Beyond Textual Artifacts: The Power of Source-Level Adaptation

This paper introduces MOSS, a system for self-rewriting at the source-code level within production agentic substrates. Unlike previous methods confined to text-based artifacts, MOSS modifies the agent's actual code. Three properties make this more powerful. Source-level adaptation is Turing-complete and a strict superset of any text-mutable scope. Its changes take effect deterministically, because they run as code rather than depending on base-model compliance, which can drift over long contexts. And it reaches a class of failures that text modifications alone cannot address.
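To make the "failures text cannot reach" point concrete, here is a minimal sketch of a hook-ordering bug. Everything in it is hypothetical (the `redact` and `make_logger` hooks and the `run_hooks` loop are illustrative names, not MOSS or OpenClaw code). It assumes a harness that runs hooks in an order fixed in source, so a message is logged before it is redacted; no prompt or skill-file change can alter that order, while a one-line source edit can.

```python
from typing import Callable, List

# Hypothetical hook type: takes a message, returns the (possibly modified) message.
Hook = Callable[[str], str]


def redact(msg: str) -> str:
    """Hypothetical hook that masks a secret marker."""
    return msg.replace("SECRET", "[redacted]")


def make_logger(sink: List[str]) -> Hook:
    """Build a hypothetical logging hook that records each message it sees."""
    def log(msg: str) -> str:
        sink.append(msg)
        return msg
    return log


def run_hooks(msg: str, hooks: List[Hook]) -> str:
    """Apply hooks in list order; the order lives in code, not in a prompt."""
    for hook in hooks:
        msg = hook(msg)
    return msg


# Structural bug: logging runs before redaction, so the secret reaches the log.
sink_before: List[str] = []
run_hooks("token=SECRET", [make_logger(sink_before), redact])

# Source-level fix: reorder the hooks in code.
sink_after: List[str] = []
run_hooks("token=SECRET", [redact, make_logger(sink_after)])
```

In this toy setup the model's output is identical in both runs; only the harness code differs, which is the kind of change a text-only evolution scope cannot express.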

MOSS: A Deterministic Pipeline for Agentic Evolution

MOSS implements a structured, multi-stage pipeline for self-rewriting. Each evolution cycle is triggered by an automatically curated batch of production failure evidence. A pluggable external coding-agent CLI performs the actual code modification, while MOSS retains control over stage ordering and final verdicts. Candidate code is verified by replaying the failure batch in ephemeral trial workers. Candidates that pass are deployed through a user-consent-gated, in-place container swap, with a health-probe-gated rollback mechanism protecting system stability.
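The stage ordering described above can be sketched as a small orchestrator. This is an illustration under stated assumptions, not the MOSS implementation: every name (`Failure`, `CycleResult`, `run_cycle`, and the callables passed in) is hypothetical, and the pass criterion (every replayed failure must succeed) is an assumption the summary does not confirm.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Failure:
    """One item of production failure evidence (hypothetical shape)."""
    task_id: str
    transcript: str


@dataclass
class CycleResult:
    stage: str      # last stage the cycle reached
    deployed: bool  # True only if the candidate is live after the cycle


def run_cycle(
    batch: List[Failure],
    propose_patch: Callable[[List[Failure]], str],  # stand-in for the external coding-agent CLI
    replay: Callable[[str, Failure], bool],         # stand-in for an ephemeral trial worker
    user_consents: Callable[[str], bool],
    swap_container: Callable[[str], None],
    health_ok: Callable[[], bool],
    rollback: Callable[[], None],
) -> CycleResult:
    # The orchestrator, not the coding agent, owns stage order and verdicts.
    if not batch:
        return CycleResult("no_evidence", False)

    candidate = propose_patch(batch)

    # Verify by replaying the whole failure batch against the candidate.
    # Assumption: all replays must pass for the candidate to proceed.
    if not all(replay(candidate, failure) for failure in batch):
        return CycleResult("verify_failed", False)

    # Deployment is gated on explicit user consent.
    if not user_consents(candidate):
        return CycleResult("consent_denied", False)

    # In-place swap, followed by a health probe that can trigger rollback.
    swap_container(candidate)
    if not health_ok():
        rollback()
        return CycleResult("rolled_back", False)

    return CycleResult("deployed", True)
```

Passing the coding agent in as a plain callable mirrors the "pluggable" design: the patch generator can be replaced without touching the verification, consent, or rollback gates.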

Demonstrated Performance Leap in Real-World Scenarios

MOSS is evaluated on the OpenClaw platform. In a single, unsupervised evolution cycle, the system lifted the mean grader score across four tasks from a baseline of 0.25 to 0.61. The result illustrates the practical effect of letting agents evolve at the source-code level rather than relying on static deployments or text-only adjustments.
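For scale, the two reported means can be turned into an absolute and a relative gain. The snippet below uses only the figures quoted above; the per-task scores are not given in this summary, so none are assumed, and the variable names are illustrative.

```python
# Mean grader scores reported above (across four tasks).
baseline_mean = 0.25
evolved_mean = 0.61

# Absolute gain in mean score, rounded to avoid floating-point noise.
absolute_gain = round(evolved_mean - baseline_mean, 2)

# Ratio of the evolved mean to the baseline mean.
relative_ratio = round(evolved_mean / baseline_mean, 2)

print(f"absolute gain: {absolute_gain}")   # 0.36
print(f"ratio: {relative_ratio}x")         # 2.44x
```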

AI Daily Digest