OpenAI has launched GPT-5.1-Codex-Max, a new frontier agentic coding model that fundamentally changes the scope of what AI can handle in software development. Available immediately in the Codex environment, this isn't just a faster autocomplete tool; it's an agent built for marathon coding sessions, capable of sustaining complex work for hours, even days, at a time.
The core breakthrough enabling this long-haul capability is a process OpenAI calls "compaction." GPT-5.1-Codex-Max is the first model natively trained to operate across multiple context windows, coherently managing millions of tokens within a single task. When the session approaches its context limit, the model automatically prunes its history while preserving the most critical context, effectively giving it a fresh memory slate without losing progress.
This technical leap unlocks capabilities previously impossible for context-limited models, such as project-scale refactors, deep debugging sessions, and multi-hour agent loops. OpenAI claims internal evaluations have shown Codex Max working independently on tasks for more than 24 hours, persistently iterating on implementations and fixing test failures until a successful result is delivered.
