xAI has quietly rolled out a significant upgrade to its Grok 4 Fast models, expanding the context window to an impressive 2 million tokens. The enhancement transforms what was already a cost-efficient reasoning engine into a formidable powerhouse. This move allows Grok 4 Fast to ingest entire codebases, voluminous documents, or extended multi-turn conversations without the typical reliance on retrieval-augmented generation (RAG) pipelines.
Initially launched in September as a leaner alternative to the flagship Grok 4, the Fast variants now boast a unified architecture honed through end-to-end reinforcement learning. This enables seamless tool integration for web searches, code execution, and multimodal analysis, positioning it as a versatile option for developers.
At its core, Grok 4 Fast delivers frontier-level performance with remarkable frugality. Input tokens start at just $0.20 per million, scaling to $0.40 beyond 128K, while outputs range from $0.50, $1.00. This pricing significantly undercuts heavier rivals like OpenAI's GPT-5 or Anthropic's offerings. Generous rate limits and cached prompts further slash costs, making high-volume deployments viable for startups and enterprises alike. Tool invocations, including advanced agentic features, remain free until November 21, 2025, sweetening the deal for developers building autonomous agents. This pricing, paired with a 40% reduction in thinking tokens compared to Grok 4, places it atop Artificial Analysis' efficiency rankings.
