#Reasoning

12 articles with this tag

New Models Tackle Reasoning Puzzles with Symmetry
AI Research

New Models Tackle Reasoning Puzzles with Symmetry

New Symbol-Equivariant Recurrent Reasoning Models (SE-RRMs) offer improved performance and generalization on reasoning tasks like Sudoku and ARC-AGI by explicitly encoding symmetry.

10 days ago
Recursive LLMs Tackle Long-Horizon Reasoning
AI Research

Recursive LLMs Tackle Long-Horizon Reasoning

New research introduces recursive language models to overcome context limitations, showing significant improvements on long-horizon reasoning tasks like Boolean satisfiability.

10 days ago
LLMs Lost in Transmission: Why Global Reasoning Fails
AI Research

LLMs Lost in Transmission: Why Global Reasoning Fails

A new paper reveals transformer LLMs struggle with complex global reasoning due to limited 'effective bandwidth,' solvable by Chain of Thought.

18 days ago
Uniqueness-Aware RL stops LLMs from getting lazy
AI Research

Uniqueness-Aware RL stops LLMs from getting lazy

Uniqueness-Aware RL prevents LLMs from converging on a single solution path by explicitly rewarding correct answers that employ rare problem-solving strategies.

about 2 months ago
Google Gemini 3 Redefines AI Reasoning and Efficiency
AI Research

Google Gemini 3 Redefines AI Reasoning and Efficiency

3 months ago
Google Gemini 3 Redefines Frontier AI Capabilities
AI Research

Google Gemini 3 Redefines Frontier AI Capabilities

3 months ago
DeepSeek V3.2 Release: Agent Focus Hits GPT-5 Level
AI Research

DeepSeek V3.2 Release: Agent Focus Hits GPT-5 Level

\n The DeepSeek V3.2 release signals a significant push in the open-source LLM race, not just chasing raw benchmark scores but specifically targeting agentic ca...

3 months ago
DeepSeek V3.2 Release: Agent Focus Hits GPT-5 Level
AI Research

DeepSeek V3.2 Release: Agent Focus Hits GPT-5 Level

\n The DeepSeek V3.2 release signals a significant push in the open-source LLM race, not just chasing raw benchmark scores but specifically targeting agentic ca...

3 months ago
Claude Opus 4.5 Arrives, Dominating Code and Agents
AI Research

Claude Opus 4.5 Arrives, Dominating Code and Agents

\n Anthropic just dropped Claude Opus 4.5, and the initial data suggests this isn\'t just an incremental update.

4 months ago
Claude Opus 4.5 Arrives, Dominating Code and Agents
AI Research

Claude Opus 4.5 Arrives, Dominating Code and Agents

\n Anthropic just dropped Claude Opus 4.5, and the initial data suggests this isn\'t just an incremental update.

4 months ago
Jeremy Berman’s Evolutionary Leap: Natural Language for ARC-AGI-2
AI Video

Jeremy Berman’s Evolutionary Leap: Natural Language for ARC-AGI-2

6 months ago
Unpacking AI's Invisible Rules: A Frog's Perspective
AI Video

Unpacking AI's Invisible Rules: A Frog's Perspective

6 months ago