#AI Research
50 articles with this tag

US Lifts Export Curbs on Anthropic's Fable 5 AI Model
US lifts export restrictions on Anthropic's Fable 5 AI model, signaling a shift in balancing AI innovation with national security.

Diffusion Research: Drug Discovery Outshines Image Generation
AI's diffusion model research is making bigger waves in drug discovery than image generation, tackling complex molecular interactions with physics-informed AI.

Rachel Nabors: Local AI Models for Frontier Results
Rachel Nabors advocates for using smaller, on-device AI models, showcasing their efficiency, cost savings, and performance benefits over large frontier models.

Meta's Nishant Gupta on Deterministic AI Infrastructure
Nishant Gupta from Meta discusses the critical need for deterministic infrastructure to reliably run non-deterministic AI agents, highlighting the shift from model-centric to systems-centric development.

Orbis's Luis Romero-Sevilla on Extended Cache Augmented Generation
Luis Romero-Sevilla of Orbis Operations explains Extended Cache Augmented Generation (ECAG), a method to improve AI accuracy by dynamically updating its knowledge base.

TurboQuant: Supercharging AI Agent Retrieval with Compression
Shashi Jagtap of Superagentic AI introduces TurboQuant, a method to compress AI agent memory and embeddings, reducing usage by 5-8x with no quality loss.

AI Agents: The "User Signal Dies" Problem
Sonam Pankaj of StarlightSearch discusses the critical "retrieval boundary" problem in AI agents and introduces agentRTX, a novel runtime learning layer designed to improve agent performance.

Agents Building Agents: Nearform's AI Approach
Alfonso Graziano from Nearform explores how AI agents can build and improve other AI agents, detailing the 'Harness Engineering' methodology for reliable AI development.

Benchmarks Fail Modern AI, Says OpenAI Scientist
OpenAI's Noam Brown discusses why traditional benchmarks fail modern AI, emphasizing the need for new evaluation methods that account for computational budgets and model capabilities.

OpenAI's Mark Chen on AGI, Scaling Laws, and Evals
OpenAI's Chief of Research, Mark Chen, shares insights on the path to AGI, the impact of scaling laws, and the importance of robust evaluations for AI safety.

Google Loses Top AI Talent to Anthropic
Google is losing top AI talent to rival Anthropic, with two key researchers joining the AI safety company.

Meta's Nishant Gupta on Evaluating Agentic AI Systems
Nishant Gupta from Meta's Superintelligence Labs discusses the shift from accuracy-based evaluation to reliability-focused methods for agentic AI systems.
OpenData Pipeline Elevates Agentic AI
The OpenThoughts-Agent project introduces an open data pipeline that significantly enhances generalization for agentic language models, outperforming existing benchmarks.

AI Models Storms with Unprecedented Accuracy
AI models are achieving surprising accuracy in predicting mega storms, outperforming traditional methods and offering crucial insights into future weather patterns.

Engram's AI: Memory and Continual Learning
Engram's Dan Biderman and Jessy Lin discuss the critical role of memory and continual learning in AI, aiming to overcome catastrophic forgetting.
GPT-5 Pro Solves Immunological Puzzle
OpenAI's GPT-5 Pro helped immunologist Derya Unutmaz solve a three-year-old mystery about T-cell specialization, showcasing AI's potential in scientific research.
FlashRT: Execution State for Latency-First AI
FlashRT revolutionizes on-device AI serving with execution-state capsules, enabling sub-millisecond state restoration and significant TTFT speedups for latency-critical applications.
PDE Solutions Get Analytical
Agentic Symbolic Search (ASYS) automates the discovery of analytical forms for PDE solutions, bridging computation and mathematical insight.
Hybrid AI Models Get Orthogonal
OrthoReg, a novel regularization method, ensures clear separation between symbolic and neural components in hybrid dynamical systems, boosting interpretability and generalization.
Autonomous Agents Streamline Data Integration
Data Intelligence Agents (DIA) system revolutionizes data integration by using autonomous coding agents to generate, execute, and validate concrete artifacts, achieving state-of-the-art results.
OneCanvas: Unified 3D Scene Representation
OneCanvas revolutionizes 3D scene understanding in VLMs by projecting multi-view features onto a unified equirectangular canvas, enabling efficient situated reasoning and SOTA performance.
LoopWM: A New Scaling Axis for World Models
Looped World Models (LoopWM) redefine world simulation with iterative refinement, achieving 100x parameter efficiency and establishing latent depth as a new scaling axis.
WEQA: Bridging LLMs and Wearable Health Data
WEQA, a novel agent framework, unifies LLM reasoning with specialized tools for wearable health data, achieving 24% higher accuracy and expert-validated clinical soundness.

Anthropic Lands in Seoul
Anthropic opens its Seoul office and announces new partnerships, expanding Claude's reach across Korean enterprises, startups, and research institutions.
OpenAI Simulates AI Deployments
OpenAI's new deployment simulation technique replays past conversations with candidate models to predict real-world behavior and mitigate risks before release.
Phase Dominance in AI Image Recognition
AI image classifiers exhibit a striking phase dominance for identity encoding, mirroring human vision principles, with architectural differences shaping its expression.
TokenPilot: Reining in LLM Context Costs
TokenPilot offers a dual-granularity context management framework, slashing LLM inference costs by up to 87% while preserving performance.
ActiveSAM: Efficient Open-Vocabulary Segmentation
ActiveSAM revolutionizes open-vocabulary semantic segmentation with a training-free framework that dynamically identifies relevant classes, boosting speed and accuracy while enhancing robustness for real-world AI.

Tejal Patwardhan: Stop Underestimating AI Models
Tejal Patwardhan of OpenAI discusses the evolution of AI evaluation, the concept of 'capability overhang,' and the need for realistic, real-world benchmarks.

Nvidia's Ziv Ilan on Faster Diffusion Models
Nvidia's Ziv Ilan explains how to reduce diffusion model latency using quantization, caching, and distillation, plus the new FastGen library.

Yann LeCun Left Meta to Put $1.03B Behind His LLM Critique
Yann LeCun's position on LLMs hasn't changed since 2022 — but his stakes have. He left Meta in November 2025 after 12 years and, by March 2026, AMI Labs had raised $1.03 billion to build the world-model alternative at a $3.5 billion valuation.

Wayfair CTO Fiona Tan on Scaling AI for Catalog Enrichment
Wayfair's CTO Fiona Tan discusses how the company uses advanced AI, similar to GPT-5.5, to enrich its massive product catalog.

Sakana AI launches autonomous research tool
Sakana AI launches Sakana Marlin, an autonomous research assistant that completes complex strategy work in hours, leveraging proprietary AI technologies.
Compute Once: Unlocking AI Agent Efficiency
A radical proposal to precompute LLM KV caches, slashing inference costs by up to 50x and enabling a new compute-efficient AI agent paradigm.
HYDRA-X: Unifying Image & Video Tokenization
HYDRA-X, a novel Vision Transformer-based UMM, unifies image and video tokenization, enhancing editing consistency and performance through causal attention and latent-level manipulation.
Humanoids Learn Self-Other Distinction
Humanoid robots now learn self-other distinction and build predictive self-models from sensory data, enabling better collaboration and task performance in human-robot environments.
Unlocking Ultra-Long Context for LLMs
MiniMax Sparse Attention breaks the context window barrier for LLMs, enabling millions of tokens with significant compute reduction and practical speedups.
Mana Reimagines Dexterous Robotics
Mana framework reinterprets dexterous robotics as animation, achieving zero-shot sim-to-real transfer for articulated tool manipulation.
From LLM Agents to Scientific Knowledge Graphs
Agents-K1 revolutionizes LLM research agents by creating agent-native scientific knowledge graphs from full papers, enabling deeper scientific reasoning.

5 AI Research Papers Shaping AI's Future
Discover five key AI research papers that reveal the current trajectory and future directions of artificial intelligence development.
Rethinking VLM Token Reduction
Reroute transforms VLM token reduction from irreversible pruning to recoverable routing, improving grounding performance without sacrificing efficiency.
Automating Scientific Discovery
ATLAS, an active learning framework, automates the discovery of interpretable mechanistic models, achieving 5-10x sample efficiency gains.
VLA Models Unlock Decentralized Multi-Robot Teams
CHORUS leverages pretrained VLA models for decentralized multi-robot collaboration, achieving significant performance gains without inference-time communication.

Codex Aids Black Hole Simulation Breakthrough
This video explores how the AI model Codex is revolutionizing the creation of black hole simulations, making previously intractable problems computationally feasible and accelerating astrophysical research.

DeepMind's Kilpatrick on AI Models Eating Harnesses
Google DeepMind's Logan Kilpatrick delves into the AI concept of models "eating the harness," explaining how over-specialization hinders generalization and what can be done to prevent it.
Causal Inference's Counterfactual Blind Spot
Predictive AI models fail on counterfactual couplings. A new world model using semidefinite kernels offers a solution for robust causal inference.
Steering LRMs Beyond Output Degradation
A new probe-based method, FPCG, distinguishes prediction from detection features to enable precise large reasoning models steering with minimal output quality degradation.
LLMs Accelerate FPGA Design
LLMs are now automating complex FPGA accelerator design, reducing time and expertise needed for efficient AI hardware deployment.

Yann LeCun on World Models and the AI Revolution
AI pioneer Yann LeCun discusses how 'World Models' are key to the next AI revolution, emphasizing prediction, planning, and learning from real-world data.
Topology-Aware Operator Learning
Topological Neural Operators (TNOs) provide a unified framework for operator learning on cell complexes, improving PDE benchmark accuracy by integrating topological structures.