#AI Research

50 articles with this tag

US Lifts Export Curbs on Anthropic's Fable 5 AI Model
Artificial Intelligence

US Lifts Export Curbs on Anthropic's Fable 5 AI Model

US lifts export restrictions on Anthropic's Fable 5 AI model, signaling a shift in balancing AI innovation with national security.

about 19 hours ago
Diffusion Research: Drug Discovery Outshines Image Generation
AI Research

Diffusion Research: Drug Discovery Outshines Image Generation

AI's diffusion model research is making bigger waves in drug discovery than image generation, tackling complex molecular interactions with physics-informed AI.

1 day ago
Rachel Nabors: Local AI Models for Frontier Results
Artificial Intelligence

Rachel Nabors: Local AI Models for Frontier Results

Rachel Nabors advocates for using smaller, on-device AI models, showcasing their efficiency, cost savings, and performance benefits over large frontier models.

3 days ago
Meta's Nishant Gupta on Deterministic AI Infrastructure
AI Research

Meta's Nishant Gupta on Deterministic AI Infrastructure

Nishant Gupta from Meta discusses the critical need for deterministic infrastructure to reliably run non-deterministic AI agents, highlighting the shift from model-centric to systems-centric development.

3 days ago
Orbis's Luis Romero-Sevilla on Extended Cache Augmented Generation
Artificial Intelligence

Orbis's Luis Romero-Sevilla on Extended Cache Augmented Generation

Luis Romero-Sevilla of Orbis Operations explains Extended Cache Augmented Generation (ECAG), a method to improve AI accuracy by dynamically updating its knowledge base.

4 days ago
TurboQuant: Supercharging AI Agent Retrieval with Compression
Artificial Intelligence

TurboQuant: Supercharging AI Agent Retrieval with Compression

Shashi Jagtap of Superagentic AI introduces TurboQuant, a method to compress AI agent memory and embeddings, reducing usage by 5-8x with no quality loss.

4 days ago
AI Agents: The "User Signal Dies" Problem
Artificial Intelligence

AI Agents: The "User Signal Dies" Problem

Sonam Pankaj of StarlightSearch discusses the critical "retrieval boundary" problem in AI agents and introduces agentRTX, a novel runtime learning layer designed to improve agent performance.

4 days ago
Agents Building Agents: Nearform's AI Approach
Artificial Intelligence

Agents Building Agents: Nearform's AI Approach

Alfonso Graziano from Nearform explores how AI agents can build and improve other AI agents, detailing the 'Harness Engineering' methodology for reliable AI development.

4 days ago
Benchmarks Fail Modern AI, Says OpenAI Scientist
AI Research

Benchmarks Fail Modern AI, Says OpenAI Scientist

OpenAI's Noam Brown discusses why traditional benchmarks fail modern AI, emphasizing the need for new evaluation methods that account for computational budgets and model capabilities.

6 days ago
OpenAI's Mark Chen on AGI, Scaling Laws, and Evals
AI Research

OpenAI's Mark Chen on AGI, Scaling Laws, and Evals

OpenAI's Chief of Research, Mark Chen, shares insights on the path to AGI, the impact of scaling laws, and the importance of robust evaluations for AI safety.

7 days ago
Google Loses Top AI Talent to Anthropic
Artificial Intelligence

Google Loses Top AI Talent to Anthropic

Google is losing top AI talent to rival Anthropic, with two key researchers joining the AI safety company.

7 days ago
Meta's Nishant Gupta on Evaluating Agentic AI Systems
AI Research

Meta's Nishant Gupta on Evaluating Agentic AI Systems

Nishant Gupta from Meta's Superintelligence Labs discusses the shift from accuracy-based evaluation to reliability-focused methods for agentic AI systems.

7 days ago
OpenData Pipeline Elevates Agentic AI
AI Research

OpenData Pipeline Elevates Agentic AI

The OpenThoughts-Agent project introduces an open data pipeline that significantly enhances generalization for agentic language models, outperforming existing benchmarks.

8 days ago
AI Models Storms with Unprecedented Accuracy
AI Research

AI Models Storms with Unprecedented Accuracy

AI models are achieving surprising accuracy in predicting mega storms, outperforming traditional methods and offering crucial insights into future weather patterns.

8 days ago
Engram's AI: Memory and Continual Learning
AI Research

Engram's AI: Memory and Continual Learning

Engram's Dan Biderman and Jessy Lin discuss the critical role of memory and continual learning in AI, aiming to overcome catastrophic forgetting.

8 days ago
GPT-5 Pro Solves Immunological Puzzle
Artificial Intelligence

GPT-5 Pro Solves Immunological Puzzle

OpenAI's GPT-5 Pro helped immunologist Derya Unutmaz solve a three-year-old mystery about T-cell specialization, showcasing AI's potential in scientific research.

9 days ago
FlashRT: Execution State for Latency-First AI
AI Research

FlashRT: Execution State for Latency-First AI

FlashRT revolutionizes on-device AI serving with execution-state capsules, enabling sub-millisecond state restoration and significant TTFT speedups for latency-critical applications.

13 days ago
PDE Solutions Get Analytical
AI Research

PDE Solutions Get Analytical

Agentic Symbolic Search (ASYS) automates the discovery of analytical forms for PDE solutions, bridging computation and mathematical insight.

13 days ago
Hybrid AI Models Get Orthogonal
AI Research

Hybrid AI Models Get Orthogonal

OrthoReg, a novel regularization method, ensures clear separation between symbolic and neural components in hybrid dynamical systems, boosting interpretability and generalization.

14 days ago
Autonomous Agents Streamline Data Integration
AI Research

Autonomous Agents Streamline Data Integration

Data Intelligence Agents (DIA) system revolutionizes data integration by using autonomous coding agents to generate, execute, and validate concrete artifacts, achieving state-of-the-art results.

14 days ago
OneCanvas: Unified 3D Scene Representation
AI Research

OneCanvas: Unified 3D Scene Representation

OneCanvas revolutionizes 3D scene understanding in VLMs by projecting multi-view features onto a unified equirectangular canvas, enabling efficient situated reasoning and SOTA performance.

14 days ago
LoopWM: A New Scaling Axis for World Models
AI Research

LoopWM: A New Scaling Axis for World Models

Looped World Models (LoopWM) redefine world simulation with iterative refinement, achieving 100x parameter efficiency and establishing latent depth as a new scaling axis.

15 days ago
WEQA: Bridging LLMs and Wearable Health Data
AI Research

WEQA: Bridging LLMs and Wearable Health Data

WEQA, a novel agent framework, unifies LLM reasoning with specialized tools for wearable health data, achieving 24% higher accuracy and expert-validated clinical soundness.

15 days ago
Anthropic Lands in Seoul
Artificial Intelligence

Anthropic Lands in Seoul

Anthropic opens its Seoul office and announces new partnerships, expanding Claude's reach across Korean enterprises, startups, and research institutions.

15 days ago
OpenAI Simulates AI Deployments
Artificial Intelligence

OpenAI Simulates AI Deployments

OpenAI's new deployment simulation technique replays past conversations with candidate models to predict real-world behavior and mitigate risks before release.

16 days ago
Phase Dominance in AI Image Recognition
AI Research

Phase Dominance in AI Image Recognition

AI image classifiers exhibit a striking phase dominance for identity encoding, mirroring human vision principles, with architectural differences shaping its expression.

16 days ago
TokenPilot: Reining in LLM Context Costs
AI Research

TokenPilot: Reining in LLM Context Costs

TokenPilot offers a dual-granularity context management framework, slashing LLM inference costs by up to 87% while preserving performance.

16 days ago
ActiveSAM: Efficient Open-Vocabulary Segmentation
AI Research

ActiveSAM: Efficient Open-Vocabulary Segmentation

ActiveSAM revolutionizes open-vocabulary semantic segmentation with a training-free framework that dynamically identifies relevant classes, boosting speed and accuracy while enhancing robustness for real-world AI.

16 days ago
Tejal Patwardhan: Stop Underestimating AI Models
Artificial Intelligence

Tejal Patwardhan: Stop Underestimating AI Models

Tejal Patwardhan of OpenAI discusses the evolution of AI evaluation, the concept of 'capability overhang,' and the need for realistic, real-world benchmarks.

16 days ago
Nvidia's Ziv Ilan on Faster Diffusion Models
AI Research

Nvidia's Ziv Ilan on Faster Diffusion Models

Nvidia's Ziv Ilan explains how to reduce diffusion model latency using quantization, caching, and distillation, plus the new FastGen library.

16 days ago
Yann LeCun Left Meta to Put $1.03B Behind His LLM Critique
AI Figures

Yann LeCun Left Meta to Put $1.03B Behind His LLM Critique

Yann LeCun's position on LLMs hasn't changed since 2022 — but his stakes have. He left Meta in November 2025 after 12 years and, by March 2026, AMI Labs had raised $1.03 billion to build the world-model alternative at a $3.5 billion valuation.

16 days ago
Wayfair CTO Fiona Tan on Scaling AI for Catalog Enrichment
Artificial Intelligence

Wayfair CTO Fiona Tan on Scaling AI for Catalog Enrichment

Wayfair's CTO Fiona Tan discusses how the company uses advanced AI, similar to GPT-5.5, to enrich its massive product catalog.

17 days ago
Sakana AI launches autonomous research tool
Technology

Sakana AI launches autonomous research tool

Sakana AI launches Sakana Marlin, an autonomous research assistant that completes complex strategy work in hours, leveraging proprietary AI technologies.

17 days ago
Compute Once: Unlocking AI Agent Efficiency
AI Research

Compute Once: Unlocking AI Agent Efficiency

A radical proposal to precompute LLM KV caches, slashing inference costs by up to 50x and enabling a new compute-efficient AI agent paradigm.

19 days ago
HYDRA-X: Unifying Image & Video Tokenization
AI Research

HYDRA-X: Unifying Image & Video Tokenization

HYDRA-X, a novel Vision Transformer-based UMM, unifies image and video tokenization, enhancing editing consistency and performance through causal attention and latent-level manipulation.

19 days ago
Humanoids Learn Self-Other Distinction
AI Research

Humanoids Learn Self-Other Distinction

Humanoid robots now learn self-other distinction and build predictive self-models from sensory data, enabling better collaboration and task performance in human-robot environments.

19 days ago
Unlocking Ultra-Long Context for LLMs
AI Research

Unlocking Ultra-Long Context for LLMs

MiniMax Sparse Attention breaks the context window barrier for LLMs, enabling millions of tokens with significant compute reduction and practical speedups.

20 days ago
Mana Reimagines Dexterous Robotics
AI Research

Mana Reimagines Dexterous Robotics

Mana framework reinterprets dexterous robotics as animation, achieving zero-shot sim-to-real transfer for articulated tool manipulation.

20 days ago
From LLM Agents to Scientific Knowledge Graphs
AI Research

From LLM Agents to Scientific Knowledge Graphs

Agents-K1 revolutionizes LLM research agents by creating agent-native scientific knowledge graphs from full papers, enabling deeper scientific reasoning.

20 days ago
5 AI Research Papers Shaping AI's Future
AI Research

5 AI Research Papers Shaping AI's Future

Discover five key AI research papers that reveal the current trajectory and future directions of artificial intelligence development.

20 days ago
Rethinking VLM Token Reduction
AI Research

Rethinking VLM Token Reduction

Reroute transforms VLM token reduction from irreversible pruning to recoverable routing, improving grounding performance without sacrificing efficiency.

21 days ago
Automating Scientific Discovery
AI Research

Automating Scientific Discovery

ATLAS, an active learning framework, automates the discovery of interpretable mechanistic models, achieving 5-10x sample efficiency gains.

21 days ago
VLA Models Unlock Decentralized Multi-Robot Teams
AI Research

VLA Models Unlock Decentralized Multi-Robot Teams

CHORUS leverages pretrained VLA models for decentralized multi-robot collaboration, achieving significant performance gains without inference-time communication.

21 days ago
Codex Aids Black Hole Simulation Breakthrough
AI Research

Codex Aids Black Hole Simulation Breakthrough

This video explores how the AI model Codex is revolutionizing the creation of black hole simulations, making previously intractable problems computationally feasible and accelerating astrophysical research.

21 days ago
DeepMind's Kilpatrick on AI Models Eating Harnesses
AI Research

DeepMind's Kilpatrick on AI Models Eating Harnesses

Google DeepMind's Logan Kilpatrick delves into the AI concept of models "eating the harness," explaining how over-specialization hinders generalization and what can be done to prevent it.

21 days ago
Causal Inference's Counterfactual Blind Spot
AI Research

Causal Inference's Counterfactual Blind Spot

Predictive AI models fail on counterfactual couplings. A new world model using semidefinite kernels offers a solution for robust causal inference.

22 days ago
Steering LRMs Beyond Output Degradation
AI Research

Steering LRMs Beyond Output Degradation

A new probe-based method, FPCG, distinguishes prediction from detection features to enable precise large reasoning models steering with minimal output quality degradation.

22 days ago
LLMs Accelerate FPGA Design
AI Research

LLMs Accelerate FPGA Design

LLMs are now automating complex FPGA accelerator design, reducing time and expertise needed for efficient AI hardware deployment.

22 days ago
Yann LeCun on World Models and the AI Revolution
Artificial Intelligence

Yann LeCun on World Models and the AI Revolution

AI pioneer Yann LeCun discusses how 'World Models' are key to the next AI revolution, emphasizing prediction, planning, and learning from real-world data.

22 days ago
Topology-Aware Operator Learning
AI Research

Topology-Aware Operator Learning

Topological Neural Operators (TNOs) provide a unified framework for operator learning on cell complexes, improving PDE benchmark accuracy by integrating topological structures.

23 days ago