#Large Language Models

50 articles with this tag

AI Societies' Safety Problem
AI Research

Self-evolving AI societies face an impossible trilemma: continuous learning, isolation, and safety alignment cannot all be achieved simultaneously.

about 13 hours ago
Technology

Testing AI Guardrails Across Languages

Researchers tested context-aware AI guardrails across English and Farsi in humanitarian scenarios, finding nuanced performance differences and highlighting the need for language-specific safety evaluations.

10 days ago
AI Coding Tests Flawed by Infrastructure Noise
Artificial Intelligence

The infrastructure powering AI coding tests can significantly inflate or deflate model scores, potentially masking true capabilities and misleading deployment decisions.

10 days ago
Claude Opus 4.6: Smarter, Faster, and Longer Context
Artificial Intelligence

Anthropic's Claude Opus 4.6 launches with a 1M token context window, enhanced coding, and state-of-the-art benchmark performance.

10 days ago
Uniqueness-Aware RL stops LLMs from getting lazy
AI Research

Uniqueness-Aware RL prevents LLMs from converging on a single solution path by explicitly rewarding correct answers that employ rare problem-solving strategies.

30 days ago
AI’s Dual Reality: Safety Theater and the Autonomous Arms Race to AGI
AI Video

about 2 months ago
NeuroDiscoveryBench Sets New Standard for Neuroscience AI Benchmarks
AI Research

2 months ago
A Philosopher's Lens on AI's Evolving Consciousness
AI Video

2 months ago
Anthropic Unveils Advanced APIs for Agentic AI Development
AI Video

2 months ago
Claude.ai: Amplifying Human-AI Collaboration Through Intelligent Context and Customization
AI Video

2 months ago
Claude.ai's Projects Feature Elevates Enterprise AI Interaction
AI Video

2 months ago
GPT-5.1: The Art and Science of Intelligent Personalities
AI Video

2 months ago
Building Cursor Composer – Lee Robinson, Cursor
AI Video

2 months ago
Claude's Research Feature Redefines Information Synthesis for Elite Professionals
AI Video

3 months ago
OpenAI's Future Hinges on Enterprise Adoption and Sustained Funding
AI Video

3 months ago
Meta's AI Investment Pays Off: A Clear Return Amidst the Tech Race
AI Video

3 months ago
How OpenAI Builds for 800 Million Weekly Users: Model Specialization and Fine-Tuning
AI Video

3 months ago
Claude's Agent Skills Unlock Granular AI Expertise
AI Video

3 months ago
Agentic AI Rewrites the Rules for Real-Time Sports Fan Engagement
AI Video

3 months ago
Claude Opus 4.5 Unlocks Advanced Reasoning and Efficiency
AI Video

3 months ago
Context Engineering: The Graph-Powered Evolution of AI Context
AI Video

3 months ago
The Shifting Sands of AI Supremacy: ChatGPT's Lightning Bolt Meets Gemini's Insane Leap
AI Video

3 months ago
Gemini's Ascent: Google's Existential Challenge to OpenAI
AI Video

3 months ago
Anthropic's Opus 4.5: Redefining AI Capabilities and Efficiency
AI Video

3 months ago
Claude Opus 4.5 Delivers Actionable Outputs for Complex Business Tasks
AI Video

3 months ago
Claude Code Redefines Developer Workflows on Desktop
AI Video

3 months ago
Claude Kayak Rumor: Anthropic's Next AI Bet
Startup News

3 months ago
GLM 4.6 Challenges Frontier Models with Open-Source Prowess
AI Video

3 months ago
Claude's Evolution: From Chatbot to Cognitive Collaborator
AI Video

3 months ago
GPT-5's Scientific Revolution: From Niche Proofs to Accelerated Discovery
AI Video

3 months ago
Google's Gemini 3 Dominance Reshapes AI Landscape
AI Video

3 months ago
Reflexivity AI Accelerates Investment Insights for Institutions
AI Video

3 months ago
GPT-5.1: OpenAI’s Leap Towards Human-Centric AI and Enterprise Efficiency
AI Video

3 months ago
vLLM Solves the AI Model Serving Conundrum at Scale
AI Video

3 months ago
AI's Leap into the Physical: Project Fetch's Robot Dog Revelation
AI Video

3 months ago
Model Context Protocol: Streamlining AI Agent Interaction with Cloud Tools
AI Video

3 months ago
Intelligence Is "Less is More": A Fundamental Challenge to LLMs
AI Video

3 months ago
Anthropic's Introspection Paper Hints at AI Self-Awareness
AI Video

4 months ago
Wikipedia Founder Jimmy Wales on AI's Factual Blind Spot
AI Video

4 months ago
Google's Model Armor: The AI Bodyguard Preventing Digital Catastrophes
AI Video

4 months ago
ENEOS Materials Redefines Enterprise AI Adoption with ChatGPT Enterprise
AI Video

4 months ago
Beyond LLMs: Crafting Robust AI with Multi-Method Agentic Architectures
AI Video

4 months ago
Anthropic's Claude: Reshaping Finance from Curiosity to Production
AI Video

4 months ago
OpenAI's Browser Gambit: Reshaping the AI Interface
AI Video

4 months ago
ChatGPT Unlocks Enterprise Data with New Company Knowledge Feature
AI Video

4 months ago
Claude's New Memory Feature Elevates AI Personalization
AI Video

4 months ago
AI Agents: From Prediction to Autonomous Action
AI Video

4 months ago
AbbVie's AI Strategy: Reshaping Pharma from Discovery to Patient Impact
AI Video

4 months ago
Claude for Life Sciences: Reshaping Scientific Discovery
AI Video

4 months ago
Claude's Skill Creator Redefines AI Tooling
AI Video

4 months ago