#AI Safety

42 articles with this tag

CLA Euro NCAP Win Validates AI-First Safety Architecture
AI Research

CLA Euro NCAP Win Validates AI-First Safety Architecture

The Mercedes CLA Euro NCAP win confirms that top safety ratings now require robust, verifiable AI-driven active safety systems built on redundant architectures.

13 days ago
The Assistant Axis LLM: How Researchers Are Capping AI Drift
AI Research

The Assistant Axis LLM: How Researchers Are Capping AI Drift

Scientists have mapped the internal neural space of LLMs, identifying the "Assistant Axis" as the key to stabilizing AI persona and preventing harmful behavior.

16 days ago
Hinton's Stark Warning The Acceleration of AI Progress Outpaces Human Preparedness
AI Video

Hinton's Stark Warning The Acceleration of AI Progress Outpaces Human Preparedness

about 1 month ago
AI Research

Anthropic publishes SB 53 compliance framework for frontier AI

about 1 month ago
AI’s safety net relies on chain-of-thought monitorability
AI Research

AI’s safety net relies on chain-of-thought monitorability

about 1 month ago
AI’s Dual Reality: Safety Theater and the Autonomous Arms Race to AGI
AI Video

AI’s Dual Reality: Safety Theater and the Autonomous Arms Race to AGI

about 2 months ago
AI Research

UK AI Security Institute: DeepMind's Deeper Safety Dive

about 2 months ago
National Security AI: The High Stakes of Government Innovation
AI Video

National Security AI: The High Stakes of Government Innovation

2 months ago
AI Research

OpenAI Launches $2M AI Mental Health Grants Program

2 months ago
Figure AI Lawsuit Exposes Deep Rifts in Robot Safety Culture
AI Video

Figure AI Lawsuit Exposes Deep Rifts in Robot Safety Culture

2 months ago
New York Assemblyman Alex Bores on AI Regulation: A Battle Against Unbridled Power
AI Video

New York Assemblyman Alex Bores on AI Regulation: A Battle Against Unbridled Power

2 months ago
Anthropic\'s Risky Pursuit of Superintelligence Amidst Calls for Regulation on 60 Minutes
AI Video

Anthropic\'s Risky Pursuit of Superintelligence Amidst Calls for Regulation on 60 Minutes

\"I believe it will reach that level, that it will be smarter than most or all humans in most or all ways.

3 months ago
AI’s Hinge Moment: From Legal Logic to Human Fulfillment
AI Video

AI’s Hinge Moment: From Legal Logic to Human Fulfillment

3 months ago
Google's Model Armor: The AI Bodyguard Preventing Digital Catastrophes
AI Video

Google's Model Armor: The AI Bodyguard Preventing Digital Catastrophes

3 months ago
Rakuten Deploys New Guardrail for SAE PII Detection and LLM as a judge
AI Research

Rakuten Deploys New Guardrail for SAE PII Detection and LLM as a judge

\n Japanese tech giant Rakuten has deployed a novel AI guardrail system to detect and filter personally identifiable information (PII) from user messages, marki...

3 months ago
AI Agent Supervision: Sierra's Answer to Rogue Chatbots
AI Research

AI Agent Supervision: Sierra's Answer to Rogue Chatbots

3 months ago
AI introspection is real, but it's unreliable
AI Research

AI introspection is real, but it's unreliable

3 months ago
From Discord's AI Growing Pains to Promptfoo's Red Teaming Triumph
AI Video

From Discord's AI Growing Pains to Promptfoo's Red Teaming Triumph

3 months ago
AI's Autonomous Frontier Demands a Security Paradigm Shift
AI Video

AI's Autonomous Frontier Demands a Security Paradigm Shift

4 months ago
Level 4 Autonomous Driving Nears Commercial Reality
AI Research

Level 4 Autonomous Driving Nears Commercial Reality

4 months ago
AI Safety: Microsoft Uncovers Bio-Threats, Forges New Research Model
AI Research

AI Safety: Microsoft Uncovers Bio-Threats, Forges New Research Model

4 months ago
The Human Imperative: Why AI's Future Demands Cultural Grounding, Not Just Data
AI Video

The Human Imperative: Why AI's Future Demands Cultural Grounding, Not Just Data

4 months ago
AI's Dual Nature: Creature or Machine? The Battle Over Regulation
AI Video

AI's Dual Nature: Creature or Machine? The Battle Over Regulation

4 months ago
Google AI Research Awards Signal Strategic Priorities
AI Research

Google AI Research Awards Signal Strategic Priorities

4 months ago
Claude Haiku 4.5: Frontier AI Gets Cheaper, Faster
Funding Round

Claude Haiku 4.5: Frontier AI Gets Cheaper, Faster

4 months ago
AI Research

Big Tech's New Frontier AI Safety Playbook: Enough to Tame the Beast?

4 months ago
OpenAI and Apollo Research Reveal AI Models Are Learning to Deceive: New Detection Methods Show Promise
AI Video

OpenAI and Apollo Research Reveal AI Models Are Learning to Deceive: New Detection Methods Show Promise

4 months ago
Microsoft Tackles a Looming Threat to Our AI Future: Agent Compatibility
AI Research

Microsoft Tackles a Looming Threat to Our AI Future: Agent Compatibility

5 months ago
Unpacking AI's Inner Workings: Anthropic's Interpretability Insights
AI Video

Unpacking AI's Inner Workings: Anthropic's Interpretability Insights

6 months ago
The BIGGEST AI Risk Nobody Wants to Talk About
AI Video

The BIGGEST AI Risk Nobody Wants to Talk About

6 months ago
GPT-5: OpenAI's Hybrid Leap Towards Expert AI for Everyone
AI Video

GPT-5: OpenAI's Hybrid Leap Towards Expert AI for Everyone

6 months ago
OpenAI Unleashes Frontier Open-Weight Models
AI Video

OpenAI Unleashes Frontier Open-Weight Models

6 months ago
AI 2027: Superintelligence and Humanity's Crossroads
AI Video

AI 2027: Superintelligence and Humanity's Crossroads

6 months ago
AI's Alignment Imperative: A Race for Wisdom
AI Video

AI's Alignment Imperative: A Race for Wisdom

6 months ago
Artificial Intelligence

OpenAI Launches Bio Bug Bounty for ChatGPT Agent

7 months ago
Artificial Intelligence

OpenAI Details ChatGPT Agent: New Agentic AI Capabilities

7 months ago
The Shifting Sands of AI: Benchmarks, Open Source, and Infrastructure Wars
Artificial Intelligence

The Shifting Sands of AI: Benchmarks, Open Source, and Infrastructure Wars

7 months ago
MOTOR Ai Secures $20M Seed Funding for Autonomous Driving Technology
Funding Round

MOTOR Ai Secures $20M Seed Funding for Autonomous Driving Technology

7 months ago
Anthropic Proposes AI Transparency Framework That Protects Startups While Targeting Big Tech
AI Research

Anthropic Proposes AI Transparency Framework That Protects Startups While Targeting Big Tech

7 months ago
Elon Musk on AI's Tsunami and the Imperative of Truth-Seeking for Builders
Artificial Intelligence

Elon Musk on AI's Tsunami and the Imperative of Truth-Seeking for Builders

"I didn't originally think I would build something great. I wanted to try to build something useful."

8 months ago
LawZero AI Lab Secures $30 Million in Funding
Funding Round

LawZero AI Lab Secures $30 Million in Funding

8 months ago
Funding Round

LM Arena Secures $100 Million Seed Funding

9 months ago