#AI Alignment

9 articles with this tag

AI Safety Pioneers: Tegmark & Esvelt on Guardrails
AI Research

AI Safety Pioneers: Tegmark & Esvelt on Guardrails

Max Tegmark and Kevin Esvelt discuss the critical importance of AI safety, the risks of advanced AI, and the need for global cooperation in shaping a beneficial future.

about 1 month ago
OpenAI Launches Safety Fellowship
Artificial Intelligence

OpenAI Launches Safety Fellowship

OpenAI launches a new fellowship for external researchers focused on AI safety and alignment, offering stipends and mentorship.

3 months ago
AI Societies' Safety Problem
AI Research

AI Societies' Safety Problem

Self-evolving AI societies face an impossible trilemma: achieving continuous learning, isolation, and safety alignment simultaneously.

5 months ago
The Assistant Axis LLM: How Researchers Are Capping AI Drift
AI Research

The Assistant Axis LLM: How Researchers Are Capping AI Drift

Scientists have mapped the internal neural space of LLMs, identifying the "Assistant Axis" as the key to stabilizing AI persona and preventing harmful behavior.

6 months ago
OpenAI is Debugging LLM Misalignment: New Tools Emerge
Artificial Intelligence

OpenAI is Debugging LLM Misalignment: New Tools Emerge

Researchers are tackling the challenge of understanding and correcting undesirable LLM behavior with a new technique called latent attribution, detailed by...

7 months ago
AI Research

OpenAI is Debugging LLM Misalignment: New Tools Emerge

Researchers are tackling the challenge of understanding and correcting undesirable LLM behavior with a new technique called latent attribution, detailed by...

7 months ago
Emmett Shear on Building AI That Actually Cares: Beyond Control and Steering
AI Video

Emmett Shear on Building AI That Actually Cares: Beyond Control and Steering

8 months ago
Locai L1-Large beats GPT-5 on alignment using 'Forget-Me-Not'
Artificial Intelligence

Locai L1-Large beats GPT-5 on alignment using 'Forget-Me-Not'

8 months ago
AI's Alignment Imperative: A Race for Wisdom
AI Video

AI's Alignment Imperative: A Race for Wisdom

11 months ago