#AI Alignment
6 articles with this tag

AI Research
AI Societies' Safety Problem
Self-evolving AI societies face an impossible trilemma: they cannot achieve continuous learning, isolation, and safety alignment simultaneously.
about 13 hours ago

AI Research
The Assistant Axis LLM: How Researchers Are Capping AI Drift
Scientists have mapped the internal neural space of LLMs, identifying the "Assistant Axis" as the key to stabilizing AI persona and preventing harmful behavior.
27 days ago

AI Research
OpenAI is Debugging LLM Misalignment: New Tools Emerge
Researchers are tackling the challenge of understanding and correcting undesirable LLM behavior with a new technique called latent attribution, detailed by ...
3 months ago

AI Video
Emmett Shear on Building AI That Actually Cares: Beyond Control and Steering
3 months ago

Artificial Intelligence
Locai L1-Large beats GPT-5 on alignment using 'Forget-Me-Not'
3 months ago

AI Video
AI's Alignment Imperative: A Race for Wisdom
7 months ago