#AI Ethics
50 articles with this tag

Anthropic's Olah on AI: Vatican Calls for Caution
Anthropic co-founder Chris Olah addressed the Vatican's new AI encyclical, emphasizing the need for external critics and deeper societal discernment.

Pope Francis and AI: A Deep Dive into the Future
Pope Francis's first encyclical addresses Artificial Intelligence, signaling the Vatican's deep engagement with AI's opportunities and risks.
OpenAI bolsters AI content tracking
OpenAI is enhancing AI content tracking with C2PA conformance, Google SynthID watermarking, and a new public verification tool to boost transparency.

AI's 'Industrial Revolution' in Healthcare
Boston Children's Hospital's Dr. Joan LaRovere discusses AI's transformative role in healthcare, emphasizing data diversity and personalized medicine.

AI Agents Flunk Social Reasoning Test
Microsoft's SocialReasoning-Bench reveals AI agents struggle to negotiate effectively in users' best interests, prioritizing task completion over optimal outcomes.

Personal AI: The New Personal Computer
Explore the evolution from personal computers to personal AI, a shift promising dynamic, intelligent agents that understand context and proactively assist users.
OpenAI Taps New AI Talent
OpenAI's 'ChatGPT Futures Class of 2026' honors 26 students using AI for ambitious projects, providing grants and access to advanced models.

Mozilla.ai: AI Sovereignty Beyond Borders
Mozilla.ai's CEO John Dickerson redefines sovereign AI beyond geopolitics, emphasizing control, choice, and resilience at every level from nations to individuals.
Conditional Misalignment: A New AI Risk
New research reveals that common LLM safety interventions fail under realistic data mixing, leading to conditional misalignment that standard evaluations miss.

OpenAI Faces Lawsuit Over Tumbler Ridge Shooting
Families sue OpenAI after the Tumbler Ridge shooting, alleging the company ignored ChatGPT warnings from the attacker.

AI Agents Lack Identity, Risking Enterprise Trust
Enterprises are struggling with the AI agent identity problem, a critical gap in governance and accountability that hinders trust and adoption.
OpenAI's Guiding Principles for AGI
OpenAI outlines its guiding principles for AGI development, emphasizing democratization, empowerment, universal prosperity, resilience, and adaptability.

Claude's 2026 Election Safeguards
Anthropic details its 2026 election safeguards for Claude, focusing on bias mitigation, policy enforcement, and providing users with reliable, up-to-date information.
AI's Data Problem: More Isn't Always Better
Janusz Marecki, CEO of Fractal Brain, discusses the limitations of current AI models and the shift towards data quality and specialized techniques like synthetic data.

OpenAI's Huet on Building AI for Connection, Not Just Code
OpenAI's Romain Huet and Hearth AI founder Ashe Magalhaes discuss building user-centric AI, fostering connection, and the future of AI development.

Simon Podhajsky on "Cognitive Exhaust Fumes"
Simon Podhajsky discusses 'Cognitive Exhaust Fumes,' advocating for read-only AI observers to analyze personal data and reveal cognitive patterns, contrasting this with riskier AI agents.

IBM Experts on AI Ethics & Autonomous Systems
IBM AI experts Sandi Besen and Gabe Goodhart discuss AI ethics in autonomous systems, cognitive offloading, and the future of human-AI collaboration.
OpenAI's Blueprint for AI Behavior
OpenAI unveils its formal Model Spec, a public framework detailing intended AI behavior and a 'Chain of Command' for resolving conflicting instructions.

IBM's Martin Keen on AI Human-in-the-Loop Spectrum
IBM's Martin Keen explains the human-in-the-loop spectrum for AI, detailing how human involvement is crucial in training, tuning, and inference stages.
AI Governance: Control, Not Code, Drives Success
Enterprise AI success hinges on robust governance, focusing on control and trust rather than just code, as Databricks leaders explain.

Anthropic Launches AI Futures Think Tank
Anthropic launches The Anthropic Institute to research and address the societal challenges posed by advanced AI development.
Reasoning Nudges LLMs Towards Honesty
New research reveals that LLM reasoning enhances honesty not through content, but by leveraging the geometry of representational spaces, stabilizing honest defaults.

AI Agents Need Humans: The HITL Advantage
IBM AI Engineer Anna Gutowska explains why human intervention in AI agents is critical for preventing subtle errors and ensuring safe, effective deployment.

Qasar Younis on AI's Future Impact
Applied Intuition CEO Qasar Younis discusses the transformative impact of AI on industries, the importance of understanding the technology, and the future of autonomous systems.
Axios AI: Local News Gets Smarter
Axios is integrating AI into its local journalism model to boost efficiency, scale coverage, and improve economic sustainability, empowering reporters to focus on in-depth reporting.

AI Agents: Memory, Ownership, and the Future
AI experts Chris Hay and Aaron Baughman discuss the evolution of AI agents, focusing on memory, open vs. closed systems, and the future of agent-based AI.

AI Needs Rules for Employee Data
Snowflake emphasizes the urgent need for AI governance frameworks to responsibly manage sensitive employee data, ensuring privacy and compliance.

Anthropic's Pentagon Contract and AI Ethics
Jennifer Huddleston of the Cato Institute criticizes the Pentagon's reported blacklisting of Anthropic, drawing parallels to authoritarian practices and raising concerns about AI innovation.
OpenAI's New AI Learning Measurement Tool
OpenAI unveils a new framework to measure AI's long-term educational impact, focusing on cognitive skills beyond test scores.
LiveCultureBench: Evaluating LLMs in Simulated Societies
LiveCultureBench is a new benchmark evaluating LLMs as agents in simulated societies for task success and cultural norm adherence.

Kori Schake on AI in Defense: Risks and Transparency
Kori Schake, a defense policy expert, argues against blacklisting AI providers and stresses the need for transparency and accountability in the DoD's adoption of AI.
Decoupling Correctness and Checkability in LLMs
Researchers propose a 'translator' model to overcome the 'legibility tax' in LLMs, decoupling accuracy from output checkability for more trustworthy AI.
AI Governance: Optimization's Normative Limits
A new paper on arXiv argues that optimization-based AI, including RLHF LLMs, are formally incapable of normative governance due to inherent structural limitations.
OpenAI Tackles AI Mental Health Risks
OpenAI is implementing enhanced mental health safety features, including parental controls and distress detection, while navigating legal challenges.

Anthropic Pulls Back from Pentagon AI Project
Anthropic reportedly withdraws from a Pentagon AI collaboration due to ethical concerns over military applications.

Pentagon, Anthropic Clash Over AI Use
The Pentagon and AI company Anthropic are reportedly in a dispute over the terms of AI usage, particularly concerning autonomous weapons.

Pentagon vs. Anthropic: Michael on AI Standoff
The Pentagon's Under Secretary Emil Michael strongly criticized AI developer Anthropic for halting negotiations over military use of its Claude AI, calling their CEO a 'liar with a god complex' and accusing the company of a PR stunt.

Anthropic's Pentagon Dilemma
AI developer Anthropic is caught in a "lose-lose" dispute with the Pentagon over military access to its Claude AI, risking its ethical standing or facing government blacklisting.

AI's Next Frontier: Shared Cognition
AI's next evolutionary leap demands collective intelligence. Cisco's Outshift envisions an 'Internet of Cognition' to unlock distributed superintelligence.

Intuit, Anthropic Partner on AI Agents
Intuit and Anthropic are collaborating to bring custom AI agents and financial intelligence to consumers and businesses, integrating their platforms.

NeuroSymbolic AI: Bridging Brains & Logic
NeuroSymbolic AI aims to combine the pattern recognition power of neural networks with the logical reasoning of symbolic AI, promising systems that truly understand.

AI Societies' Safety Problem
Self-evolving AI societies face an impossible trilemma: achieving continuous learning, isolation, and safety alignment simultaneously.
Anthropic to Cover AI Data Center Power Costs
Anthropic pledges to cover electricity price increases and grid upgrade costs caused by its data centers, aiming to protect consumers from AI's growing energy demand.
Testing AI Guardrails Across Languages
Researchers tested context-aware AI guardrails across English and Farsi in humanitarian scenarios, finding nuanced performance differences and highlighting the need for language-specific safety evaluations.

Anthropic Unveils Claude’s Constitution: AI Ethics by Design
Anthropic’s Claude’s Constitution establishes a clear hierarchy where AI safety and human oversight supersede specific company guidelines and general helpfulness.

National Security AI: The High Stakes of Government Innovation

A Philosopher's Lens on AI's Evolving Consciousness

Figure AI Lawsuit Exposes Deep Rifts in Robot Safety Culture

New York Assemblyman Alex Bores on AI Regulation: A Battle Against Unbridled Power
