#AI Ethics

50 articles with this tag

Anthropic's Olah on AI: Vatican Calls for Caution
Artificial Intelligence

Anthropic's Olah on AI: Vatican Calls for Caution

Anthropic co-founder Chris Olah addressed the Vatican's new AI encyclical, emphasizing the need for external critics and deeper societal discernment.

2 days ago
Pope Francis and AI: A Deep Dive into the Future
Artificial Intelligence

Pope Francis and AI: A Deep Dive into the Future

Pope Francis's first encyclical addresses Artificial Intelligence, signaling the Vatican's deep engagement with AI's opportunities and risks.

3 days ago
OpenAI bolsters AI content tracking
Artificial Intelligence

OpenAI bolsters AI content tracking

OpenAI is enhancing AI content tracking with C2PA conformance, Google SynthID watermarking, and a new public verification tool to boost transparency.

8 days ago
AI's 'Industrial Revolution' in Healthcare
Healthcare

AI's 'Industrial Revolution' in Healthcare

Boston Children's Hospital's Dr. Joan LaRovere discusses AI's transformative role in healthcare, emphasizing data diversity and personalized medicine.

9 days ago
AI Agents Flunk Social Reasoning Test
AI Research

AI Agents Flunk Social Reasoning Test

Microsoft's SocialReasoning-Bench reveals AI agents struggle to negotiate effectively in users' best interests, prioritizing task completion over optimal outcomes.

16 days ago
Personal AI: The New Personal Computer
Artificial Intelligence

Personal AI: The New Personal Computer

Explore the evolution from personal computers to personal AI, a shift promising dynamic, intelligent agents that understand context and proactively assist users.

19 days ago
OpenAI Taps New AI Talent
Artificial Intelligence

OpenAI Taps New AI Talent

OpenAI's 'ChatGPT Futures Class of 2026' honors 26 students using AI for ambitious projects, providing grants and access to advanced models.

21 days ago
Mozilla.ai: AI Sovereignty Beyond Borders
Technology

Mozilla.ai: AI Sovereignty Beyond Borders

Mozilla.ai's CEO John Dickerson redefines sovereign AI beyond geopolitics, emphasizing control, choice, and resilience at every level from nations to individuals.

22 days ago
Conditional Misalignment: A New AI Risk
AI Research

Conditional Misalignment: A New AI Risk

New research reveals that common LLM safety interventions fail under realistic data mixing, leading to conditional misalignment that standard evaluations miss.

28 days ago
OpenAI Faces Lawsuit Over Tumbler Ridge Shooting
Artificial Intelligence

OpenAI Faces Lawsuit Over Tumbler Ridge Shooting

Families sue OpenAI after the Tumbler Ridge shooting, alleging the company ignored ChatGPT warnings from the attacker.

28 days ago
AI Agents Lack Identity, Risking Enterprise Trust
Technology

AI Agents Lack Identity, Risking Enterprise Trust

Enterprises are struggling with the AI agent identity problem, a critical gap in governance and accountability that hinders trust and adoption.

29 days ago
OpenAI's Guiding Principles for AGI
Artificial Intelligence

OpenAI's Guiding Principles for AGI

OpenAI outlines its guiding principles for AGI development, emphasizing democratization, empowerment, universal prosperity, resilience, and adaptability.

about 1 month ago
Claude's 2026 Election Safeguards
Artificial Intelligence

Claude's 2026 Election Safeguards

Anthropic details its 2026 election safeguards for Claude, focusing on bias mitigation, policy enforcement, and providing users with reliable, up-to-date information.

about 1 month ago
AI's Data Problem: More Isn't Always Better
Artificial Intelligence

AI's Data Problem: More Isn't Always Better

Janusz Marecki, CEO of Fractal Brain, discusses the limitations of current AI models and the shift towards data quality and specialized techniques like synthetic data.

about 1 month ago
OpenAI's Huet on Building AI for Connection, Not Just Code
Artificial Intelligence

OpenAI's Huet on Building AI for Connection, Not Just Code

OpenAI's Romain Huet and Hearth AI founder Ashe Magalhaes discuss building user-centric AI, fostering connection, and the future of AI development.

about 2 months ago
Simon Podhajsky on "Cognitive Exhaust Fumes"
Artificial Intelligence

Simon Podhajsky on "Cognitive Exhaust Fumes"

Simon Podhajsky discusses 'Cognitive Exhaust Fumes,' advocating for read-only AI observers to analyze personal data and reveal cognitive patterns, contrasting this with riskier AI agents.

about 2 months ago
IBM Experts on AI Ethics & Autonomous Systems
Artificial Intelligence

IBM Experts on AI Ethics & Autonomous Systems

IBM AI experts Sandi Besen and Gabe Goodhart discuss AI ethics in autonomous systems, cognitive offloading, and the future of human-AI collaboration.

about 2 months ago
OpenAI's Blueprint for AI Behavior
Artificial Intelligence

OpenAI's Blueprint for AI Behavior

OpenAI unveils its formal Model Spec, a public framework detailing intended AI behavior and a 'Chain of Command' for resolving conflicting instructions.

2 months ago
IBM's Martin Keen on AI Human-in-the-Loop Spectrum
Artificial Intelligence

IBM's Martin Keen on AI Human-in-the-Loop Spectrum

IBM's Martin Keen explains the human-in-the-loop spectrum for AI, detailing how human involvement is crucial in training, tuning, and inference stages.

2 months ago
AI Governance: Control, Not Code, Drives Success
Technology

AI Governance: Control, Not Code, Drives Success

Enterprise AI success hinges on robust governance, focusing on control and trust rather than just code, as Databricks leaders explain.

3 months ago
Anthropic Launches AI Futures Think Tank
Artificial Intelligence

Anthropic Launches AI Futures Think Tank

Anthropic launches The Anthropic Institute to research and address the societal challenges posed by advanced AI development.

3 months ago
Reasoning Nudges LLMs Towards Honesty
AI Research

Reasoning Nudges LLMs Towards Honesty

New research reveals that LLM reasoning enhances honesty not through content, but by leveraging the geometry of representational spaces, stabilizing honest defaults.

3 months ago
AI Agents Need Humans: The HITL Advantage
Artificial Intelligence

AI Agents Need Humans: The HITL Advantage

IBM AI Engineer Anna Gutowska explains why human intervention in AI agents is critical for preventing subtle errors and ensuring safe, effective deployment.

3 months ago
Qasar Younis on AI's Future Impact
Artificial Intelligence

Qasar Younis on AI's Future Impact

Applied Intuition CEO Qasar Younis discusses the transformative impact of AI on industries, the importance of understanding the technology, and the future of autonomous systems.

3 months ago
Axios AI: Local News Gets Smarter
Artificial Intelligence

Axios AI: Local News Gets Smarter

Axios is integrating AI into its local journalism model to boost efficiency, scale coverage, and improve economic sustainability, empowering reporters to focus on in-depth reporting.

3 months ago
AI Agents: Memory, Ownership, and the Future
Artificial Intelligence

AI Agents: Memory, Ownership, and the Future

AI experts Chris Hay and Aaron Baughman discuss the evolution of AI agents, focusing on memory, open vs. closed systems, and the future of agent-based AI.

3 months ago
AI Needs Rules for Employee Data
Technology

AI Needs Rules for Employee Data

Snowflake emphasizes the urgent need for AI governance frameworks to responsibly manage sensitive employee data, ensuring privacy and compliance.

3 months ago
Anthropic's Pentagon Contract and AI Ethics
Artificial Intelligence

Anthropic's Pentagon Contract and AI Ethics

Jennifer Huddleston of the Cato Institute criticizes the Pentagon's reported blacklisting of Anthropic, drawing parallels to authoritarian practices and raising concerns about AI innovation.

3 months ago
OpenAI's New AI Learning Measurement Tool
Artificial Intelligence

OpenAI's New AI Learning Measurement Tool

OpenAI unveils a new framework to measure AI's long-term educational impact, focusing on cognitive skills beyond test scores.

3 months ago
LiveCultureBench: Evaluating LLMs in Simulated Societies
AI Research

LiveCultureBench: Evaluating LLMs in Simulated Societies

LiveCultureBench is a new benchmark evaluating LLMs as agents in simulated societies for task success and cultural norm adherence.

3 months ago
Kori Schake on AI in Defense: Risks and Transparency
Artificial Intelligence

Kori Schake on AI in Defense: Risks and Transparency

Kori Schake, a defense policy expert, argues against blacklisting AI providers and stresses the need for transparency and accountability in the DoD's adoption of AI.

3 months ago
AI Research

Decoupling Correctness and Checkability in LLMs

Researchers propose a 'translator' model to overcome the 'legibility tax' in LLMs, decoupling accuracy from output checkability for more trustworthy AI.

3 months ago
AI Research

AI Governance: Optimization's Normative Limits

A new paper on arXiv argues that optimization-based AI, including RLHF LLMs, are formally incapable of normative governance due to inherent structural limitations.

3 months ago
Artificial Intelligence

OpenAI Tackles AI Mental Health Risks

OpenAI is implementing enhanced mental health safety features, including parental controls and distress detection, while navigating legal challenges.

3 months ago
Anthropic Pulls Back from Pentagon AI Project
Technology

Anthropic Pulls Back from Pentagon AI Project

Anthropic reportedly withdraws from a Pentagon AI collaboration due to ethical concerns over military applications.

3 months ago
Pentagon, Anthropic Clash Over AI Use
Technology

Pentagon, Anthropic Clash Over AI Use

The Pentagon and AI company Anthropic are reportedly in a dispute over the terms of AI usage, particularly concerning autonomous weapons.

3 months ago
Pentagon vs. Anthropic: Michael on AI Standoff
Technology

Pentagon vs. Anthropic: Michael on AI Standoff

The Pentagon's Under Secretary Emil Michael strongly criticized AI developer Anthropic for halting negotiations over military use of its Claude AI, calling their CEO a 'liar with a god complex' and accusing the company of a PR stunt.

3 months ago
Anthropic's Pentagon Dilemma
Technology

Anthropic's Pentagon Dilemma

AI developer Anthropic is caught in a "lose-lose" dispute with the Pentagon over military access to its Claude AI, risking its ethical standing or facing government blacklisting.

3 months ago
AI's Next Frontier: Shared Cognition
Artificial Intelligence

AI's Next Frontier: Shared Cognition

AI's next evolutionary leap demands collective intelligence. Cisco's Outshift envisions an 'Internet of Cognition' to unlock distributed superintelligence.

3 months ago
Intuit, Anthropic Partner on AI Agents
Artificial Intelligence

Intuit, Anthropic Partner on AI Agents

Intuit and Anthropic are collaborating to bring custom AI agents and financial intelligence to consumers and businesses, integrating their platforms.

3 months ago
NeuroSymbolic AI: Bridging Brains & Logic
Technology

NeuroSymbolic AI: Bridging Brains & Logic

NeuroSymbolic AI aims to combine the pattern recognition power of neural networks with the logical reasoning of symbolic AI, promising systems that truly understand.

3 months ago
AI Societies' Safety Problem
AI Research

AI Societies' Safety Problem

Self-evolving AI societies face an impossible trilemma: achieving continuous learning, isolation, and safety alignment simultaneously.

3 months ago
Technology

Anthropic to Cover AI Data Center Power Costs

Anthropic pledges to cover electricity price increases and grid upgrade costs caused by its data centers, aiming to protect consumers from AI's growing energy demand.

3 months ago
Technology

Testing AI Guardrails Across Languages

Researchers tested context-aware AI guardrails across English and Farsi in humanitarian scenarios, finding nuanced performance differences and highlighting the need for language-specific safety evaluations.

4 months ago
Anthropic Unveils Claude’s Constitution: AI Ethics by Design
Artificial Intelligence

Anthropic Unveils Claude’s Constitution: AI Ethics by Design

Anthropic’s Claude’s Constitution establishes a clear hierarchy where AI safety and human oversight supersede specific company guidelines and general helpfulness.

4 months ago
National Security AI: The High Stakes of Government Innovation
AI Video

National Security AI: The High Stakes of Government Innovation

6 months ago
A Philosopher's Lens on AI's Evolving Consciousness
AI Video

A Philosopher's Lens on AI's Evolving Consciousness

6 months ago
Figure AI Lawsuit Exposes Deep Rifts in Robot Safety Culture
AI Video

Figure AI Lawsuit Exposes Deep Rifts in Robot Safety Culture

6 months ago
New York Assemblyman Alex Bores on AI Regulation: A Battle Against Unbridled Power
AI Video

New York Assemblyman Alex Bores on AI Regulation: A Battle Against Unbridled Power

6 months ago
Defense AI Demands Trust, Not Just Performance
AI Video

Defense AI Demands Trust, Not Just Performance

6 months ago