#AI Ethics

50 articles with this tag

Anthropic's Olah on AI: Vatican Calls for Caution

Anthropic co-founder Chris Olah addressed the Vatican's new AI encyclical, emphasizing the need for external critics and deeper societal discernment.

2 days ago

Artificial Intelligence

Pope Francis and AI: A Deep Dive into the Future

Pope Francis's first encyclical addresses Artificial Intelligence, signaling the Vatican's deep engagement with AI's opportunities and risks.

3 days ago

Artificial Intelligence

OpenAI bolsters AI content tracking

OpenAI is enhancing AI content tracking with C2PA conformance, Google SynthID watermarking, and a new public verification tool to boost transparency.

8 days ago

Healthcare

AI's 'Industrial Revolution' in Healthcare

Boston Children's Hospital's Dr. Joan LaRovere discusses AI's transformative role in healthcare, emphasizing data diversity and personalized medicine.

9 days ago

AI Research

AI Agents Flunk Social Reasoning Test

Microsoft's SocialReasoning-Bench reveals AI agents struggle to negotiate effectively in users' best interests, prioritizing task completion over optimal outcomes.

16 days ago

Artificial Intelligence

Personal AI: The New Personal Computer

Explore the evolution from personal computers to personal AI, a shift promising dynamic, intelligent agents that understand context and proactively assist users.

19 days ago

Artificial Intelligence

OpenAI Taps New AI Talent

OpenAI's 'ChatGPT Futures Class of 2026' honors 26 students using AI for ambitious projects, providing grants and access to advanced models.

21 days ago

Technology

Mozilla.ai: AI Sovereignty Beyond Borders

Mozilla.ai's CEO John Dickerson redefines sovereign AI beyond geopolitics, emphasizing control, choice, and resilience at every level from nations to individuals.

22 days ago

AI Research

Conditional Misalignment: A New AI Risk

New research reveals that common LLM safety interventions fail under realistic data mixing, leading to conditional misalignment that standard evaluations miss.

28 days ago

Artificial Intelligence

OpenAI Faces Lawsuit Over Tumbler Ridge Shooting

Families sue OpenAI after the Tumbler Ridge shooting, alleging the company ignored ChatGPT warnings from the attacker.

28 days ago

Technology

AI Agents Lack Identity, Risking Enterprise Trust

Enterprises are struggling with the AI agent identity problem, a critical gap in governance and accountability that hinders trust and adoption.

29 days ago

Artificial Intelligence

OpenAI's Guiding Principles for AGI

OpenAI outlines its guiding principles for AGI development, emphasizing democratization, empowerment, universal prosperity, resilience, and adaptability.

about 1 month ago

Artificial Intelligence

Claude's 2026 Election Safeguards

Anthropic details its 2026 election safeguards for Claude, focusing on bias mitigation, policy enforcement, and providing users with reliable, up-to-date information.

about 1 month ago

Artificial Intelligence

AI's Data Problem: More Isn't Always Better

Janusz Marecki, CEO of Fractal Brain, discusses the limitations of current AI models and the shift towards data quality and specialized techniques like synthetic data.

about 1 month ago

Artificial Intelligence

OpenAI's Huet on Building AI for Connection, Not Just Code

OpenAI's Romain Huet and Hearth AI founder Ashe Magalhaes discuss building user-centric AI, fostering connection, and the future of AI development.

about 2 months ago

Artificial Intelligence

Simon Podhajsky on "Cognitive Exhaust Fumes"

Simon Podhajsky discusses 'Cognitive Exhaust Fumes,' advocating for read-only AI observers to analyze personal data and reveal cognitive patterns, contrasting this with riskier AI agents.

about 2 months ago

Artificial Intelligence

IBM Experts on AI Ethics & Autonomous Systems

IBM AI experts Sandi Besen and Gabe Goodhart discuss AI ethics in autonomous systems, cognitive offloading, and the future of human-AI collaboration.

about 2 months ago

Artificial Intelligence

OpenAI's Blueprint for AI Behavior

OpenAI unveils its formal Model Spec, a public framework detailing intended AI behavior and a 'Chain of Command' for resolving conflicting instructions.

2 months ago

Artificial Intelligence

IBM's Martin Keen on AI Human-in-the-Loop Spectrum

IBM's Martin Keen explains the human-in-the-loop spectrum for AI, detailing how human involvement is crucial in training, tuning, and inference stages.

2 months ago

Technology

AI Governance: Control, Not Code, Drives Success

Enterprise AI success hinges on robust governance, focusing on control and trust rather than just code, as Databricks leaders explain.

3 months ago

Artificial Intelligence

Anthropic Launches AI Futures Think Tank

Anthropic launches The Anthropic Institute to research and address the societal challenges posed by advanced AI development.

3 months ago

AI Research

Reasoning Nudges LLMs Towards Honesty

New research reveals that LLM reasoning enhances honesty not through content, but by leveraging the geometry of representational spaces, stabilizing honest defaults.

3 months ago

Artificial Intelligence

AI Agents Need Humans: The HITL Advantage

IBM AI Engineer Anna Gutowska explains why human intervention in AI agents is critical for preventing subtle errors and ensuring safe, effective deployment.

3 months ago

Artificial Intelligence

Qasar Younis on AI's Future Impact

Applied Intuition CEO Qasar Younis discusses the transformative impact of AI on industries, the importance of understanding the technology, and the future of autonomous systems.

3 months ago

Artificial Intelligence

Axios AI: Local News Gets Smarter

Axios is integrating AI into its local journalism model to boost efficiency, scale coverage, and improve economic sustainability, empowering reporters to focus on in-depth reporting.

3 months ago

Artificial Intelligence

AI Agents: Memory, Ownership, and the Future

AI experts Chris Hay and Aaron Baughman discuss the evolution of AI agents, focusing on memory, open vs. closed systems, and the future of agent-based AI.

3 months ago

Technology

AI Needs Rules for Employee Data

Snowflake emphasizes the urgent need for AI governance frameworks to responsibly manage sensitive employee data, ensuring privacy and compliance.

3 months ago

Artificial Intelligence

Anthropic's Pentagon Contract and AI Ethics

Jennifer Huddleston of the Cato Institute criticizes the Pentagon's reported blacklisting of Anthropic, drawing parallels to authoritarian practices and raising concerns about AI innovation.

3 months ago

Artificial Intelligence

OpenAI's New AI Learning Measurement Tool

OpenAI unveils a new framework to measure AI's long-term educational impact, focusing on cognitive skills beyond test scores.

3 months ago

AI Research

LiveCultureBench: Evaluating LLMs in Simulated Societies

LiveCultureBench is a new benchmark evaluating LLMs as agents in simulated societies for task success and cultural norm adherence.

3 months ago

Artificial Intelligence

Kori Schake on AI in Defense: Risks and Transparency

Kori Schake, a defense policy expert, argues against blacklisting AI providers and stresses the need for transparency and accountability in the DoD's adoption of AI.

3 months ago

AI Research

Decoupling Correctness and Checkability in LLMs

Researchers propose a 'translator' model to overcome the 'legibility tax' in LLMs, decoupling accuracy from output checkability for more trustworthy AI.

3 months ago

AI Research

AI Governance: Optimization's Normative Limits

A new paper on arXiv argues that optimization-based AI, including RLHF LLMs, are formally incapable of normative governance due to inherent structural limitations.

3 months ago

Artificial Intelligence

OpenAI Tackles AI Mental Health Risks

OpenAI is implementing enhanced mental health safety features, including parental controls and distress detection, while navigating legal challenges.

3 months ago

Technology

Anthropic Pulls Back from Pentagon AI Project

Anthropic reportedly withdraws from a Pentagon AI collaboration due to ethical concerns over military applications.

3 months ago

Technology

Pentagon, Anthropic Clash Over AI Use

The Pentagon and AI company Anthropic are reportedly in a dispute over the terms of AI usage, particularly concerning autonomous weapons.

3 months ago

Technology

Pentagon vs. Anthropic: Michael on AI Standoff

The Pentagon's Under Secretary Emil Michael strongly criticized AI developer Anthropic for halting negotiations over military use of its Claude AI, calling their CEO a 'liar with a god complex' and accusing the company of a PR stunt.

3 months ago

Technology

Anthropic's Pentagon Dilemma

AI developer Anthropic is caught in a "lose-lose" dispute with the Pentagon over military access to its Claude AI, risking its ethical standing or facing government blacklisting.

3 months ago

Artificial Intelligence

AI's Next Frontier: Shared Cognition

AI's next evolutionary leap demands collective intelligence. Cisco's Outshift envisions an 'Internet of Cognition' to unlock distributed superintelligence.

3 months ago

Artificial Intelligence

Intuit, Anthropic Partner on AI Agents

Intuit and Anthropic are collaborating to bring custom AI agents and financial intelligence to consumers and businesses, integrating their platforms.

3 months ago

Technology

NeuroSymbolic AI: Bridging Brains & Logic

NeuroSymbolic AI aims to combine the pattern recognition power of neural networks with the logical reasoning of symbolic AI, promising systems that truly understand.

3 months ago

AI Research

AI Societies' Safety Problem

Self-evolving AI societies face an impossible trilemma: achieving continuous learning, isolation, and safety alignment simultaneously.

3 months ago

Technology

Anthropic to Cover AI Data Center Power Costs

Anthropic pledges to cover electricity price increases and grid upgrade costs caused by its data centers, aiming to protect consumers from AI's growing energy demand.

3 months ago

Technology

Testing AI Guardrails Across Languages

Researchers tested context-aware AI guardrails across English and Farsi in humanitarian scenarios, finding nuanced performance differences and highlighting the need for language-specific safety evaluations.

4 months ago

Artificial Intelligence

Anthropic Unveils Claude’s Constitution: AI Ethics by Design

Anthropic’s Claude’s Constitution establishes a clear hierarchy where AI safety and human oversight supersede specific company guidelines and general helpfulness.

4 months ago