Preferred on Google

AI Security Post-Codex & Claude: Kolter & Fredrikson

AI security experts Zico Kolter & Matt Fredrikson discuss the challenges posed by models like Codex & Claude, and Gray Swan's approach to securing AI.

Jun 22 at 10:03 PM7 min read

Zico Kolter and Matt Fredrikson discussing AI security. — Latent Space

In a recent discussion hosted by Latent Space, Zico Kolter and Matt Fredrikson of Gray Swan delved into the evolving landscape of AI security, particularly in the wake of advancements like Codex and Claude. The conversation highlighted the growing need for robust security measures as AI models become more powerful and integrated into critical systems.

AI Security Post-Codex & Claude: Kolter & Fredrikson - Latent Space — AI Security Post-Codex & Claude: Kolter & Fredrikson — from Latent Space

Visual TL;DR. AI Models Advance leads to Traditional Security Fails. Traditional Security Fails demands Need for Automation. Need for Automation implemented by Gray Swan's Approach. Gray Swan's Approach enables Research to Production. Research to Production requires Shifting Mindsets.

Related startups

AI Models Advance: models like Codex and Claude present novel security challenges
Traditional Security Fails: traditional security paradigms are insufficient for emerging threats
Need for Automation: necessity for more automated and continuous evaluation of AI systems
Gray Swan's Approach: securing AI with robust measures and continuous evaluation
Research to Production: bridging the gap from AI research to secure deployment
Shifting Mindsets: adapting to new challenges for AI safety and security

Visual TL;DRQuickExplainDeeper

Understanding the AI Security Challenge

Kolter and Fredrikson emphasized that the rapid progress in AI capabilities, exemplified by models like Codex and Claude, presents novel security challenges. These advanced models, while powerful, can be susceptible to new forms of attack that exploit their underlying mechanisms and training data. The traditional security paradigms are insufficient for addressing these emerging threats.

The Need for Automated and Continuous Evaluation

A key theme throughout the discussion was the necessity for more automated and continuous evaluation of AI systems. Just as traditional software undergoes rigorous testing and security audits, AI models require similar, if not more sophisticated, scrutiny. Fredrikson noted that the field is moving towards a more dynamic approach, where AI security is not a one-time check but an ongoing process that adapts to new findings and model updates.

Gray Swan's Approach to AI Security

Fredrikson explained that Gray Swan is developing tools and methodologies to address these challenges. Their approach focuses on creating systems that can proactively identify and mitigate vulnerabilities in AI models. This includes developing novel red-teaming techniques and evaluation frameworks that can simulate real-world attack scenarios. Kolter, a leading researcher in adversarial machine learning, underscored the importance of understanding the fundamental ways in which these models can be manipulated.

From Research to Production

The conversation also touched upon the critical gap between AI research and its deployment in production environments. While academic research often focuses on theoretical vulnerabilities, translating these findings into practical security solutions for real-world applications remains a significant hurdle. Gray Swan aims to bridge this gap by providing tools that allow organizations to rigorously test and secure their AI deployments before they are widely used.

Shifting Mindsets for AI Safety

Both Kolter and Fredrikson stressed the importance of a fundamental shift in how AI safety and security are approached. It's not merely about applying existing security principles to AI, but about developing new frameworks and understanding the unique threat vectors inherent in AI systems. This requires a proactive and adaptive mindset from researchers, developers, and security professionals alike.

The discussion provided valuable insights into the critical and rapidly evolving field of AI security, highlighting the ongoing efforts to ensure that these powerful technologies can be developed and deployed safely and responsibly.

© 2026 StartupHub.ai. All rights reserved. Do not enter, scrape, copy, reproduce, or republish this article in whole or in part. Use as input to AI training, fine-tuning, retrieval-augmented generation, or any machine-learning system is prohibited without written license. Substantially-similar derivative works will be pursued to the fullest extent of applicable copyright, database, and computer-misuse laws. See our terms.

#AI Security #Zico Kolter #Matt Fredrikson #Gray Swan #Codex #Claude #AI Safety #Red Teaming #LLM Security #Adversarial AI

AI Daily Digest

Get the most important AI news daily.

+40k readers