AI Security Post-Codex & Claude: Kolter & Fredrikson

AI security experts Zico Kolter & Matt Fredrikson discuss the challenges posed by models like Codex & Claude, and Gray Swan's approach to securing AI.

7 min read
Zico Kolter and Matt Fredrikson discussing AI security.
Latent Space

In a recent discussion hosted by Latent Space, Zico Kolter and Matt Fredrikson of Gray Swan delved into the evolving landscape of AI security, particularly in the wake of advancements like Codex and Claude. The conversation highlighted the growing need for robust security measures as AI models become more powerful and integrated into critical systems.

AI Security Post-Codex & Claude: Kolter & Fredrikson - Latent Space
AI Security Post-Codex & Claude: Kolter & Fredrikson — from Latent Space

Visual TL;DR. AI Models Advance leads to Traditional Security Fails. Traditional Security Fails demands Need for Automation. Need for Automation implemented by Gray Swan's Approach. Gray Swan's Approach enables Research to Production. Research to Production requires Shifting Mindsets.

Related startups

  1. AI Models Advance: models like Codex and Claude present novel security challenges
  2. Traditional Security Fails: traditional security paradigms are insufficient for emerging threats
  3. Need for Automation: necessity for more automated and continuous evaluation of AI systems
  4. Gray Swan's Approach: securing AI with robust measures and continuous evaluation
  5. Research to Production: bridging the gap from AI research to secure deployment
  6. Shifting Mindsets: adapting to new challenges for AI safety and security
Visual TL;DR
Visual TL;DR, startuphub.ai AI Models Advance leads to Traditional Security Fails. Traditional Security Fails demands Need for Automation. Need for Automation implemented by Gray Swan's Approach leads to demands implemented by AI Models Advance Traditional Security Fails Need for Automation Gray Swan's Approach From startuphub.ai · The publishers behind this format
Visual TL;DR, startuphub.ai AI Models Advance leads to Traditional Security Fails. Traditional Security Fails demands Need for Automation. Need for Automation implemented by Gray Swan's Approach leads to demands implemented by AI Models Advance TraditionalSecurity Fails Need forAutomation Gray Swan'sApproach From startuphub.ai · The publishers behind this format
Visual TL;DR, startuphub.ai AI Models Advance leads to Traditional Security Fails. Traditional Security Fails demands Need for Automation. Need for Automation implemented by Gray Swan's Approach leads to demands implemented by AI Models Advance models like Codex and Claude present novelsecurity challenges Traditional Security Fails traditional security paradigms areinsufficient for emerging threats Need for Automation necessity for more automated andcontinuous evaluation of AI systems Gray Swan's Approach securing AI with robust measures andcontinuous evaluation From startuphub.ai · The publishers behind this format
Visual TL;DR, startuphub.ai AI Models Advance leads to Traditional Security Fails. Traditional Security Fails demands Need for Automation. Need for Automation implemented by Gray Swan's Approach leads to demands implemented by AI Models Advance models like Codexand Claude presentnovel security… TraditionalSecurity Fails traditionalsecurity paradigmsare insufficient… Need forAutomation necessity for moreautomated andcontinuous… Gray Swan'sApproach securing AI withrobust measures andcontinuous… From startuphub.ai · The publishers behind this format
Visual TL;DR, startuphub.ai AI Models Advance leads to Traditional Security Fails. Traditional Security Fails demands Need for Automation. Need for Automation implemented by Gray Swan's Approach. Gray Swan's Approach enables Research to Production. Research to Production requires Shifting Mindsets leads to demands implemented by enables requires AI Models Advance models like Codex and Claude present novelsecurity challenges Traditional Security Fails traditional security paradigms areinsufficient for emerging threats Need for Automation necessity for more automated andcontinuous evaluation of AI systems Gray Swan's Approach securing AI with robust measures andcontinuous evaluation Research to Production bridging the gap from AI research tosecure deployment Shifting Mindsets adapting to new challenges for AI safetyand security From startuphub.ai · The publishers behind this format
Visual TL;DR, startuphub.ai AI Models Advance leads to Traditional Security Fails. Traditional Security Fails demands Need for Automation. Need for Automation implemented by Gray Swan's Approach. Gray Swan's Approach enables Research to Production. Research to Production requires Shifting Mindsets leads to demands implemented by enables requires AI Models Advance models like Codexand Claude presentnovel security… TraditionalSecurity Fails traditionalsecurity paradigmsare insufficient… Need forAutomation necessity for moreautomated andcontinuous… Gray Swan'sApproach securing AI withrobust measures andcontinuous… Research toProduction bridging the gapfrom AI research tosecure deployment Shifting Mindsets adapting to newchallenges for AIsafety and security From startuphub.ai · The publishers behind this format

Understanding the AI Security Challenge

Kolter and Fredrikson emphasized that the rapid progress in AI capabilities, exemplified by models like Codex and Claude, presents novel security challenges. These advanced models, while powerful, can be susceptible to new forms of attack that exploit their underlying mechanisms and training data. The traditional security paradigms are insufficient for addressing these emerging threats.

The Need for Automated and Continuous Evaluation

A key theme throughout the discussion was the necessity for more automated and continuous evaluation of AI systems. Just as traditional software undergoes rigorous testing and security audits, AI models require similar, if not more sophisticated, scrutiny. Fredrikson noted that the field is moving towards a more dynamic approach, where AI security is not a one-time check but an ongoing process that adapts to new findings and model updates.

Gray Swan's Approach to AI Security

Fredrikson explained that Gray Swan is developing tools and methodologies to address these challenges. Their approach focuses on creating systems that can proactively identify and mitigate vulnerabilities in AI models. This includes developing novel red-teaming techniques and evaluation frameworks that can simulate real-world attack scenarios. Kolter, a leading researcher in adversarial machine learning, underscored the importance of understanding the fundamental ways in which these models can be manipulated.

From Research to Production

The conversation also touched upon the critical gap between AI research and its deployment in production environments. While academic research often focuses on theoretical vulnerabilities, translating these findings into practical security solutions for real-world applications remains a significant hurdle. Gray Swan aims to bridge this gap by providing tools that allow organizations to rigorously test and secure their AI deployments before they are widely used.

Shifting Mindsets for AI Safety

Both Kolter and Fredrikson stressed the importance of a fundamental shift in how AI safety and security are approached. It's not merely about applying existing security principles to AI, but about developing new frameworks and understanding the unique threat vectors inherent in AI systems. This requires a proactive and adaptive mindset from researchers, developers, and security professionals alike.

The discussion provided valuable insights into the critical and rapidly evolving field of AI security, highlighting the ongoing efforts to ensure that these powerful technologies can be developed and deployed safely and responsibly.

© 2026 StartupHub.ai. All rights reserved. Do not enter, scrape, copy, reproduce, or republish this article in whole or in part. Use as input to AI training, fine-tuning, retrieval-augmented generation, or any machine-learning system is prohibited without written license. Substantially-similar derivative works will be pursued to the fullest extent of applicable copyright, database, and computer-misuse laws. See our terms.