#Debugging
11 articles with this tag

Microsoft Experts on Debugging Non-Deterministic AI Agents
Microsoft experts Tisha Chawla and Susheem Koul discuss the challenges of debugging AI agents in production and introduce strategies for ensuring replayability and observability.

Codex Enhances Web Debugging with Browser Integration
OpenAI's Codex now integrates browser interaction, allowing developers to debug web apps by inspecting network traffic, logs, and performance in real-time.

Fixing AI Bugs: Humanity's Last Big Problem?
Ben Hylak, CTO of Raindrop, discusses the critical challenge of fixing AI agent bugs, calling it "Humanity's Last Big Problem to Solve" and highlighting Raindrop's approach to creating self-healing AI.
Nextdoor engineers build faster with Codex
Nextdoor engineers are using OpenAI's Codex to accelerate development, enabling end-to-end feature building and faster debugging.

Marc Klingen on AI Agents & Langfuse
Marc Klingen of Langfuse shares lessons on upskilling AI coding agents, discussing the importance of observability, documentation, and iterative improvement.

Cursor Cracks Down on App Stability
Cursor has dramatically improved app stability, slashing OOM errors by 80% through advanced diagnostics and focused engineering efforts.

AI in Web Dev: Coding, Debugging, and Site Optimization
AI is revolutionizing web development, offering tools for coding, debugging, and optimization. Experts Yohan Lasorsa and Olivier Leplus explore AI's impact on the field.

Microsoft Debugs AI Agents with AgentRx
Microsoft Research launches AgentRx, an open-source framework and benchmark for systematically debugging AI agent failures, improving accuracy by over 23%.
AI Model Confessions: A New Honesty Layer

OpenAI is Debugging LLM Misalignment: New Tools Emerge
Researchers are tackling the challenge of understanding and correcting undesirable LLM behavior with a new technique called latent attribution, detailed by...
OpenAI is Debugging LLM Misalignment: New Tools Emerge
Researchers are tackling the challenge of understanding and correcting undesirable LLM behavior with a new technique called latent attribution, detailed by...