#Coding Agents

9 articles with this tag

Marlene Mhangami: Playwright for Functionality Testing
Technology

Marlene Mhangami: Playwright for Functionality Testing

Marlene Mhangami from Microsoft and GitHub discusses leveraging Playwright and AI agents for effective functionality testing, emphasizing clean code and behavior-driven development.

about 4 hours ago
OpenAI's "Parameter Golf" Reveals AI's Role
Artificial Intelligence

OpenAI's "Parameter Golf" Reveals AI's Role

OpenAI's "Parameter Golf" competition revealed how AI coding agents are transforming machine learning research, pushing innovation under tight constraints.

4 days ago
VIBE✓ adds friction to AI coding agents
Technology

VIBE✓ adds friction to AI coding agents

Mozilla.ai's VIBE✓ framework introduces deliberate friction to coding agent workflows, mitigating automation bias and ensuring human oversight.

4 days ago
Embedding OpenClaw Coding Agent in Your Product
Artificial Intelligence

Embedding OpenClaw Coding Agent in Your Product

Matthias Luebken from Tavon.ai discusses embedding the OpenClaw coding agent, Pi, into products, highlighting its utility for developers and the future of AI in software systems.

5 days ago
OpenAI's Safety Playbook for Codex
Artificial Intelligence

OpenAI's Safety Playbook for Codex

OpenAI details its robust safety measures for its Codex AI coding agent, emphasizing sandboxing, network controls, and detailed telemetry for secure deployment.

8 days ago
Databricks Tames Coding AI Chaos
Technology

Databricks Tames Coding AI Chaos

Databricks introduces Unity AI Gateway to manage AI coding agents, offering centralized governance, cost controls, and observability for enterprises.

29 days ago
Databricks Centralizes Coding AI
Technology

Databricks Centralizes Coding AI

Databricks launches AI Gateway to centralize governance, security, and cost controls for the growing number of AI coding agents used by enterprises.

29 days ago
Exa Unveils New Code Search Benchmarks
Artificial Intelligence

Exa Unveils New Code Search Benchmarks

Exa.ai releases 'WebCode', a new benchmark suite for evaluating search performance in coding agents, addressing limitations in existing tools.

about 2 months ago
AI Agents Leveled Up by Harness Engineering
Artificial Intelligence

AI Agents Leveled Up by Harness Engineering

LangChain's harness engineering approach dramatically improved an AI coding agent's performance by refining its surrounding system, not the core model.

3 months ago