#AI Development
50 articles with this tag
Sea Bets Big on AI Coding with OpenAI Codex
Sea Limited is deploying OpenAI Codex across its developer organization, aiming to transform software development in Southeast Asia through AI-native workflows and agentic collaboration.

Mind the Gap in Agent Observability
Microsoft's Amy Boyd and Nitya Narasimhan discuss the critical 'gap' in AI agent observability and the need for better tools.

Open Source Roguelikes Thrive on Community
The enduring legacy of roguelike games is built on open-source collaboration and community-driven evolution, a model that continues to shape their development.

VIBE✓ adds friction to AI coding agents
Mozilla.ai's VIBE✓ framework introduces deliberate friction to coding agent workflows, mitigating automation bias and ensuring human oversight.

ElevenLabs Gives Chat Agents a Voice
Luke Harries from ElevenLabs discusses the increasing importance of voice for AI chat agents, highlighting the benefits of speed, accessibility, and user experience.

Matt Pocock: Engineering Fundamentals Still Crucial in AI
Matt Pocock, author of 'AI Hero', emphasizes that engineering fundamentals are more crucial than ever for building robust AI systems.

Cursor's AI Agents Get Worktree Boost
David Gomes of Cursor detailed the integration of Git worktrees into AI agents, enabling isolated task execution and reducing code complexity.

Building Better AI Agents: The Eval Platform Challenge
Phil Hetzel of Braintrust discusses the challenges and best practices for building effective evaluation platforms for AI agents, emphasizing a systems-level approach.

Snowflake Adds OpenAI's GPT 5.5
Snowflake integrates OpenAI's latest GPT 5.5 model into its Cortex AI platform, enhancing enterprise capabilities for coding, data analysis, and AI agent development.

NVIDIA Engineer on GPT-5.5's 'Superpower'
NVIDIA's Dennis Hannusch discusses GPT-5.5, calling its ability to 'just get things done' its superpower and detailing its use in production-level software development.
OpenAI Codex Plugins Expand AI Capabilities
OpenAI Codex enhances its AI with plugins for external data access and skills for executing custom workflows.

Red Hat's Clyburn on Podman's AI Potential
Red Hat's Cedric Clyburn discusses Podman, highlighting its features for AI development, including Systemd integration and bootable containers.

Snowflake's Agent Framework for Finance
Snowflake's Ecosystem Agent Framework aims to automate financial services by enabling AI agents to execute tasks directly on unified data.

AI Agents Need Skills: Martin Keen on LLM Tooling
Martin Keen of IBM explains how AI agent skills, defined in structured files, are essential for LLMs to perform tasks, detailing the "skill file" format and different knowledge types.

OpenAI's Ryan Lopopolo on Harnessing AI for Software Engineering
OpenAI's Ryan Lopopolo discusses how AI agents are reshaping software engineering, emphasizing the shift towards human oversight and strategic prompt design.

AI Personalities: From Shakespeare to ChatGPT
Anish Acharya and Erik Torenberg of a16z discuss the development of AI personalities, the challenges of making AI relatable, and the future of human-AI interaction.

Notion's Sarah Sachs on AI Agents and the Future of Work
Sarah Sachs, AI Lead at Notion, discusses the company's vision for flexible AI agents, iterative development, and balancing powerful features with user accessibility on the Latent Space podcast.

Cloudflare Unveils Durable Objects Facets
Cloudflare's new Durable Objects Facets feature allows AI-generated applications to manage persistent data, bridging the gap between dynamic code execution and stateful storage.

IBM's Jeff Crume on AI Tech Debt
Jeff Crume of IBM explains how AI systems can accrue technical debt, the risks involved, and how to mitigate it through strategic planning and discipline.

Meta-Harness: AI Optimizes AI Development
Researchers unveil Meta-Harness, a novel AI system that automates harness optimization, leading to faster and more capable LLMs.

Garry Tan Showcases AI Agent Frameworks
Garry Tan explores gstack, Hermes Agent, and Paperclip, showcasing the growing power of AI agent orchestration frameworks.

Anthropic's Claude Masters Autonomous Coding
Anthropic details a new multi-agent system that enables Claude to autonomously generate complex full-stack applications, moving beyond previous limitations in AI coding.

Andrej Karpathy on AI Agents: More Than Just Code
Andrej Karpathy discusses the evolution of AI agents beyond code generation, emphasizing the need for modularity, self-improvement, and human-AI collaboration for future advancements.

Sakana AI, MUFG Test Loan Expert AI
Sakana AI and MUFG are piloting an AI agent to transform the bank's lending process, emphasizing human-AI collaboration for enhanced decision-making.

Pydantic AI's Samuel Colvin on Building Better LLM Agents
Pydantic AI founder Samuel Colvin discusses building LLM agents, highlighting type safety, code execution environments, and the future of AI tooling.

Perplexity's Agent API Unifies LLM Access
Perplexity's new Agent API offers a unified interface to multiple LLM providers, simplifying development with integrated search and tools.

IBM's Grant Miller on AI Agents: Control vs. Capability
IBM Distinguished Engineer Grant Miller discusses the challenges of AI agent development, focusing on balancing capability with control and avoiding super agency.

AI Agents: From "Slowly Accelerating" to "Faster": Latent Space Recap
Latent Space podcast guests Samantha Whitmore and Jonas Nelle discuss the evolution of AI agents, from basic tasks to complex reasoning and the importance of human collaboration.

OpenAI's GPT-5.4 Enhances Web Dev and Game Creation
OpenAI's GPT-5.4 demonstrates significant advancements in AI-assisted development, showcasing its prowess in building complex applications like a 3D chess game and responsive websites from design images.

OpenAI Codex Deepens Figma Integration
OpenAI and Figma have launched a direct integration for Codex, enabling a seamless code-to-design and design-to-code workflow for faster product iteration.

Micro1: AI's Future is Expert Human Judgment
Micro1 argues AI's next frontier involves distilling expert human judgment and replicating complex, multi-actor tasks, requiring a human-first approach to data and supervision.

Brave Search API for AI Apps Sees Explosive Growth
Brave Search API for AI apps is rapidly becoming the go-to solution for developers, offering an independent, private, and AI-optimized web index amid market shifts.

Palantir OSDK Simplifies Enterprise AI Dev
Palantir's OSDK enables businesses to generate custom SDKs from their enterprise ontology, integrating data, logic, actions, and LLMs for rapid application development.

AI Agents Leveled Up by Harness Engineering
LangChain's harness engineering approach dramatically improved an AI coding agent's performance by refining its surrounding system, not the core model.

Claude Sonnet 4.6 Ups the AI Ante
Anthropic's Claude Sonnet 4.6 launches with major upgrades in coding, reasoning, and computer use, plus a 1M token context window.

Anthropic Labs Expansion Signals New Product Velocity
Anthropic is formalizing its experimental product unit, signaling a commitment to faster iteration from frontier research to market-ready tools.

Google's Vertex AI Studio: Accelerating AI Development from Concept to Production

Khosla's Bet on World Models: General Intuition's Vision for AI's Next Frontier

Building Resilient AI Agents Through Abstraction

Anthropic Unveils Advanced APIs for Agentic AI Development

AWS AgentCore Unlocks Production-Ready AI Agents for Enterprise
GPT-5.1: The Art and Science of Intelligent Personalities
\"Part of the art here is figuring out how to pull out these quirks in the model that can come across as personality without breaking steerability.

GPT-5.1: The Art and Science of Intelligent Personalities
\"Part of the art here is figuring out how to pull out these quirks in the model that can come across as personality without breaking steerability.

Google Antigravity Redefines AI Development with Agent-First IDE

AI's Codebase Conundrum: HumanLayer's Context Engineering Breakthrough

AI's Seventy-Year Odyssey: From Turing's Test to Agentic Futures

Enterprise Vibe Coding: The New Frontier for Developer Productivity

ASEAN's Digital Sovereignty: Building, Not Buying, the AI Future

AIE Code Summit: The Unspoken Rhythms of AI Innovation
