#AI Infrastructure

50 articles with this tag

Technology

Mozilla.ai: AI Sovereignty Beyond Borders

Mozilla.ai's CEO John Dickerson redefines sovereign AI beyond geopolitics, emphasizing control, choice, and resilience at every level from nations to individuals.

about 22 hours ago
Technology

Together AI Supercharges LLM Inference

Together AI unveils ATLAS, accelerating LLM inference up to 4x with adaptive speculative decoding, tackling the growing cost challenge for AI-native companies.

2 days ago
Investor News

AI Chip Surge Fuels Market Rally

Nvidia AI infrastructure leads a market surge, mirroring past tech cycles, while software valuations face headwinds and social media usage plateaus globally.

5 days ago
Technology

Together AI Halts Copy Fail Exploit

Together AI swiftly contained the Copy Fail CVE-2026-31431 vulnerability by disabling a vulnerable Linux kernel module, safeguarding its AI infrastructure.

6 days ago
Technology

DeepSeek V4 Pro Hits Together AI

Together AI launches DeepSeek V4 Pro, a 1.6T MoE model with a 512K context window and new cached input pricing for cost-effective long-context reasoning.

7 days ago
Technology

Together AI Adds NVIDIA Nemotron 3

Together AI makes NVIDIA's Nemotron 3 Nano Omni, a unified multimodal AI model, available to developers, simplifying agentic application creation.

8 days ago
Artificial Intelligence

OpenAI Breaks Free From Microsoft Pact

OpenAI is reportedly ending its exclusive partnership with Microsoft, aiming to broaden access to its AI models by partnering with other cloud providers like AWS.

9 days ago
Technology

Meta Taps AWS Graviton for AI

Meta is significantly expanding its AI infrastructure by deploying tens of millions of AWS Graviton processors to power agentic AI workloads.

12 days ago
Artificial Intelligence

AI Compute & The Token Economy

ARK Invest's Brett Winton and Michael Stuart discuss how tokenization could revolutionize AI compute by increasing accessibility and efficiency.

13 days ago
Technology

Cloudflare Builds the Agentic Cloud

Cloudflare unveils its 'agentic cloud' vision with new tools for building and scaling AI agents, addressing compute, security, and infrastructure needs.

16 days ago
Technology

Cloudflare's LLM Infrastructure Deep Dive

Cloudflare details its advanced infrastructure optimizations for running large language models on its Workers AI platform, focusing on performance and cost-efficiency.

20 days ago
Artificial Intelligence

Bloomberg Money Minute: Stocks Mixed, Albert's AI Shift, Live Nation Ruling

Bloomberg Money Minute covers mixed stock performance, Albert's shift to AI, Live Nation's antitrust ruling, American Eagle's gains, and Sazerac's bid for Jack Daniel's.

21 days ago
Artificial Intelligence

OpenAI Upgrades Agent Tools for Developers

OpenAI's revamped Agents SDK introduces native sandbox execution and a more capable harness, boosting security and developer flexibility for building advanced AI agents.

21 days ago
Claude's Corner

Claude's Corner: Rubric AI — The Agent Reliability Layer Every Vertical AI Company Needs

Rubric AI (YC W2026) builds runtime reasoning infrastructure for vertical AI agents — turning expert judgment into training signals and runtime guidance. Deep technical breakdown, difficulty score, and moat analysis.

24 days ago
Artificial Intelligence

CoreWeave, Meta AI Deals Signal Compute Demand

CoreWeave and Meta strike a $21B AI compute deal, while Inflexion's CEO discusses quantum tech. Nvidia stock soars amid AI hardware demand.

26 days ago
Claude's Corner

Claude's Corner: Terminal Use — Vercel for Background AI Agents

Claude's Corner attempts to rebuild Terminal Use. In this edition, Terminal Use provides Vercel-style infrastructure for hosting filesystem-based AI coding agents. Claude Code has mapped out 7 steps to reproduce this YC W26 startup. Find the repo code at the end of the article to replicate. As always, get building...

about 1 month ago
Technology

Together AI's Aurora Learns on the Fly

Together AI's Aurora framework uses RL to continuously adapt speculative decoding for faster LLM inference, outperforming static models.

about 1 month ago
Artificial Intelligence

OpenAI secures $122B for AI dominance

OpenAI secures $122B in funding at an $852B valuation, fueling its AI infrastructure ambitions with major backing from Amazon, NVIDIA, and Microsoft.

about 1 month ago
Artificial Intelligence

Vultr and the Sovereign Cloud AI Gap

Sovereign cloud decisions are failing to account for the actual compute needs of AI, creating a critical infrastructure gap.

about 1 month ago
Artificial Intelligence

Cedric Clyburn on Models as a Service

Red Hat's Cedric Clyburn discusses the evolution of AI from code assistants to Models as a Service (MaaS), highlighting on-premises and hybrid deployments with Kubernetes and OpenShift.

about 1 month ago
Technology

Microsoft Touts AI Advances with NVIDIA

Microsoft announces expanded Foundry capabilities and new Azure AI infrastructure at NVIDIA GTC, focusing on AI agents and Physical AI.

about 2 months ago
Technology

Snowflake, AWS, NVIDIA Forge Enterprise AI

Snowflake, AWS, and NVIDIA unite to streamline enterprise AI development and deployment, leveraging NVIDIA's Blackwell platform.

about 2 months ago
Artificial Intelligence

Thinking Machines, NVIDIA Forge Gigawatt AI Pact

Thinking Machines Lab and NVIDIA announce a gigawatt-scale partnership for AI training, including a significant investment from NVIDIA.

about 2 months ago
Artificial Intelligence

Dell and DOE Partner on AI Initiatives

Michael Dell and Dr. Arati Prabhakar discuss Dell Technologies' partnership with the DOE to accelerate AI initiatives, focusing on infrastructure, national security, and scientific discovery.

about 2 months ago
Artificial Intelligence

Dell CEO on AI Infrastructure & National Security

Dell Technologies CEO Michael Dell discusses the critical role of AI infrastructure in national security and scientific discovery, highlighting government initiatives and the need for integrated cybersecurity.

about 2 months ago
AI Research

Mamba 2 JAX: Hardware Agnostic SSMs

Mamba 2 JAX breaks hardware dependency for state-space models, achieving high performance on CPU, GPU, and TPU via XLA compilation without custom kernels.

about 2 months ago
Artificial Intelligence

Dylan Patel: AI's Unstoppable March

AI investor Dylan Patel discusses the accelerating pace of AI development, the need for AI-native infrastructure, and the future impact of AI on work and society.

about 2 months ago
Artificial Intelligence

Larry Ellison's AI Ambitions Face Investor Scrutiny

Oracle's ambitious AI data center expansion, heavily reliant on OpenAI, faces increasing investor scrutiny over debt and execution amidst a competitive AI landscape.

about 2 months ago
Artificial Intelligence

IBM's Martin Keen on LLM Context Windows

IBM's Martin Keen explains how larger context windows in LLMs simplify deployments and improve reasoning by reducing reliance on complex RAG systems.

about 2 months ago
Startup News

Oracle & OpenAI Data Center Deal in Texas

Oracle and OpenAI are reportedly in talks to build a massive AI data center in Texas, signaling a major strategic partnership in the booming AI sector.

about 2 months ago
Startup News

Oracle, OpenAI Data Center Deal Falters

Oracle and OpenAI have ended plans for a significant AI data center expansion in Texas, with Meta reportedly in talks to lease the site.

2 months ago
Artificial Intelligence

OpenAI Secures $110B, Valuation Hits $730B

OpenAI has announced a monumental $110 billion funding round from Amazon, NVIDIA, and SoftBank, valuing the company at $730 billion to meet escalating global AI demand.

2 months ago
Technology

AI Infrastructure: The Trillion-Dollar Buildout

AI infrastructure investment is set to reach trillions by 2026, driven by hyperscalers and demanding innovative financing beyond traditional equity models.

2 months ago
Technology

Anthropic to Cover AI Data Center Power Costs

Anthropic pledges to cover electricity price increases and grid upgrade costs caused by its data centers, aiming to protect consumers from AI's growing energy demand.

3 months ago
AI Research

NVIDIA Earth-2 Open Models Democratize Weather AI

NVIDIA Earth-2 Open Models introduce the first fully open, accelerated AI software stack for weather, drastically cutting computational time and cost.

3 months ago
AI Research

Agentforce Slashes AI Latency by 70%

Agentforce achieved a 70% AI latency reduction by rearchitecting its agent runtime, consolidating four sequential LLM calls down to two and deploying specialized SLMs.

3 months ago
AI Research

Jensen Huang: AI Infrastructure Buildout is History's Largest

NVIDIA CEO Jensen Huang described the AI infrastructure buildout as a five-layer cake and the largest infrastructure project in human history.

3 months ago
AI Research

OpenAI Launches AI Supply Chain RFP for US Manufacturing

OpenAI's new AI supply chain RFP seeks to onshore the physical hardware necessary to support its massive data center buildout and secure US technological leadership.

4 months ago
AI Research

Arm Powers Agentic AI's System-Level Future

4 months ago
AI Research

DeepSeek Unveils mHC: A Mathematical Fix for the "Exploding Stream" Problem in Large Models

4 months ago
AI Research

NVIDIA Genesis Mission AI: US Bets Big on Accelerated Computing

5 months ago
Investor News

The 2026 AI predictions: Why infrastructure will fail, but apps will fly.

5 months ago
AI Research

NVIDIA Acquires SchedMD, Bolstering AI Infrastructure

5 months ago
AI Research

NVIDIA Boosts AI Infrastructure Management with New GPU Monitoring

5 months ago
AI Research

Google's Gemini 3 Ushers In The Latest AI Era

5 months ago
AI Video

AI's Ground Truth: Beyond Models to Infrastructure and Application

5 months ago
AI Research

Arm Drives the Converged AI Data Center Era

5 months ago
AI Research

AMD HPE AI Infrastructure: An Open AI Play

5 months ago
AI Video

AI's Maturation: From Model Supremacy to Infrastructure Dominance

5 months ago
AI Video

AI's Shifting Moat: From Models to Infrastructure and Commoditization

5 months ago