#Inference

19 articles with this tag

The Real Reason for LLM Inference Nondeterminism
AI Research

The Real Reason for LLM Inference Nondeterminism

The true cause of LLM inference nondeterminism is not random GPU math, but a systemic failure of "batch invariance" tied to unpredictable server load.

5 months ago
Fal.ai's Generative Media Ascent: A Strategic Pivot to Untapped Frontiers
AI Video

Fal.ai's Generative Media Ascent: A Strategic Pivot to Untapped Frontiers

5 months ago
Baseten Secures $150M Series D for AI Inference Platform
Funding Round

Baseten Secures $150M Series D for AI Inference Platform

Baseten, an AI inference platform, secured $150 million in Series D funding. Bond led the round, valuing the company at $2.15 billion. This investment follows significant revenue growth and will further expand its AI application infrastructure.

5 months ago
Klein's Radical AI Model: Why Inference is Free
AI Video

Klein's Radical AI Model: Why Inference is Free

6 months ago
Unpacking the AI Agenda: Journalism's Lens on Hype, Capital, and Battlegrounds
AI Video

Unpacking the AI Agenda: Journalism's Lens on Hype, Capital, and Battlegrounds

6 months ago
Fal.ai's Blueprint for AI Video Dominance: Speed, Specialization, and Relentless Optimization
AI Video

Fal.ai's Blueprint for AI Video Dominance: Speed, Specialization, and Relentless Optimization

6 months ago
The Unseen Drivers of AI's Transformative Power
Artificial Intelligence

The Unseen Drivers of AI's Transformative Power

"This is the year... that inferencing surpass training,"

7 months ago
Prompt Optimization on Amazon Bedrock and Multi-Adapter Inference with SageMaker
AI Research

Prompt Optimization on Amazon Bedrock and Multi-Adapter Inference with SageMaker

<p>Users can optimize prompts across multiple models with a single API call.</p><p>Dynamic loading of adapters based on requests, facilitating hyper-personalized solutions in various industries.</p>

about 1 year ago
Neural Magic Acquired; Red Hat, Nvidia, and AMD Race to Acquire AI Model Optimization Startups
Startup News

Neural Magic Acquired; Red Hat, Nvidia, and AMD Race to Acquire AI Model Optimization Startups

<p>Red Hat's acquisition of Neural Magic marks a new chapter in AI model optimization, following major moves by Nvidia, AMD, and Microchip Technology.</p>

about 1 year ago
NeuReality Appoints Lynn Comp, Semiconductor Veteran and AMD Corporate VP, to its Board Of Directors
Press Release

NeuReality Appoints Lynn Comp, Semiconductor Veteran and AMD Corporate VP, to its Board Of Directors

over 2 years ago
Accelerating Deep Learning: Transforming Batch Processing into Real-Time Mastery
Opinions

Accelerating Deep Learning: Transforming Batch Processing into Real-Time Mastery

almost 3 years ago
Inference chip maker NeuReality bags $35 million to put AI models into production
Startup News

Inference chip maker NeuReality bags $35 million to put AI models into production

about 3 years ago
Samsung Ventures invests in Israeli AI systems and semiconductor company NeuReality
Startup News

Samsung Ventures invests in Israeli AI systems and semiconductor company NeuReality

over 3 years ago
AAEON Partners with AI Chipmaker Hailo to Enable Next-Gen AI Applications at the Edge
Startup News

AAEON Partners with AI Chipmaker Hailo to Enable Next-Gen AI Applications at the Edge

AAEON’s latest UP Bridge the Gap platforms are now compatible with the Hailo-8 AI module, offering unprecedented AI performance for edge devices across industries

almost 4 years ago
Leading AI Chipmaker Hailo Partners with KAGA FEI America to Support Growing Customer Base
Startup News

Leading AI Chipmaker Hailo Partners with KAGA FEI America to Support Growing Customer Base

By collaborating with KAGA FEI, a global distribution and supply chain specialist, Hailo will be better able to serve its growing base of North American customers.

about 4 years ago
Leading AI Chipmaker Hailo Raises $136 Million to Expand Edge AI Solutions as Global Demand Surges
Startup News

Leading AI Chipmaker Hailo Raises $136 Million to Expand Edge AI Solutions as Global Demand Surges

Series C funding round is the largest in the edge AI chip space to date, highlighting the exploding demand for advanced AI processors for smart cities, smart retail, Industry 4.0, automotive, and beyond

over 4 years ago
MicroSys Partners with Leading AI Chipmaker Hailo to Launch High-Performance, Embedded AI Platform
Startup News

MicroSys Partners with Leading AI Chipmaker Hailo to Launch High-Performance, Embedded AI Platform

MicroSys miriac® embedded modules and platforms combined with Hailo-8™ AI acceleration modules offer a high-performance, scalable embedded platform for AI processing at the edge, with applications in fields such as Industry 4.0, automotive and heavy machinery.

over 4 years ago
NeuReality unveils novel AI-centric platform to empower the growth of real-life AI applications
Startup News

NeuReality unveils novel AI-centric platform to empower the growth of real-life AI applications

NeuReality has redefined today’s outdated AI system architecture by developing an AI-centric inference platform based on a new type of System-on-Chip (SoC). The platform provides a leading compute solution for existing and emerging AI use cases.

over 4 years ago
This Startup's New AutoML Optimizer Supercharges Deep Learning Models to Production
Interview

This Startup's New AutoML Optimizer Supercharges Deep Learning Models to Production

almost 6 years ago