#Multimodal AI

50 articles with this tag

Agentic Vision Gemini 3 Flash: Code Execution Solves Visual Hallucination
AI Research

Agentic Vision Gemini 3 Flash: Code Execution Solves Visual Hallucination

Agentic Vision Gemini 3 Flash shifts multimodal AI from static image processing to an active, code-driven investigation, dramatically improving accuracy and verifiability.

13 days ago
Sparkli AI raises $5M to kill the EdTech chatbot for kids
Funding Round

Sparkli AI raises $5M to kill the EdTech chatbot for kids

Sparkli AI, founded by Google alums, raised a $5 million pre-seed round to develop a multimodal, simulation-based learning engine for children aged 5 to 12.

19 days ago
Argos Framework Delivers Grounded AI Reasoning
AI Research

Argos Framework Delivers Grounded AI Reasoning

Argos is an agentic verification framework that fundamentally changes reinforcement learning by rewarding models only for Grounded AI reasoning based on verifiable evidence.

20 days ago
Gemini API Data Ingestion Gets Production Ready
AI Research

Gemini API Data Ingestion Gets Production Ready

Google has upgraded Gemini API data ingestion to support persistent storage via GCS registration and external signed URLs, boosting the inline limit to 100MB.

28 days ago
The AI Pet Startup That Claims to Translate Your Dog's Thoughts
Funding Round

The AI Pet Startup That Claims to Translate Your Dog's Thoughts

about 1 month ago
Google Gemini 3 Redefines AI Reasoning and Efficiency
AI Research

Google Gemini 3 Redefines AI Reasoning and Efficiency

about 2 months ago
Google AI Tips: A Year of Ubiquitous Intelligence
AI Research

Google AI Tips: A Year of Ubiquitous Intelligence

about 2 months ago
T5Gemma 2 Multimodal Ushers In Efficient AI Future
AI Research

T5Gemma 2 Multimodal Ushers In Efficient AI Future

about 2 months ago
Tinker launches OpenAI API compatibility, challenging vendor lock-in.
AI Research

Tinker launches OpenAI API compatibility, challenging vendor lock-in.

about 2 months ago
Gemini Google Translate Elevates Nuance
AI Research

Gemini Google Translate Elevates Nuance

about 2 months ago
Gemma 3n Powers Real-World Impact at the Edge
AI Research

Gemma 3n Powers Real-World Impact at the Edge

2 months ago
AI Research

FACTS Benchmark Suite Elevates LLM Factuality Scrutiny

2 months ago
AI Precision Oncology Gets Scalable Boost from Microsoft AI
AI Research

AI Precision Oncology Gets Scalable Boost from Microsoft AI

2 months ago
Google's Gemini 3 Ushers In The Latest AI Era
AI Research

Google's Gemini 3 Ushers In The Latest AI Era

2 months ago
VoiceVision RAG: Beyond Text, Towards True Multimodal Document Intelligence
AI Video

VoiceVision RAG: Beyond Text, Towards True Multimodal Document Intelligence

2 months ago
Google TAU AI Partnership Expands Foundational AI Research
AI Research

Google TAU AI Partnership Expands Foundational AI Research

2 months ago
Google Cloud's Nano Banana Transforms Text-to-Vision Capabilities
AI Video

Google Cloud's Nano Banana Transforms Text-to-Vision Capabilities

3 months ago
Gemini 3 Unleashes a New Era of AI-Powered Creation
AI Video

Gemini 3 Unleashes a New Era of AI-Powered Creation

3 months ago
Meta’s Segment Anything Model 3 masters text and video
AI Research

Meta’s Segment Anything Model 3 masters text and video

3 months ago
Gemini 3: Google's Ambitious Leap Towards Universal AI Integration
AI Video

Gemini 3: Google's Ambitious Leap Towards Universal AI Integration

3 months ago
Google Gemini 3 Elevates AI with Agentic Interfaces
AI Research

Google Gemini 3 Elevates AI with Agentic Interfaces

3 months ago
NotebookLM Deep Research Redefines AI Analysis
AI Research

NotebookLM Deep Research Redefines AI Analysis

3 months ago
Marble World Model Goes Public, Redefining 3D Generation
Artificial Intelligence

Marble World Model Goes Public, Redefining 3D Generation

3 months ago
MMCTAgent: Microsoft's Multimodal Reasoning Agent Tackles Long-Form Video
AI Research

MMCTAgent: Microsoft's Multimodal Reasoning Agent Tackles Long-Form Video

3 months ago
Google's Nano Banana: The Human-Centric Evolution of Visual AI
AI Video

Google's Nano Banana: The Human-Centric Evolution of Visual AI

3 months ago
Emotive AI Redefines Customer Experience Dynamics
AI Research

Emotive AI Redefines Customer Experience Dynamics

3 months ago
Signify Elevates Support with Advanced Retrieval Augmented Generation
AI Research

Signify Elevates Support with Advanced Retrieval Augmented Generation

3 months ago
OlmoEarth Redefines Earth Observation Foundation Models
AI Research

OlmoEarth Redefines Earth Observation Foundation Models

3 months ago
OpenAI's Patent Strategy: Why the AI Leader Has Far Fewer Patents Than You'd Expect
Startup News

OpenAI's Patent Strategy: Why the AI Leader Has Far Fewer Patents Than You'd Expect

4 months ago
Automotive AI: Redefining Vehicle Design Quietly
AI Research

Automotive AI: Redefining Vehicle Design Quietly

Artificial intelligence is fundamentally reshaping vehicle design, moving beyond the long-promised fully autonomous car to deliver immediate, tangible improvements in today's vehicles. This evolution, often subtle, is driven by a sophisticated blend of on-device intelligence...

4 months ago
Fal.ai raises funding to advance multimodal AI platform
Funding Round

Fal.ai raises funding to advance multimodal AI platform

4 months ago
Nano Banana AI Elevates NotebookLM Video Overviews
AI Research

Nano Banana AI Elevates NotebookLM Video Overviews

4 months ago
Gemini 2.5 Pro Transforms Video Processing with Single API Calls
AI Video

Gemini 2.5 Pro Transforms Video Processing with Single API Calls

4 months ago
Google AI Plus Expands to 40 New Countries, Shaking Up the AI Race
AI Research

Google AI Plus Expands to 40 New Countries, Shaking Up the AI Race

4 months ago
Gemini App Updates: Google Sharpens Its AI Assistant Edge
AI Research

Gemini App Updates: Google Sharpens Its AI Assistant Edge

5 months ago
Google Gemini Photo Video: Animating Your Stills
AI Research

Google Gemini Photo Video: Animating Your Stills

5 months ago
Google's Gemini Native Image Editing: A New AI Battleground
AI Research

Google's Gemini Native Image Editing: A New AI Battleground

5 months ago
Image Gen API Unlocks Multimodal Design Dialogue
AI Video

Image Gen API Unlocks Multimodal Design Dialogue

5 months ago
AI Research

AI Models that Compete, Mate, and Evolve Like Living Organisms

6 months ago
AI Research

Meta FAIR Wins Algonauts 2025 with a Trimodal Brain Model

6 months ago
GPT-5 Unveils Autonomous Capabilities and Multimodal Understanding
AI Video

GPT-5 Unveils Autonomous Capabilities and Multimodal Understanding

6 months ago
DeepMind Proposes Radical Shift in AI Intelligence Benchmarking
Startup News

DeepMind Proposes Radical Shift in AI Intelligence Benchmarking

Google DeepMind has unveiled a significant new initiative aimed at fundamentally rethinking how artificial intelligence capabilities are measured. In an announcement on its blog, the leading AI research institution detailed a comprehensive framework designed to...

6 months ago
AI Research

Cogito v2: Forging AI Intuition on the Path to Self-Improvement

Cogito v2 introduces a novel approach to AI scaling by internalizing reasoning processes, shifting from extensive search to cultivating genuine intuition. This is achieved by extending Iterated Distillation and Amplification (IDA).

6 months ago
Execution is the Moat: Sarah Guo's State of AI Startups
AI Video

Execution is the Moat: Sarah Guo's State of AI Startups

6 months ago
Multimodal AI Startup Reka AI Raises $110M at $1B Valuation
Funding Round

Multimodal AI Startup Reka AI Raises $110M at $1B Valuation

7 months ago
OpenAI’s New ChatGPT Agent Unifies AI Capabilities
AI Video

OpenAI’s New ChatGPT Agent Unifies AI Capabilities

7 months ago
Genspark Launches No-Code AI Agents with OpenAI Tech
Artificial Intelligence

Genspark Launches No-Code AI Agents with OpenAI Tech

7 months ago
Google France Accelerates AI in Healthcare Solutions
Artificial Intelligence

Google France Accelerates AI in Healthcare Solutions

7 months ago
Funding Round

Thinking Machines Lab Secures $2B Seed Funding at $12B Valuation

7 months ago
Microsoft & Alicante Launch PadChest-GR: AI Radiology Benchmark
Artificial Intelligence

Microsoft & Alicante Launch PadChest-GR: AI Radiology Benchmark

7 months ago