#Natural Language Processing
50 articles with this tag
Databricks Genie Tackles Carbon Data Blind Spots
Databricks Genie uses natural language to turn complex emissions data into actionable decarbonization strategies, moving sustainability from compliance to competitive advantage.
LinkedIn's AI Search Upgrade
LinkedIn is leveraging LLMs for semantic search, transforming how users find jobs and people by understanding intent over keywords.
SpatioRoute VLM: Dynamic Prompting for Video QA
SpatioRoute VLM revolutionizes zero-shot spatial video question answering with dynamic prompt routing, achieving SOTA without fine-tuning or 3D sensors.

AI Learns to See, Hear, and Understand
Multimodal AI analytics is enabling businesses to decode video, audio, and images, unlocking deeper insights from previously unstructured data.
WARDEN: Tackling Low-Resource Language AI
WARDEN pioneers a modular AI system for low-resource languages, using phoneme transfer and LLM-guided dictionaries to transcribe and translate Wardaman with minimal data.
LMPath: Semantics Supercharge UAV Search
LMPath integrates language and vision models to create semantically-aware exploration priors for UAVs, dramatically improving search mission efficiency over traditional geometric methods.
Beyond RGB: Grounding Vision-Language on Raw Sensor Data
PRISM-VL advances vision-language models by grounding them in raw camera measurements, not just RGB, significantly improving performance on challenging visual tasks.
Databricks Genie Tackles Healthcare Readmissions
Databricks Genie aims to bridge the gap between predicting patient readmissions and enabling timely clinical intervention through natural language data access.
Geometric Algebra for NLP Semantics
Geometric algebra offers a richer, structured foundation for natural language semantics, promising enhanced compositionality and interpretability beyond current linear algebra methods.
Healthcare Data: From Months to Minutes
Databricks and Redox cut clinical data integration times from months to minutes with natural language prompts and subsecond data streaming.
BioMiner: Unlocking Drug Discovery Data
BioMiner, a novel multi-modal framework, automates protein-ligand bioactivity extraction, accelerating drug discovery and enabling identification of novel therapeutic candidates.
Beyond Black-Box: Structuring Humor AI Reasoning
New IRS framework moves beyond black-box AI, structuring humor understanding via explicit incongruity-resolution reasoning for expert-level performance.
HiVLA: Decoupling Reasoning for Robotic Control
HiVLA decouples VLM reasoning from motor control using a hierarchical framework, enhancing robotic manipulation performance and preserving zero-shot capabilities.

Eon AI Agent Queries Backups
Eon AI Agent lets you query backup data using natural language, turning static archives into interactive platforms.
Instance-Aware VLP: Beyond Global Understanding
InstAP introduces instance-aware pre-training for VLP, enhancing instance-level reasoning and global understanding with the InstVL dataset.
ChatGPT Adds Voice Interaction
OpenAI introduces voice features to ChatGPT, enabling hands-free interaction for faster, more natural conversations and new use cases.

IBM Master Inventor Explains Multimodal AI
IBM Master Inventor Martin Keen explains the evolution of multimodal AI, contrasting feature-level fusion with native multimodality and the importance of temporal reasoning for video.
Personalized Driving with Vega
The Vega vision-language-action model enhances autonomous driving by enabling personalized, instruction-based navigation through a novel dataset and hybrid AI architecture.
Externalizing Agent Harnesses with Language
Researchers introduce Natural-Language Agent Harnesses (NLAHs) and an Intelligent Harness Runtime (IHR) to externalize agent control logic, enabling greater transferability and scientific study.
Medical VLMs Fail Critical Input Sanity Checks
Medical VLMs fail critical input validation tests, as revealed by the new MedObvious benchmark, highlighting a significant safety risk.
Perceptio: Spatial Grounding for LVLMs
Perceptio LVLM integrates explicit spatial tokens (segmentation, depth) to overcome LVLM limitations in fine-grained visual grounding, achieving SOTA across benchmarks.
3D Spatial Reasoning for VLM
Loc3R-VLM injects 3D spatial reasoning into 2D VLMs using monocular video, achieving SOTA in localization and 3D QA.
AI Drills Deeper for Oilfield Insights
Databricks introduces an AI agent that translates complex drilling data into natural language, simplifying operations and reducing costly downtime.
Descript Masters Multilingual Dubbing
Descript enhances its AI-powered video editor with OpenAI models for natural-sounding multilingual dubbing, overcoming timing and meaning challenges.

Microsoft's Phi-4-reasoning-vision-15B compact AI model
Microsoft Research's Phi-4-reasoning-vision-15B offers efficient multimodal AI, excelling in reasoning and vision tasks with less data and compute.
CHIMERA Dataset Boosts LLM Reasoning
Researchers introduce CHIMERA, a synthetic dataset enabling LLMs to achieve strong cross-domain reasoning capabilities with efficient training.

OpenAI's GPT-4.5 Enhances Web Search Integration
OpenAI researcher Josh discusses how GPT-4.5's web search integration is becoming more natural, conversational, and context-aware.
Multimodal LLMs: What's Lost in Translation?
New research reveals multimodal LLMs struggle to utilize non-textual data due to a 'mismatched decoder problem,' impacting their true understanding.
Less Data, More Alignment: SOTAlign
Researchers introduce SOTAlign, a framework that achieves robust cross-modal alignment using significantly less paired data by leveraging unpaired samples.
NAP: Unlocking Parallel Generation in Diffusion Language Models
Researchers propose NAP, a data-centric approach to enable true parallel generation in Diffusion Language Models by aligning training data with non-autoregressive decoding.
AI Agent for Grounded Chest X-ray Diagnosis
Researchers introduce CXReasonAgent, an AI diagnostic agent enhancing Chest X-ray interpretation by grounding LLM reasoning in clinical tools and visual evidence.
Multilingual LLM Guardrails Tested
Researchers tested how LLM guardrails perform across languages and policy phrasings, revealing significant variations that impact AI safety assessments.

Small language model optimization cracks complex business math
Microsoft’s OptiMind is a 20-billion parameter small language model that achieves high accuracy in converting natural language business problems into mathematical optimization models through expert-aligned training.

Ask Photos Transforms Personal Photo Discovery

Gemini Google Translate Elevates Nuance

AI Powers Railway History: A New Era for Digital Archives

Gemini Android Auto Redefines In-Car AI

Paage raises $2.2M to advance AI social commerce platform
Paage secured $2.2 million in new funding. This capital will advance its AI social commerce platform . The platform empowers creators and brands.
Paage raises $2.2M to advance AI social commerce platform
Paage secured $2.2 million in new funding. This capital will advance its AI social commerce platform . The platform empowers creators and brands.

Google Photos AI Features Redefine Memory Management
Solidatus raises £5M to advance AI data lineage platform
Data lineage provider Solidatus secured £5M to accelerate its AI-powered platform for enterprise data governance and compliance.

RealWear Arc 3 launch: A lighter AR headset with natural language AI for industry

Unpacking the Transformer: From RNNs to AI's Cornerstone

Sesame raises $250M to advance conversational AI

PolyAI’s Agentic AI Redefines Customer Service with Human-Like Empathy

Juicebox’s AI recruiting agents land $30M from Sequoia
Juicebox is betting the future of AI recruiting isn't just better search, but fully autonomous agents that handle the entire hiring pipeline.

AI Won't Kill Language Learning: The German Verb and Human Connection
Kotoba Technologies Lands $11.83M Seed 2 for AI Interpretation
Kotoba Technologies secured $11.83 million in Seed 2 funding. Globis Capital Partners and Boost Capital led the round, accelerating commercialization of its AI-...

Kotoba Technologies Lands $11.83M Seed 2 for AI Interpretation
Kotoba Technologies secured $11.83 million in Seed 2 funding. Globis Capital Partners and Boost Capital led the round. This investment will accelerate commercialization of its AI-powered simultaneous interpretation technology.
Nebulock Raises $8.5M for AI-Powered Threat Hunting
\n Nebulock , a Boston-based cybersecurity startup, has secured $8.5 million in total funding.