#Microsoft Research
13 articles with this tag

AI Agents on the Loose: Network Security Risks Emerge
Microsoft Research reveals how AI agents interacting at scale create new security risks like worms, reputation manipulation, and invisible attacks.

AutoAdapt: Microsoft's LLM Adaptation Fix
Microsoft's AutoAdapt framework automates LLM domain adaptation, making it faster, cheaper, and more reliable for real-world applications.

Microsoft's AsgardBench Tests AI's Planning Skills
Microsoft's AsgardBench benchmark tests AI agents' ability to adapt plans using real-time visual feedback, revealing current limitations in perception and state tracking.

Robots Get Better at Long-Term Planning
Microsoft's GroundedPlanBench and V2GP framework improve robot planning by jointly considering actions and locations, overcoming limitations of decoupled approaches.

AI Brains vs. Human Minds
Exploring the fundamental differences between transformer AI models and the human brain's continuous learning and sensory grounding.

Microsoft Debugs AI Agents with AgentRx
Microsoft Research launches AgentRx, an open-source framework and benchmark for systematically debugging AI agent failures, improving accuracy by over 23%.

AI Memory Gets a Brain Upgrade
Microsoft Research's PlugMem system transforms AI interaction logs into structured knowledge, boosting agent efficiency and performance.

Microsoft's Phi-4-reasoning-vision-15B compact AI model
Microsoft Research's Phi-4-reasoning-vision-15B offers efficient multimodal AI, excelling in reasoning and vision tasks with less data and compute.

Microsoft's AI Future Unpacked
Microsoft Research's new podcast, 'The Shape of Things to Come,' hosted by Doug Burger, explores AI's rapid advancements and future implications.

Microsoft's CORPGEN Boosts AI Multitasking
Microsoft Research unveils CORPGEN, an AI agent framework designed for complex workplace multitasking, boosting productivity by up to 3.5 times.

AI Learns Faster by Predicting the Future
AI learns faster with Predictive Inverse Dynamics Models (PIDMs) by forecasting future states, making imitation learning more data-efficient than traditional methods.

Argos Framework Delivers Grounded AI Reasoning
Argos is an agentic verification framework that fundamentally changes reinforcement learning by rewarding models only for Grounded AI reasoning based on verifiable evidence.

Small language model optimization cracks complex business math
Microsoft鈥檚 OptiMind is a 20-billion parameter small language model that achieves high accuracy in converting natural language business problems into mathematical optimization models through expert-aligned training.