Current AI agents struggle with the complex, interdependent multitasking inherent in real-world corporate environments. While benchmarks typically test one task at a time, a new initiative from Microsoft Research, dubbed CORPGEN, aims to bridge this gap with advanced 'digital employees' designed for genuine workplace productivity.
Traditional agents rapidly degrade under multi-task loads. In Microsoft's Multi-Horizon Task Environments (MHTEs), which simulate dozens of concurrent tasks over several hours, leading systems saw completion rates plummet from 16.7% to just 8.7%. CORPGEN tackles these limitations through hierarchical planning, isolated memory, and adaptive summarization, preventing information overload and cross-task interference.
