The current paradigm for computer-use agents (CUAs), relying on a sequential fetch-screenshot-execute loop with frequent LLM calls, is plagued by high latency and errors stemming from imprecise tool interactions. This approach struggles to meet the demands of efficient, real-world task automation.
Related startups
Compiling Natural Language to Executable Agent Code
A novel approach, termed agent just-in-time compilation, reframes agent execution by compiling natural language task descriptions directly into executable code. This compiled code can seamlessly integrate LLM calls, tool invocations, and sophisticated parallelization strategies, moving beyond the limitations of iterative LLM prompting. The system comprises three core components: a JIT-Planner for generating and validating cost-optimal code plans, a JIT-Scheduler employing Monte Carlo methods for exploring parallelization, and an invariant-enforcing tool protocol to guarantee correct tool usage by specifying preconditions and postconditions.
Unlocking Performance and Reliability Gains
This compiled approach demonstrates substantial improvements over existing methods. On average across five web applications, the JIT-Planner achieved a 10.4x speedup and a 28% increase in accuracy compared to Browser-Use. Furthermore, the JIT-Scheduler delivered a 2.4x speedup and a 9% accuracy boost over OpenAI's CUA. These results highlight the efficacy of agent just-in-time compilation in building more performant and reliable autonomous agents for complex web-based tasks.