If you’re utilizing Large Language Models (LLMs) today, such as ChatGPT or Claude, you’ve likely stumbled upon its quirks: responses that are totally irrelevant, or those that simply aren’t quite what you prompted. And even if you manage to make it work, it’s hard to make changes - change prompts or models. And performance for your customers can quickly be degraded, without notice. Solving this problem is Israeli Generative AI startup Traceloop, a recent Y Combinator graduate that’s setting up guardrails to ensure the Generative AI LLM product you’re building doesn’t veer of course.
In an interview with StartupHub.ai, Traceloop’s CEO Nir Gazit explains it all. Founded in 2022 by Gazit and Gal Kleinman (CTO), graduates of the Israeli defense intelligence corps, Traceloop (formerly Enrolla), originally set out to solve test automation at scale. “We had been accustomed to top-tier software development with so many guardrails preventing you from releasing something errant into production,” explained Gazit during his time as the Tech Lead at Google’s Growth Quality team. Gazit was responsible for optimizing and measuring growth campaigns using machine learning techniques. After his transition to Fiverr as Chief Architect, his first “Aha” moment came to him: the team’s testing configuration was so weak that bad code was pushed into production regularly, prompting him and Kleinman to devise a solution that offers total coverage to the software testing life cycle.
The two set off, applied and got accepted to Y Combinator’s Winter 2022 batch, moved to San Francisco, and secured a Seed funding round. Their startup ambition was set into motion, with substantial momentum. “With a couple design partners, we started working on AI powered test automation,” explained Gazit. “We built autonomous agents that figured out your system and created a test. It’s a fairly complex system to test the system itself, and we went down that rabbit hole.”
