Calvin Qi of Harvey AI and Chang She, co-founder of LanceDB, recently illuminated the intricate challenges and innovative solutions in scaling Retrieval Augmented Generation (RAG) systems for enterprise applications. Speaking at the AI Engineer World's Fair in San Francisco, their discussion centered on the demanding landscape of legal AI, where accuracy, privacy, and massive scale are non-negotiable. Their insights highlight a critical shift in how data infrastructure must evolve to meet the unique demands of multimodal AI workloads.
Harvey, a leading legal AI assistant, processes an immense spectrum of data, ranging from user-uploaded files for on-demand context (1-50 documents) to long-term project vaults (100-100,000 documents), and vast third-party corpuses comprising millions of legal documents like legislation, case laws, and global regulations. This sheer volume presents significant scaling hurdles, complicated by the inherent density and complexity of legal texts. As Calvin Qi noted, "We handle data all different sort of volumes and forms."
