Martin Keen, a Master Inventor at IBM, breaks down two fundamental approaches to how AI models access and remember information: Long Context and Cache Augmented Generation (CAG). In this insightful video, Keen illustrates the distinct mechanisms and trade-offs of each method, offering a clear understanding of how AI models can effectively process and recall information from extended data sources.
Understanding Long Context and CAG
Keen begins by explaining that LLMs inherently rely on their training data. However, to utilize external knowledge, they employ two main strategies. The first, Long Context, involves feeding the model a large amount of information directly within its input prompt. The second, Cache Augmented Generation (CAG), involves a more sophisticated process where relevant information is retrieved and then provided to the model.
