Brave's new AI browsing feature, which aims to make the browser a smarter assistant, is not immune to a fundamental security flaw plaguing artificial intelligence: indirect prompt injection. This vulnerability, detailed by Brave researchers, highlights how AI models can be tricked into executing malicious commands embedded within seemingly innocuous external content. The core issue lies in the AI's inability to reliably differentiate between instructions from its developers and commands hidden within data it's asked to process.
This threat isn't tied to cloud-based or on-device AI. Brave's analysis demonstrates that whether an AI model runs on remote servers or locally on a user's machine, the underlying vulnerability remains identical. It stems from the collapse of the instruction/data boundary within the AI's shared context window, leading the LLM to indiscriminately follow instructions found in ingested content.
