LinkedIn is phasing out its long-standing Kafka infrastructure for a new homegrown system dubbed Northguard. This overhaul aims to address the exponential growth in data volume and complexity since Kafka was first implemented 15 years ago. The move is detailed in a recent LinkedIn Engineering post.
Data streams are fundamental to LinkedIn's thousands of services, enabling them to subscribe to and process information from other services. The ability to reprocess data is crucial for debugging and verification. Kafka, developed by LinkedIn a decade and a half ago, became the backbone for this ordered data pipeline, known as a log, supporting everything from user activity to AI features.