Training custom machine learning models for specific business needs demands high-quality data, often sourced from human annotations. However, these annotations, particularly for video, are prone to errors. Uber Engineering has developed an ML-based system to tackle these bounding box annotation errors, aiming to ensure data integrity before it feeds into model training.
The challenge lies in video annotation, where long footage is split into segments for operators, creating opportunities for mistakes during the rejoining process. Traditional human review workflows are costly and inconsistent. Uber's solution, integrated into their in-house tool uLabel, offers real-time, automated validation.
The Problem with Manual Review
Human annotators can make mistakes. A second pair of eyes helps, but it doubles cost and time. This sequential process is inefficient for large-scale projects.
