Over 285 million individuals globally face visual impairments, presenting persistent challenges in daily navigation, object identification, and personal interactions. Traditional assistive technologies often fall short due to limitations in predefined categories, reliance on cloud infrastructure, or the need for specialized hardware.
Related startups
Bridging the Gap with On-Device Intelligence
The VisionAId application redefines smartphone utility by integrating six on-device deep learning models, including metric monocular depth estimation, instance segmentation, visual and facial embeddings, face detection, and a custom banknote detector, all running efficiently via ONNX Runtime. This approach ensures real-time performance without constant cloud connectivity, a critical factor for accessibility. For enhanced scene understanding and object labeling, an optional cloud-based Google Gemini Flash model can be leveraged.
Personalized Assistance Through Few-Shot Learning
A core innovation is VisionAId's few-shot pipeline for personal object recognition. Users can train the system to identify specific items by providing a few images from different angles. Subsequently, the application can locate these personalized objects within the environment, guiding the user with augmented-reality markers, spatial audio cues, and distance-proportional haptic feedback. This multimodal feedback system, incorporating Romanian speech synthesis and voice commands, significantly boosts user independence.