#Vision Language Models
3 articles with this tag
AI Research
Beyond Observable Data: Imaginative Perception for VLMs
Researchers introduce Imaginative Perception Tokens (IPTs) to enable VLMs to reason about unobserved spatial configurations, outperforming textual chain-of-thought.
16 days ago

AI Research
RL Fixes Overfitting in AI Radiology Reports
Microsoft Research’s UniRG framework uses reinforcement learning guided by clinical error signals to achieve state-of-the-art performance in AI radiology reports.
5 months ago

AI Research
olmOCR 2 Redefines AI Document OCR Accuracy
8 months ago