1 articles with this tag
A new benchmark, M³Eval, reveals critical memory deficiencies in multi-modal models, particularly in disentangled representations, interference patterns, and temporal grounding.