#Kaggle
6 articles with this tag

AI Research
Google DeepMind Tackles AI Evaluation Challenges
Google DeepMind's Nicholas Kang and Michael Aaron discuss the challenges in current AI evaluation and Kaggle's innovative solutions like Hackathons, Agent Exams, and Game Arena.
26 days ago

AI Research
DeepMind's AGI Roadmap
Google DeepMind unveils a cognitive framework and Kaggle hackathon to standardize AGI progress measurement, offering $200K in prizes.
3 months ago

AI Research
Kaggle Community Benchmarks Decentralize AI Evaluation
Kaggle Community Benchmarks provide a dynamic, transparent framework for evaluating LLMs on complex, real-world tasks like code generation and tool use.
5 months ago

AI Research
Kaggle's AI Agents Course Signals Industry Shift
6 months ago
AI Research
FACTS Benchmark Suite Elevates LLM Factuality Scrutiny
6 months ago

Podcast
Sorted and Sifted Machine Learning, with Anthony Goldbloom
over 4 years ago