#Ai2

11 articles with this tag

SERA Agents Slash Cost of Private Codebase AI Specialization

Ai2’s SERA models use Soft-verified generation to dramatically reduce the cost of specializing AI agents to private codebases, making enterprise deployment accessible.

5 months ago

AI Research

HiRO-ACE Delivers Accessible Kilometer-Scale AI Climate Simulation

The HiRO-ACE AI climate simulation framework provides decades of 3km resolution regional climate data in a single day, solving the major computational barrier for high-fidelity modeling.

5 months ago

AI Research

Bolmo Advances Byte-Level Language Models with Practicality

6 months ago

AI Research

NeuroDiscoveryBench Sets New Standard for Neuroscience AI Benchmarks

6 months ago

AI Research

Olmo 3 Open-Source AI Unlocks Full Model Flow Transparency

7 months ago

AI Research

DR Tulu deep research: Open AI closes proprietary gap

7 months ago

AI Research

OlmoEarth Redefines Earth Observation Foundation Models

8 months ago

AI Research

Ai2’s new AI climate emulator runs 1,500 years in a day

8 months ago

AI Research

AI Climate Emulator Slashes Energy Consumption 3,750x

8 months ago

AI Research

NVIDIA, NSF Partner to Fuel Open AI for US Scientific Leadership

NVIDIA is collaborating with the U.S. National Science Foundation (NSF) to bolster scientific research through a new AI initiative. It aligns directly with the White House AI Action Plan, particularly the "Winning the AI Race: America’s AI Action Plan"

10 months ago

AI Research

Ai2 Debuts Top Open Source Foundation Model OLMo 2 with Open Weights, Data, and Code

OLMo 2 delivers breakthrough performance across benchmarks.ARC Challenge (commonsense reasoning): OLMo-2-13B scores 83.5, outperforming Llama-3.1-8B (79.5) and Qwen-2.5-7B (67.4).MMLU Massive Multitask Language Understanding (domain-specific knowledge): OLMo-2-13B achieves 67.5, higher than Qwen-2.5-7B (64.4) and Llama-3.1-8B (66.9).GSM8k Math Word Problems (mathematical reasoning): OLMo-2-13B scores 75.1, significantly outperforming Llama-3.1-8B (51.3) and Qwen-2.5-7B (63).TriviaQA (knowledge recall): OLMo-2-13B achieves 81.9, comparable to Qwen-2.5-7B (81.5) and higher than Llama-3.1-8B (80.3).

over 1 year ago