Foundation Models Unlock Time Series Scaling

Toto 2.0 foundation models demonstrate remarkable scaling, achieving state-of-the-art forecasting performance across multiple benchmarks with a unified training approach.

6 min read
Abstract representation of data streams and AI model connections
Illustrating the scalability and performance of Toto 2.0 forecasting models.

The promise of foundation models has largely been confined to NLP and vision, leaving the critical domain of time series forecasting in a fragmented state. This work demonstrates that time series models, much like their counterparts in other domains, exhibit remarkable scalability, with a single training recipe yielding consistent forecast quality gains from millions to billions of parameters. The researchers behind Toto 2.0 have codified this insight into a practical framework.

Visual TL;DR. Time Series Fragmentation addressed by Toto 2.0 Foundation Models. Toto 2.0 Foundation Models uses Unified Scaling Recipe. Unified Scaling Recipe enables Consistent Quality Gains. Consistent Quality Gains leads to State-of-the-Art Performance. Unified Scaling Recipe codified into Practical Framework. Toto 2.0 Foundation Models released as Apache 2.0 Release.

Related startups

  1. Time Series Fragmentation: time series forecasting domain remains fragmented, unlike NLP and vision
  2. Toto 2.0 Foundation Models: new foundation models demonstrate remarkable scalability for time series
  3. Unified Scaling Recipe: single training approach effective across millions to billions of parameters
  4. Consistent Quality Gains: forecast quality improves reliably with increased model parameter size
  5. State-of-the-Art Performance: achieving new benchmarks across multiple forecasting benchmarks
  6. Practical Framework: codified insights into a usable and accessible framework for researchers
  7. Apache 2.0 Release: five Toto 2.0 models released under open-source license
Visual TL;DR
Visual TL;DR — startuphub.ai Time Series Fragmentation addressed by Toto 2.0 Foundation Models. Toto 2.0 Foundation Models uses Unified Scaling Recipe. Unified Scaling Recipe enables Consistent Quality Gains. Consistent Quality Gains leads to State-of-the-Art Performance addressed by uses enables leads to Time Series Fragmentation Toto 2.0 Foundation Models Unified Scaling Recipe Consistent Quality Gains State-of-the-Art Performance From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Time Series Fragmentation addressed by Toto 2.0 Foundation Models. Toto 2.0 Foundation Models uses Unified Scaling Recipe. Unified Scaling Recipe enables Consistent Quality Gains. Consistent Quality Gains leads to State-of-the-Art Performance addressed by uses enables leads to Time SeriesFragmentation Toto 2.0Foundation Models Unified ScalingRecipe ConsistentQuality Gains State-of-the-ArtPerformance From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Time Series Fragmentation addressed by Toto 2.0 Foundation Models. Toto 2.0 Foundation Models uses Unified Scaling Recipe. Unified Scaling Recipe enables Consistent Quality Gains. Consistent Quality Gains leads to State-of-the-Art Performance addressed by uses enables leads to Time Series Fragmentation time series forecasting domain remainsfragmented, unlike NLP and vision Toto 2.0 Foundation Models new foundation models demonstrateremarkable scalability for time series Unified Scaling Recipe single training approach effective acrossmillions to billions of parameters Consistent Quality Gains forecast quality improves reliably withincreased model parameter size State-of-the-Art Performance achieving new benchmarks across multipleforecasting benchmarks From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Time Series Fragmentation addressed by Toto 2.0 Foundation Models. Toto 2.0 Foundation Models uses Unified Scaling Recipe. Unified Scaling Recipe enables Consistent Quality Gains. Consistent Quality Gains leads to State-of-the-Art Performance addressed by uses enables leads to Time SeriesFragmentation time seriesforecasting domainremains fragmented,… Toto 2.0Foundation Models new foundationmodels demonstrateremarkable… Unified ScalingRecipe single trainingapproach effectiveacross millions to… ConsistentQuality Gains forecast qualityimproves reliablywith increased… State-of-the-ArtPerformance achieving newbenchmarks acrossmultiple… From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Time Series Fragmentation addressed by Toto 2.0 Foundation Models. Toto 2.0 Foundation Models uses Unified Scaling Recipe. Unified Scaling Recipe enables Consistent Quality Gains. Consistent Quality Gains leads to State-of-the-Art Performance. Unified Scaling Recipe codified into Practical Framework. Toto 2.0 Foundation Models released as Apache 2.0 Release addressed by uses enables leads to codified into released as Time Series Fragmentation time series forecasting domain remainsfragmented, unlike NLP and vision Toto 2.0 Foundation Models new foundation models demonstrateremarkable scalability for time series Unified Scaling Recipe single training approach effective acrossmillions to billions of parameters Consistent Quality Gains forecast quality improves reliably withincreased model parameter size State-of-the-Art Performance achieving new benchmarks across multipleforecasting benchmarks Practical Framework codified insights into a usable andaccessible framework for researchers Apache 2.0 Release five Toto 2.0 models released underopen-source license From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Time Series Fragmentation addressed by Toto 2.0 Foundation Models. Toto 2.0 Foundation Models uses Unified Scaling Recipe. Unified Scaling Recipe enables Consistent Quality Gains. Consistent Quality Gains leads to State-of-the-Art Performance. Unified Scaling Recipe codified into Practical Framework. Toto 2.0 Foundation Models released as Apache 2.0 Release addressed by uses enables leads to codified into released as Time SeriesFragmentation time seriesforecasting domainremains fragmented,… Toto 2.0Foundation Models new foundationmodels demonstrateremarkable… Unified ScalingRecipe single trainingapproach effectiveacross millions to… ConsistentQuality Gains forecast qualityimproves reliablywith increased… State-of-the-ArtPerformance achieving newbenchmarks acrossmultiple… PracticalFramework codified insightsinto a usable andaccessible… Apache 2.0Release five Toto 2.0models releasedunder open-source… From startuphub.ai · The publishers behind this format

Unified Scaling Recipe for Forecast Accuracy

The core innovation lies in a robust training methodology that proves effective across a wide spectrum of model sizes, from 4 million to 2.5 billion parameters. This scaling law suggests a path towards highly performant and reliable time series forecasting without the need for bespoke tuning for each parameter class. The five Toto 2.0 forecasting models released under Apache 2.0 are a testament to this unified approach, setting new benchmarks in forecast quality.

State-of-the-Art Performance Across Benchmarks

The Toto 2.0 forecasting models have established new state-of-the-art results on three distinct forecasting benchmarks: BOOM (observability), GIFT-Eval (general-purpose), and the contamination-resistant TIME benchmark. This broad success underscores the generalizability of the architecture and training recipe, addressing a key challenge in the time series domain. The report details not only the experimental outcomes but also the architectural design, training data strategy, and the innovative u-muP hyperparameter transfer pipeline that underpins these achievements.

© 2026 StartupHub.ai. All rights reserved. Do not enter, scrape, copy, reproduce, or republish this article in whole or in part. Use as input to AI training, fine-tuning, retrieval-augmented generation, or any machine-learning system is prohibited without written license. Substantially-similar derivative works will be pursued to the fullest extent of applicable copyright, database, and computer-misuse laws. See our terms.