NF-CoT: High-Bandwidth Latent Reasoning

The inherent seriality and discrete nature of textual chain-of-thought (CoT) in large language models impose significant limitations on computational bandwidth for reasoning. Verbalizing each intermediate step before proceeding, even for semantic or partial computations, creates a bottleneck.

Visual TL;DR. Textual CoT Bottleneck solves NF-CoT Framework. NF-CoT Framework uses Normalizing Flows. NF-CoT Framework maintains Preserves Autoregressive Strengths. NF-CoT Framework enables High-Bandwidth Latent Reasoning. High-Bandwidth Latent Reasoning leads to Boosted LLM Performance.

Textual CoT Bottleneck: seriality and discrete nature of textual chain-of-thought limits computational bandwidth
NF-CoT Framework: novel latent reasoning framework leveraging normalizing flows for continuous thoughts
Normalizing Flows: model continuous thoughts, offering higher-bandwidth alternative to explicit textual CoT
Preserves Autoregressive Strengths: native left-to-right generation, probabilistic sampling, KV-cache compatibility, tractable likelihood
High-Bandwidth Latent Reasoning: enables generation of continuous thought positions via NF head alongside text
Boosted LLM Performance: improves LLM performance and efficiency in tasks like code generation

Visual TL;DRQuickExplainDeeper

Bridging Continuous States and Autoregressive Generation

To address this, the researchers propose NF-CoT, a novel latent reasoning framework. It leverages normalizing flows to model continuous thoughts, offering a higher-bandwidth alternative to explicit textual CoT. Crucially, NF-CoT preserves key advantages of traditional autoregressive language models, including native left-to-right generation, probabilistic sampling, compatibility with KV-cache decoding, and tractable likelihood estimation. This is achieved by integrating a TARFlow-style normalizing flow directly within the LLM backbone, enabling the generation of continuous thought positions via an NF head alongside standard text generation from the LM head.

Efficiency and Performance Gains in Code Generation

The NF-CoT latent reasoning approach demonstrates tangible benefits, particularly on code-generation benchmarks. The framework not only improves pass rates compared to explicit CoT and prior latent-reasoning methods but also substantially reduces the intermediate-reasoning cost. This efficiency gain, coupled with enhanced performance, positions NF-CoT as a significant advancement in making complex reasoning more tractable and performant within LLMs.

NF-CoT: High-Bandwidth Latent Reasoning

Bridging Continuous States and Autoregressive Generation

Related startups

Efficiency and Performance Gains in Code Generation

AI Daily Digest