The prevailing wisdom in AI posits that vast datasets and powerful predictors are sufficient for robust understanding. However, a critical failure mode has been identified, challenging this paradigm in the realm of causal inference in AI. Researchers observed that while strong predictors excel at tasks with sufficient observational and interventional data, they falter significantly when estimating unidentified quantities, specifically, the couplings between counterfactual worlds.
The Unseen Couplings: Prediction's Counterfactual Collapse
Across hundreds of structural causal models, a strong predictor collapses on unidentified quantities, failing to represent uncertainty over counterfactual couplings. In 28% of models, the predictor defaults to a point estimate that no valid model can produce, while the true value resides within an admissible interval that more data cannot narrow. This isn't a data scarcity issue; it's a structural limitation where standard predictive models cannot capture the nuanced relationships across hypothetical scenarios. This gap highlights a fundamental challenge in moving beyond correlation to true causation.