Estimating heterogeneous treatment effects (HTEs) from survival data is paramount for precision medicine and individualized policy. However, the inherent complexities of survival analysis—censoring, unobserved counterfactuals, and intricate identification assumptions—have led to inconsistent and fragmented evaluation practices for existing HTE estimation methods. This paper introduces SurvHTE-Bench, the first comprehensive benchmark designed to address this critical gap.
Bridging the Evaluation Chasm in Survival HTE
The introduction of SurvHTE-Bench marks a significant step towards standardizing the evaluation of methods for estimating heterogeneous treatment effects in the presence of censored survival data. Prior to this work, the landscape of survival HTE estimation was characterized by a lack of unified assessment protocols, hindering direct comparisons and progress. This new benchmark aims to rectify this by providing a standardized framework that spans synthetic, semi-synthetic, and real-world datasets, enabling a more rigorous and reproducible comparison of current and future survival HTE methods under diverse conditions and realistic assumption violations.