1 articles with this tag
New framework LLMSurgeon enables post-hoc analysis of LLM pretraining data mixtures using only generated text, addressing the critical need for auditing foundation models.