OpenAI is codifying how its artificial intelligence systems should act with the introduction of its formal Model Spec. This document details the intended behavior for AI models, aiming for clarity and public debate on how these powerful tools should operate.
The Model Spec is designed to make intended AI conduct explicit, moving beyond internal training processes to a format accessible to users, developers, researchers, and policymakers. It’s not a claim of current perfection but a target for future development, guiding training, evaluation, and improvement.
This initiative is part of OpenAI's broader strategy for safe and accountable AI, complementing efforts like the Preparedness Framework which addresses risks from advanced capabilities. The ultimate goal is to foster a gradual, iterative, and democratically legible transition to advanced AI, ensuring it aligns with human interests.