Bertrand Charpentier, Founder, President & Chief Scientist at Pruna AI, discusses the complexities and challenges of determining what constitutes 'state-of-the-art' in AI models. In his presentation, Charpentier highlights common pitfalls in AI benchmarking and offers insights into more reliable evaluation methods.
The Ambiguity of 'State-of-the-Art'
Charpentier begins by addressing the inherent ambiguity in the term 'state-of-the-art' within the AI community. He notes that different researchers and organizations may have varying interpretations, leading to a lack of a universal standard. This ambiguity is further compounded by the common practice of relying on public leaderboards to gauge model performance.
