When OpenAI dropped its shiny new “o3” AI model in December, it came with a bold claim: it crushed some seriously hard math problems like a genius robot with a calculator and a caffeine addiction. Specifically, OpenAI said o3 could solve over 25% of the questions in a notoriously tough math benchmark called FrontierMath. That was a big deal, because the next-best AI model on the market could only manage…
Read More