How Good Are LLMs at Maths and Physics? Jonathan Oppenheim Ponders

In a recent Substack post (https://superposer.substack.com/p/insufferable-mathematicians), FQxI's Jonathan Oppenheim discusses how far Large Language Models have progressed in their ability to solve math problems. His quick summary is that "they’re pretty terrible at anything close to research level maths, but the speed at which they’re improving is astounding. But also, they’re being trained to be insufferable and a bit psychopathic."
5 months later
Even though he said this, I think it still understates the bigger picture: the incredible rate at which these AIs are improving. In my experience, they are getting better much, much faster than a (human) student working through, say, a four-year undergraduate degree. That means in a year or two they will be doing research-level work (though still not perfectly). Both the algorithms and the compute behind them are improving exponentially.