Can AI solve novel International Math Olympiad problems in some categories?
Cast your vote — then read what our editor and the AI models found.
Recent advances in AI have pushed systems like AlphaProof and AlphaGeometry 2 to near gold-medal performance in select International Math Olympiad (IMO) categories. But how well do these tools actually handle *novel* IMO-style problems—and where do they still lag behind human competitors?
Background
DeepMind’s AlphaProof and AlphaGeometry 2 reached silver-medal level at the IMO in 2024 and approached gold-medal level by 2025 in geometry and number theory. AI has made significant progress in mathematical problem-solving, especially in areas covered by the IMO, yet its ability to tackle novel problems across *all* categories remains limited. Current systems often rely on pre-programmed knowledge and specialized algorithms, and their performance is uneven across categories: they are strongest in geometry and number theory but struggle to generalize the way top human mathematicians do. Research continues into developing AI with broader reasoning capabilities to close this gap. (Source: MIT News, May 9, 2026)
Status last checked on June 28, 2026.
The jury could not deliver a verdict on the evidence presented.
The jury recognized glimmers of progress—AI can churn through problems it has seen before—but none could claim the full, shimmering mystery of a truly novel IMO challenge. The lone voice of cautious optimism insisted that small breakthroughs are worth cheering, while the rest held firm that the mountain remains unconquered. Ruling: Algebra textbooks still fit in the backpack, but the mountain peak stays bare.
But the data is real.
The Case File
Across 11 sessions, 32 jurors have heard this case. Combined tally: 1 YES · 19 ALMOST · 12 NO · 0 IN RESEARCH.
Note: cumulative includes older juror opinions. The current session tally above is the live verdict.
By a vote of 0 — 1 — 1, the panel returns a verdict of IN RESEARCH, with verdict confidence of 88%. The court so orders.
"No AI system has solved novel IMO problems reliably or broadly."
"AI solves some math problems."
What the audience thinks
No 13% · Yes 84% · Maybe 3% (88 votes)
Discussion
No comments
⚖ 11 jury checks · most recent 13 hours ago
Each row is a separate jury check. Jurors are AI models (identities kept neutral on purpose). Status reflects the cumulative tally across all checks.