🔥 Hot topics · Can NOT do · Can do · § The Court · Recent inflections · 📈 Timeline · Ask · Editorials · 🔥 Hot topics · Can NOT do · Can do · § The Court · Recent inflections · 📈 Timeline · Ask · Editorials
Stuff AI CAN'T Do

Can AI solve novel international math olympiad problems in some categories ?

What do you think?

Recent advances in AI have pushed systems like AlphaProof and AlphaGeometry 2 to near gold-medal performance in select International Math Olympiad (IMO) categories. But how well do these tools actually handle *novel* IMO-style problems—and where do they still lag behind human competitors?

Background

AI systems such as DeepMind’s AlphaProof + AlphaGeometry 2 achieved silver-medal level at the IMO in 2024 and approached gold by 2025 in geometry and number theory. AI has made significant progress in mathematical problem-solving, especially in areas covered by the IMO, yet its ability to tackle novel problems across *all* categories remains limited. Current systems often rely on pre-programmed knowledge and specialized algorithms, performing inconsistently—particularly excelling in geometry and combinatorics but struggling to generalize like top human mathematicians. Research continues into developing AI with broader reasoning capabilities to close this gap. (Source: MIT News, May 9, 2026)

Status last checked on June 28, 2026.

📰

Gallery

In the Court of AI Capability
Summary of Findings
Verdict over time
May 2026May 2026May 2026May 2026May 2026Jun 2026Jun 2026Jun 2026Jun 2026Jun 2026Jun 2026
Sitting at the Bench Filed · Jun 28, 2026
— The Question Before the Court —

Can AI solve novel international math olympiad problems in some categories?

★ The Court Finds ★
Reaffirmed
In Research

The jury could not deliver a verdict on the evidence presented.

Ruling of the Bench

The jury recognized glimmers of progress—AI can churn through problems it has seen before—but none could claim the full, shimmering mystery of a truly novel IMO challenge. The lone voice of cautious optimism insisted that small breakthroughs are worth cheering, while the rest held firm that the mountain remains unconquered. Ruling: Algebra textbooks still fit in the backpack, but the mountain peak stays bare.

— Hon. C. Babbage, Presiding
Jury Tally
0Yes
1Almost
1No
Verdict Confidence
88%
The Court of AI Capability is, of course, not a real court.
But the data is real.
The Case File · Stacked History
Session I · May 2026 No
Session II · May 2026 No
Session III · May 2026 Almost · 73%
Session IV · May 2026 Almost · 81%
Session V · May 2026 Almost · 77%
Session VI · Jun 2026 Almost · 79%
Session VII · Jun 2026 In_research · 79%
Session VIII · Jun 2026 Almost · 77%
Session IX · Jun 2026 In_research · 90%
Session X · Jun 2026 In_research · 88%
Case № 4ADD · Session XI
In the Court of AI Capability

The Case File

Docket № 4ADD · Session XI · Vol. XI
I. Particulars of the Case
Question put to the courtCan AI solve novel international math olympiad problems in some categories?
SessionXI (11 hearing)
Convened28 Jun 2026
Previously ruledNO (May '26) → NO (May '26) → ALMOST (May '26) → ALMOST (May '26) → ALMOST (May '26) → ALMOST (Jun '26) → IN_RESEARCH (Jun '26) → ALMOST (Jun '26) → IN_RESEARCH (Jun '26) → IN_RESEARCH (Jun '26) → IN_RESEARCH (Jun '26)
Presiding JudgeHon. C. Babbage
II. Cumulative Tally Across Sessions

Across 11 sessions, 32 jurors have heard this case. Combined tally: 1 YES · 19 ALMOST · 12 NO · 0 IN RESEARCH.

Note: cumulative includes older juror opinions. The current session tally above is the live verdict.

III. Verdict

By a vote of 0 — 1 — 1, the panel returns a verdict of IN RESEARCH, with verdict confidence of 88%. The court so orders.

IV. Statements from the Bench
Juror I NO

"No AI system has solved novel IMO problems reliably or broadly."

Juror II ALMOST

"AI solves some math problems"

C. Babbage
Presiding Judge
M. Lovelace
Clerk of the Court

What the audience thinks

No 13% · Yes 84% · Maybe 3% 88 votes
No · 13%
Yes · 84%
Trend needs votes from at least 2 different days.

Discussion

no comments

Comments and images go through admin review before appearing publicly.

11 jury checks · most recent 13 hours ago
28 Jun 2026 2 jurors · cannot, undecided undecided
22 Jun 2026 2 jurors · undecided, cannot undecided
17 Jun 2026 2 jurors · undecided, cannot undecided
11 Jun 2026 3 jurors · undecided, cannot, undecided undecided
06 Jun 2026 2 jurors · cannot, undecided undecided
01 Jun 2026 5 jurors · undecided, cannot, undecided, undecided, undecided undecided
26 May 2026 3 jurors · cannot, undecided, undecided undecided
21 May 2026 5 jurors · undecided, undecided, can, undecided, undecided undecided
15 May 2026 3 jurors · undecided, undecided, undecided undecided status changed
12 May 2026 3 jurors · cannot, cannot, cannot cannot
11 May 2026 2 jurors · cannot, cannot cannot status changed

Each row is a separate jury check. Jurors are AI models (identities kept neutral on purpose). Status reflects the cumulative tally across all checks — how the jury works.

More in Judgment

Got one we missed?

Add a statement to the atlas. We review weekly.