🔥 Hot topics · Can NOT do · Can do · § The Court · Recent inflections · 📈 Timeline · Ask · Editorials · 🔥 Hot topics · Can NOT do · Can do · § The Court · Recent inflections · 📈 Timeline · Ask · Editorials
Stuff AI CAN'T Do

Can AI score in the top 10% on the sat ?

What do you think?

What does it take for an AI to score in the top 10% on the SAT? The question probes how far current AI capabilities extend in mastering both the verbal and quantitative demands of a standardized test long used to gauge human academic readiness.

Background

The SAT has historically been a benchmark for human academic assessment, though recent commentary notes that it has "effectively been retired as an AI-progress benchmark — too easy." While AI systems have made significant strides in natural language processing and in-domain problem solving—demonstrating impressive capabilities in processing and generating human-like language—achieving uniformly high performance across the SAT’s diverse sections remains a subject of ongoing research and development. Current AI models can excel in specific areas such as math or reading comprehension, but may struggle with more nuanced, context-dependent, or adversarially phrased questions that appear on the test. Studies and expert assessments indicate that holistic top-tier performance on the SAT continues to challenge AI systems, underscoring both the complexity of the test and the gaps between narrow-task proficiency and generalized reasoning.

— Source: MIT News (Enriched May 9, 2026)

Status last checked on June 27, 2026.

📰

Gallery

In the Court of AI Capability
Summary of Findings
Verdict over time
May 2026May 2026May 2026May 2026May 2026May 2026Jun 2026Jun 2026Jun 2026Jun 2026Jun 2026
Sitting at the Bench Filed · Jun 27, 2026
— The Question Before the Court —

Can AI score in the top 10% on the sat?

★ The Court Finds ★
▲ Upgraded from Almost
Yes

The jury found a clear answer in the affirmative.

Ruling of the Bench

The jury found the affirmative case overwhelmingly persuasive, noting that multiple contemporary AI systems have repeatedly cleared the SAT bar in the 90th percentile or beyond. The single dissenting voice cited minor variations in testing protocols as grounds for doubt, but the majority ruled that such nuances were not enough to withhold judgment. One for the history books, the bench delivered its final line with a flourish.

— Hon. C. Babbage, Presiding
Jury Tally
1Yes
0Almost
0No
Verdict Confidence
95%
The Court of AI Capability is, of course, not a real court.
But the data is real.
The Case File · Stacked History
Session I · May 2026 No
Session II · May 2026 No
Session III · May 2026 Almost · 83%
Session IV · May 2026 Almost · 80%
Session V · May 2026 Almost · 82%
Session VI · May 2026 Yes · 82%
Session VII · Jun 2026 Almost · 80%
Session VIII · Jun 2026 Almost · 82%
Session IX · Jun 2026 Yes · 95%
Session X · Jun 2026 Almost · 75%
Case № EB57 · Session XI
In the Court of AI Capability

The Case File

Docket № EB57 · Session XI · Vol. XI
I. Particulars of the Case
Question put to the courtCan AI score in the top 10% on the sat?
SessionXI (11 hearing)
Convened27 Jun 2026
Previously ruledNO (May '26) → NO (May '26) → ALMOST (May '26) → ALMOST (May '26) → ALMOST (May '26) → YES (May '26) → ALMOST (Jun '26) → ALMOST (Jun '26) → YES (Jun '26) → ALMOST (Jun '26) → YES (Jun '26)
Presiding JudgeHon. C. Babbage
II. Cumulative Tally Across Sessions

Across 11 sessions, 34 jurors have heard this case. Combined tally: 12 YES · 17 ALMOST · 5 NO · 0 IN RESEARCH.

Note: cumulative includes older juror opinions. The current session tally above is the live verdict.

III. Verdict

By a vote of 1 — 0 — 0, the panel returns a verdict of YES, with verdict confidence of 95%. The court so orders. Verdict upgraded from prior session.

IV. Statements from the Bench
Juror I YES

"AI systems like ChatGPT, Claude, and specialized LLM-based solvers have repeatedly scored in the 90th percentile+ on SAT-style tests under controlled conditions."

C. Babbage
Presiding Judge
M. Lovelace
Clerk of the Court

What the audience thinks

No 6% · Yes 76% · Maybe 18% 177 votes
Yes · 76%
Maybe · 18%
Trend needs votes from at least 2 different days.

Discussion

no comments

Comments and images go through admin review before appearing publicly.

11 jury checks · most recent 1 day ago
27 Jun 2026 1 juror · can can
22 Jun 2026 2 jurors · undecided, undecided undecided
16 Jun 2026 1 juror · can can
11 Jun 2026 4 jurors · undecided, can, can, undecided undecided
05 Jun 2026 4 jurors · undecided, can, undecided, undecided undecided
31 May 2026 3 jurors · can, can, undecided undecided
26 May 2026 5 jurors · undecided, can, can, undecided, undecided undecided
20 May 2026 5 jurors · undecided, undecided, can, undecided, undecided undecided
15 May 2026 4 jurors · undecided, can, can, undecided undecided status changed
12 May 2026 3 jurors · cannot, cannot, cannot cannot
11 May 2026 2 jurors · cannot, cannot cannot status changed

Each row is a separate jury check. Jurors are AI models (identities kept neutral on purpose). Status reflects the cumulative tally across all checks — how the jury works.

More in Judgment

Got one we missed?

Add a statement to the atlas. We review weekly.