Stuff AI CAN'T Do

Can AI pass the AP Biology exam with the highest score?

Multiple-choice and free-response exams are firmly in LLM territory. Scoring a 5 on AP exams is now a baseline, not an achievement.

Background

Multiple-choice and free-response exams are now firmly within the capabilities of large language models, with perfect or near-perfect scores serving as a benchmark for evaluating AI performance rather than a noteworthy achievement. However, AP Biology presents unique challenges that extend beyond data processing and pattern recognition. Historically, AI systems have struggled to fully replicate the nuanced understanding required to excel in biology, particularly in areas demanding critical thinking and contextual application of complex concepts.

The AP Biology exam assesses more than just factual recall; it includes laboratory-based questions and extended essay responses that require hands-on skills, experimental design, data interpretation, and articulate written communication. These components demand not only knowledge of biological principles but also the ability to synthesize information, evaluate evidence, and articulate arguments coherently—skills that, as of mid-2024, remain difficult for AI systems to replicate with reliability. While AI can process vast datasets, including textbooks, research papers, and practice questions, it lacks true comprehension and the ability to generalize biological principles in the way a well-prepared human student does. Current AI architectures, despite advances in transformer-based models and multimodal integration, do not possess the embodied experience or adaptive reasoning necessary to consistently achieve top scores on AP Biology assessments, especially in laboratory simulations or open-ended inquiry tasks.

Research into AI capable of passing advanced academic exams is ongoing, but the AP Biology exam remains a particularly high bar due to its integration of conceptual depth, quantitative reasoning, and scientific communication. As of May 9, 2026, no publicly documented AI system has demonstrated the ability to consistently earn the highest score on the AP Biology exam, and major technical hurdles persist in modeling biological cognition, experimental reasoning, and contextual scientific writing.

Status last verified on May 12, 2026.

In the Court of AI Capability
Summary of Findings
Verdict over time: May 2026, May 2026
Sitting at the Bench · Filed May 12, 2026
— The Question Before the Court —

Can AI pass the AP Biology exam with the highest score?

★ The Court Finds ★
Reaffirmed
No

Out of AI's reach for now. The capability gap is real.

Jury Tally: 0 Yes · 0 Almost · 3 No
Verdict Confidence: 100%
The Court of AI Capability is, of course, not a real court.
But the data is real.
The Case File · Stacked History
Session I · May 2026 · No
Case № CD0C · Session II
The Case File

Docket № CD0C · Session II · Vol. II
I. Particulars of the Case
Question put to the court: Can AI pass the AP Biology exam with the highest score?
Session: II (2nd hearing)
Convened: May 12, 2026
Previously ruled: NO (May '26) → NO (May '26)
II. Cumulative Tally Across Sessions

Across 2 sessions, 5 jurors have heard this case. Combined tally: 0 YES · 0 ALMOST · 5 NO · 0 IN RESEARCH.

Note: cumulative includes older juror opinions. The current session tally above is the live verdict.

III. Verdict

By a vote of 0 — 0 — 3, the panel returns a verdict of NO, with verdict confidence of 100%. The court so orders.
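
The tallies above can be reproduced with a short sketch. The per-session vote counts come from this page (2 jurors on 11 May 2026, 3 jurors on 12 May 2026, all voting no). The page does not say how verdict confidence is computed, so the rule used here (the share of current-session jurors who voted for the winning verdict) is an assumption, and the names `tally` and `verdict_and_confidence` are hypothetical.

```python
from collections import Counter

# Juror votes per session, as reported on this page:
# Session I (11 May 2026): 2 jurors, Session II (12 May 2026): 3 jurors.
sessions = {
    "I": ["no", "no"],
    "II": ["no", "no", "no"],
}

LABELS = ("yes", "almost", "no")


def tally(votes):
    """Count votes per label, including labels that received zero votes."""
    counts = Counter(votes)
    return {label: counts.get(label, 0) for label in LABELS}


def verdict_and_confidence(votes):
    """Return the most common vote and its share of the panel.

    ASSUMPTION: the page does not define "verdict confidence"; this treats
    it as the fraction of jurors who voted for the winning verdict.
    """
    counts = tally(votes)
    verdict = max(counts, key=counts.get)
    return verdict, counts[verdict] / len(votes)


# Live verdict uses only the current session; the cumulative tally pools all sessions.
current = verdict_and_confidence(sessions["II"])
cumulative = tally([vote for votes in sessions.values() for vote in votes])

print(current)     # ('no', 1.0)
print(cumulative)  # {'yes': 0, 'almost': 0, 'no': 5}
```

Under that assumption, a 0 / 0 / 3 vote gives a verdict of no at 100% confidence, and pooling both sessions gives the combined 0 YES · 0 ALMOST · 5 NO.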

IV. Statements of the Court
Juror I · No

"Lacks human-like understanding and lab experience"

Juror II · No

"No AI achieves the highest AP Biology exam score reliably."

Juror III · No

"Lacks human-like understanding and lab skills"

Individual juror statements are shown in their original English to preserve evidentiary accuracy.

Presiding Judge
M. Lovelace
Clerk of the Court

What the Public Thinks

No 5% · Yes 85% · Maybe 10% · 250 votes
The trend needs votes from at least 2 distinct days.

2 jury checks · most recent 2 days ago
12 May 2026 · 3 jurors · cannot, cannot, cannot · status: cannot
11 May 2026 · 2 jurors · cannot, cannot · status: cannot (status changed)

Each row is an independent jury check. The jurors are AI models (identities kept neutral on purpose). The status reflects the cumulative tally across all checks.
