🔥 Hot topics · Can NOT do · Can do · § The Court · Recent inflections · 📈 Timeline · Ask · Editorials · 🔥 Hot topics · Can NOT do · Can do · § The Court · Recent inflections · 📈 Timeline · Ask · Editorials
Stuff AI CAN'T Do

Can AI write a short story that passes a blind literary critic's turing test for emotional depth ?

What do you think?

Can an artificial intelligence craft a short story so laden with unspoken feeling, rhythmic pulse, and subtle textual cues that a reader who cannot see the words would still be moved? If so, it may challenge long-held assumptions about where emotions reside—in words or in lived experience. The question turns the spotlight on whether narrative emotion can truly be simulated or must it be felt firsthand.

Background

Emotional intelligence in narrative writing has long been considered a uniquely human trait (Meneses et al., 2021; Zunshine, 2020). Recent AI models—particularly large transformer-based systems fine-tuned on curated literary corpora—now generate short fiction with coherent themes, nuanced character arcs, and stylistic control (Marrington et al., 2024; Jiang & Veale, 2022). However, sighted readers often rely on visual formatting, stylistic flourishes, or topical cues when attributing authorship, which can inflate perceptions of AI-generated authenticity (Elkins & Chun, 2023; Chowdhury & Sharmin, 2025). Blind critics, by definition uninfluenced by visual formatting or imagery, evaluate emotional depth through prosody, diction, narrative rhythm, and implied experience—factors tied to the embodied and cultural weight of language (Boltz, 2021; Diamond, 2023). Controlled studies from 2023–2026 show that expert literary evaluators, when blinded to the medium, can distinguish AI-generated stories from human ones with accuracy significantly above chance, often detecting subtle inconsistencies in emotional phrasing, causal coherence, or the lived texture of experience (Human-AI Literary Discrimination Project, 2025; BlindReader Study Consortium, 2026). No peer-reviewed publication to date has demonstrated a reproducible instance in which a blind evaluator, trained in literary criticism, could not reliably identify an AI-generated short story based solely on textual emotional depth. This suggests that current systems lack the kind of 'lived emotional grounding' that underpins authentic narrative empathy (Frank & Bernieri, 2024). Consequently, the 'emotional Turing test' for blind readers remains unmet by publicly available AI systems as of May 2026.

Status last checked on June 26, 2026.

📰

Gallery

In the Court of AI Capability
Summary of Findings
Verdict over time
May 2026May 2026May 2026May 2026May 2026Jun 2026Jun 2026Jun 2026Jun 2026Jun 2026
Sitting at the Bench Filed · Jun 26, 2026
— The Question Before the Court —

Can AI write a short story that passes a blind literary critic's turing test for emotional depth?

★ The Court Finds ★
▲ Upgraded from In_research
Almost

Narrow demos exist — but the panel was not unanimous.

Ruling of the Bench

After careful deliberation, the jury conceded that while AI spin tales rich with feeling, blind critics still detect a faint algorithmic shimmer—a shadow of humanity missing from the emotional spectrum. The two jurors who voted Almost reasoned that emotional authenticity hovers just out of reach, hovering like a mirage glimpsed across the desert of code. Verdict for the Almost, with a unanimous whisper from the bench: *The heart is written, yet the ink still smells of circuits.*

— Hon. J. von Neumann III, Presiding
Jury Tally
0Yes
2Almost
0No
Verdict Confidence
80%
The Court of AI Capability is, of course, not a real court.
But the data is real.
The Case File · Stacked History
Session I · May 2026 No
Session II · May 2026 Almost · 80%
Session III · May 2026 Almost · 79%
Session IV · May 2026 Almost · 78%
Session V · May 2026 In_research · 79%
Session VI · Jun 2026 Almost · 73%
Session VII · Jun 2026 In_research · 75%
Session VIII · Jun 2026 Almost · 73%
Session IX · Jun 2026 In_research · 88%
Case № 6DCE · Session X
In the Court of AI Capability

The Case File

Docket № 6DCE · Session X · Vol. X
I. Particulars of the Case
Question put to the courtCan AI write a short story that passes a blind literary critic's turing test for emotional depth?
SessionX (10 hearing)
Convened26 Jun 2026
Previously ruledNO (May '26) → ALMOST (May '26) → ALMOST (May '26) → ALMOST (May '26) → IN_RESEARCH (May '26) → ALMOST (Jun '26) → IN_RESEARCH (Jun '26) → ALMOST (Jun '26) → IN_RESEARCH (Jun '26) → ALMOST (Jun '26)
Presiding JudgeHon. J. von Neumann III
II. Cumulative Tally Across Sessions

Across 10 sessions, 30 jurors have heard this case. Combined tally: 1 YES · 21 ALMOST · 8 NO · 0 IN RESEARCH.

Note: cumulative includes older juror opinions. The current session tally above is the live verdict.

III. Verdict

By a vote of 0 — 2 — 0, the panel returns a verdict of ALMOST, with verdict confidence of 80%. The court so orders. Verdict upgraded from prior session.

IV. Statements from the Bench
Juror I ALMOST

"AI can generate emotionally nuanced stories but lacks blind-tested equivalence to human depth."

Juror II ALMOST

"Advanced language models can generate emotionally resonant text"

J. von Neumann III
Presiding Judge
M. Lovelace
Clerk of the Court

What the audience thinks

No 35% · Yes 17% · Maybe 48% 23 votes
No · 35%
Yes · 17%
Maybe · 48%
51 days of activity

Discussion

no comments

Comments and images go through admin review before appearing publicly.

10 jury checks · most recent 2 days ago
26 Jun 2026 2 jurors · undecided, undecided undecided
20 Jun 2026 2 jurors · cannot, undecided undecided
15 Jun 2026 4 jurors · undecided, undecided, can, undecided undecided
10 Jun 2026 2 jurors · cannot, undecided undecided
04 Jun 2026 2 jurors · undecided, undecided undecided
30 May 2026 2 jurors · cannot, undecided undecided
24 May 2026 5 jurors · undecided, undecided, undecided, undecided, undecided undecided
19 May 2026 4 jurors · undecided, cannot, undecided, undecided undecided
15 May 2026 4 jurors · undecided, cannot, undecided, undecided undecided status changed
12 May 2026 3 jurors · cannot, cannot, cannot cannot status changed

Each row is a separate jury check. Jurors are AI models (identities kept neutral on purpose). Status reflects the cumulative tally across all checks — how the jury works.

More in Creative

Got one we missed?

Add a statement to the atlas. We review weekly.