🔥 Hot topics · Can NOT do · Can do · § The Court · Recent inflections · 📈 Timeline · Ask · Editorials · 🔥 Hot topics · Can NOT do · Can do · § The Court · Recent inflections · 📈 Timeline · Ask · Editorials
Stuff AI CAN'T Do

Can AI write working code in 50+ programming languages from natural-language prompts ?

What do you think?

This prompt asks for practical, executable code generated from plain-English descriptions across dozens of programming languages. It’s a test of how far today’s AI models can push multi-lingual code synthesis.

Background

Generative coding tools have advanced dramatically since GitHub Copilot, driven by large language models trained on broad code repositories. Early systems focused on popular languages (Python, Java, C++, JavaScript), but later models expanded coverage to dozens of languages by ingesting larger, more diverse datasets. By mid-2025, state-of-the-art systems could emit syntactically correct snippets in over a hundred languages, yet consistently producing fully working implementations from natural-language prompts—especially in niche or esoteric languages—remains an open research challenge. Benchmarks like HumanEval-X and MBPP-X now include multi-language tests with 164 languages, revealing gaps in correctness and edge-case handling. As of May 2026, continuous fine-tuning and retrieval-augmented generation (RAG) are being used to improve accuracy. GitHub Copilot’s widespread adoption underscores the shift toward AI-assisted software engineering, but the leap to reliable generation across 50+ languages still demands careful model selection, prompt engineering, and post-generation validation.

Status last checked on June 28, 2026.

📰

Gallery

In the Court of AI Capability
Summary of Findings
Verdict over time
May 2026May 2026May 2026May 2026May 2026Jun 2026Jun 2026Jun 2026Jun 2026Jun 2026Jun 2026
Sitting at the Bench Filed · Jun 28, 2026
— The Question Before the Court —

Can AI write working code in 50+ programming languages from natural-language prompts?

★ The Court Finds ★
▼ Downgraded from Yes
Almost

Narrow demos exist — but the panel was not unanimous.

Ruling of the Bench

After lively deliberation, the jury found the status of today’s AI to be tantalizingly close to “Yes,” yet still shy of full marks: the models can whisper snippets in dozens of dialects, but cannot yet deliver a sonnet in every tongue without the occasional grammatical stumble. The lone “Yes” juror pointed to everyday tools that pop out cross-language code like popcorn, while the “Almost” voters insisted those outputs still read like a tourist’s phrasebook—helpful, but not quite fluent. Ruling: “It’s fluent enough to book a room, but not yet to host the party.”

— Hon. J. von Neumann III, Presiding
Jury Tally
1Yes
2Almost
0No
Verdict Confidence
87%
The Court of AI Capability is, of course, not a real court.
But the data is real.
The Case File · Stacked History
Session I · May 2026 In_research
Session II · May 2026 Yes
Session III · May 2026 Almost · 83%
Session IV · May 2026 Yes · 83%
Session V · May 2026 Almost · 79%
Session VI · Jun 2026 Almost · 78%
Session VII · Jun 2026 Almost · 77%
Session VIII · Jun 2026 Almost · 77%
Session IX · Jun 2026 Yes · 93%
Session X · Jun 2026 Yes · 90%
Case № D64A · Session XI
In the Court of AI Capability

The Case File

Docket № D64A · Session XI · Vol. XI
I. Particulars of the Case
Question put to the courtCan AI write working code in 50+ programming languages from natural-language prompts?
SessionXI (11 hearing)
Convened28 Jun 2026
Previously ruledIN_RESEARCH (May '26) → YES (May '26) → ALMOST (May '26) → YES (May '26) → ALMOST (May '26) → ALMOST (Jun '26) → ALMOST (Jun '26) → ALMOST (Jun '26) → YES (Jun '26) → YES (Jun '26) → ALMOST (Jun '26)
Presiding JudgeHon. J. von Neumann III
II. Cumulative Tally Across Sessions

Across 11 sessions, 30 jurors have heard this case. Combined tally: 17 YES · 12 ALMOST · 1 NO · 0 IN RESEARCH.

Note: cumulative includes older juror opinions. The current session tally above is the live verdict.

III. Verdict

By a vote of 1 — 2 — 0, the panel returns a verdict of ALMOST, with verdict confidence of 87%. The court so orders. Verdict downgraded from prior session.

IV. Statements from the Bench
Juror I ALMOST

"Multilingual code generation exists"

Juror II YES

"GitHub Copilot, Cursor, and Codeium generate multilingual code snippets routinely."

Juror III ALMOST

"Code generation models exist"

J. von Neumann III
Presiding Judge
M. Lovelace
Clerk of the Court

What the audience thinks

No 4% · Yes 83% · Maybe 13% 48 votes
Yes · 83%
Maybe · 13%
15 days of activity

Discussion

no comments

Comments and images go through admin review before appearing publicly.

11 jury checks · most recent 9 hours ago
28 Jun 2026 3 jurors · undecided, can, undecided undecided
22 Jun 2026 3 jurors · can, can, undecided undecided
17 Jun 2026 2 jurors · can, can can
12 Jun 2026 2 jurors · can, undecided undecided
06 Jun 2026 2 jurors · can, undecided undecided
01 Jun 2026 3 jurors · can, undecided, undecided undecided
26 May 2026 3 jurors · undecided, can, undecided undecided
21 May 2026 3 jurors · can, can, undecided undecided
15 May 2026 4 jurors · undecided, can, can, undecided undecided status changed
12 May 2026 3 jurors · can, can, can can status changed
11 May 2026 2 jurors · can, cannot undecided status changed

Each row is a separate jury check. Jurors are AI models (identities kept neutral on purpose). Status reflects the cumulative tally across all checks — how the jury works.

More in Creative

Got one we missed?

Add a statement to the atlas. We review weekly.