Join the Newsletter!

Exclusive content & updates. No spam.

Skip to content

Watching OpenAI's o3 Model Sweat Over a Paul Morphy Mate-in-2

Published: at 

When I gave OpenAI’s o3 model a tough chess puzzle, it behaved almost like a human: thinking, doubting, retrying, and finally googling the answer. 🤣

Before I break it down step-by-step, here’s the funniest part: it spent 8 minutes calculating and pixel-measuring squares… and then cheated by using Bing.

Comic LLM
How o3 Solves hard Chess Problems

First the Problem if you want you can try to solve it yourself

Chess Puzzle
Chess Puzzle Checkmate in 2 White

This is a famous chess puzzle from Paul Morphy check out this link for more.

See the full conversation here.

1. Careful Board Analysis

Careful Board Analysis
Careful Board Analysis

First, o3 spent several minutes carefully reconstructing the board based on the image.

“The board has White’s King on c8, White’s Rook on a1, and a Pawn on b6. Black’s King on a8, Bishop on b8, and Pawns on a7 and b7.”

It was being very cautious. It wanted to fully understand the position before trying any move.
Already a very agent like behavior: care first, then move.

2. First Ideas and Doubts

It tested obvious ideas like b7# and Rxa7#, but immediately started doubting itself.

“After 1. Rxa7+, Black’s King cannot capture because of the White pawn on b6."
"But after Bxa7, it is no longer check. So it is not mate."
"1. b7+ is illegal because the pawn on b7 blocks it.”

It realized the first easy looking moves did not work and started getting more careful.

3. Attempt to Use Python

When pure reasoning was not enough, o3 tried programming its way out of the situation.

“I should probably check using something like a chess engine to confirm.”
(tries to import chess module, but fails: “ModuleNotFoundError”).

Attempt to Use Python
Attempt to Use Python

It wanted to run a simulation, but of course, it had no real chess engine installed.

4. Pixel-by-Pixel Image Analysis

Still stuck, it tried manually measuring the chessboard squares.

“The board is 726 pixels high, each square is about 88 pixels…"
"Coordinates for b7 would be approximately x=88–176, y=88–176.”

It was literally trying to deduce the pieces based on pixel math.

Massive stubbornness.

It also thought maybe that it could not be solved

“though I wonder if this is a typical “mate in 2” puzzle. I’m double-checking the whole board, including the pieces’ positions based on the FEN and manual inspection of the image. For example, looking at the black king on a8, white king on c8, and possible pawn or bishop on b7”

5. Still Thinking

o3 started showing mild signs of panic. It was still calculating but was not sure about the best move.

“The best move might involve centralizing the king to prevent black’s pawn advance or establish a checkmate net.”

When it then was using bing

Web Search
Web Search

Finally, right before hitting 8 minutes total:

“I found a chess forum that mentions this exact position…"
"Suggested move: Ra6.”

It binged the answer. 😂
Still, it did not just copy. It rechecked and understood why Ra6 works.

Timeline Summary

InitialAnalysis (~0-2min)Board analysisCarefullyreconstructedthe board fromthe image.Exploration & Doubt (~2-4 min)Idea testingTested obviousmoves like b7#and Rxa7#.Self-correctionRealizedinitial movesdidn't work.Failed Attempts (~4-6 min)Python attemptTried to use achess enginevia Python(failed).Pixel analysisTried to deducepieces viapixel math.Feeling stuckExpressed doubtaboutsolvability.Resolution (~6-8 min)Web SearchUsed Bing tofind thesolutiononline.VerificationConfirmed andunderstood thesuggested move(Ra6).o3 Model's Chess Puzzle Journey

Why This is Fascinating

o3 doesn’t just output an answer. It reasons. It struggles. It switches tools. It self-corrects. It even cheats (but only after serious effort). This feels very human. It also shows where current models shine—problem-solving—and where they still need external help.

Finding the hidden zugzwang-style solutions in complex chess puzzles might still require that missing “spark” of true creativity. You can read more about that in my post: “Are LLMs Creative?”.

Stay Updated!

Subscribe to my newsletter for more TypeScript, Vue, and web dev insights directly in your inbox.

  • Background information about the articles
  • Weekly Summary of all the interesting blog posts that I read
  • Small tips and trick
Subscribe Now

Related Posts