AI finally masters Stratego—on a budget

Kari Jaaskelainen

11 Nov 2025 — 1 min read

AI finally masters Stratego—on a budget

Stratego has long been a worst-case test for AI: you must plan deeply while most pieces are hidden. Big-budget attempts still fell short of top humans.

This paper reports a step change: an AI that reaches vastly superhuman Stratego performance—trained for only a few thousand dollars.

Self-play reinforcement learning: the system learns by playing itself, discovering tactics and long-term plans without human data.
Test-time search with hidden info: a lookahead that reasons about uncertainty, not just perfect knowledge.
General methods: designed for imperfect-information problems beyond games.

Why it matters: strong decision-making under uncertainty, at low cost, opens doors for research and applications in security, logistics, and robotics.

Authors: Samuel Sokota, Eugene Vinitsky, Hengyuan Hu, J. Zico Kolter, Gabriele Farina. Paper: http://arxiv.org/abs/2511.07312v1

Paper: http://arxiv.org/abs/2511.07312v1

Register: https://www.AiFeta.com

AI ReinforcementLearning Stratego GameAI MachineLearning ImperfectInformation Research SelfPlay Arxiv

AI finally masters Stratego—on a budget

Kari Jaaskelainen

AI finally masters Stratego—on a budget

Read more

Tekoäly myötäilee toteamuksia enemmän kuin kysymyksiä

Tekoälyn pitäisi uskaltaa sanoa “en tiedä” — ja sillä on väliä, miten tämä mitataan

Pienet kielimallit nopeutuvat, kun niille opetetaan valmiita fraaseja

Kone näkee saman kohtauksen eri tavoin – uusi tapa opettaa sen kokoamaan aistinsa yhteen