PerfDojo: An AI coach for faster ML on any chip

Kari Jaaskelainen

06 Nov 2025 — 1 min read

In a nutshell

Making ML run fast on every chip is hard. Different CPUs, GPUs, and accelerators need different tricks, especially with features like sparsity and quantization. Manual tuning is slow; many automatic tools are opaque.

What’s new

PerfLLM + PerfDojo turn optimization into a reinforcement learning game guided by large language models.
They use a human-readable, math-inspired code form; transformations guarantee semantic validity while exploring faster variants.
Works without prior hardware-specific knowledge, enabling both human insight and effective agent training.

Why it matters

Portable speed-ups: the paper reports gains across CPUs (x86, Arm, RISC-V) and GPUs.
Fewer black-box heuristics; more interpretable optimization.

Automated performance without arcane hardware hacks.

By Andrei Ivanov, Siyuan Shen, Gioele Gottardo, Marcin Chrapek, Afif Boudaoud, Timo Schneider, Luca Benini, Torsten Hoefler. Read more: http://arxiv.org/abs/2511.03586v1

Paper: http://arxiv.org/abs/2511.03586v1

Register: https://www.AiFeta.com

#MachineLearning #AI #Performance #LLM #ReinforcementLearning #CPUs #GPUs #RISCV #Arm #Systems #HPC #Research

Safe Answers Can Still Teach Risky Skills, Study Finds

Even when advanced AI systems refuse to give dangerous instructions, their seemingly harmless answers can be reused to teach smaller models risky skills. A new study shows that safety filters at the output level are not enough on their own. This matters because it affects how quickly powerful know‑how

Graph neural networks can act as fast problem‑solving shortcuts

Cornell University researchers report that a type of AI called a graph neural network can learn to solve classic routing puzzles on its own and produce answers in one shot. This matters because many real tasks — from delivery planning to chip design — boil down to such puzzles, where speed and

Making AI steadier at reading emotions in mental‑health texts

Researchers have built a method to make artificial intelligence more reliable when it reads emotions in text, such as clinical notes, counselling chats and posts in online support groups. This matters because early triage and risk assessment often depend on what people write and how that writing is interpreted. Why

An AI that designs its own safety tests for other AI systems

A research team has built an AI system that designs and improves safety tests for other AI models on its own. In trials, it found ways to make models break their own rules more often than methods designed by people. This matters because safety testing needs to keep pace with