paperMarch 2026

Multilingual Reasoning Gym: Multilingual Scaling of Procedural Reasoning Environments

AuthorsKonstantin Dobler†**‡, Simon Lehnerer‡, Federico Scozzafava, Jonathan Janke, Mohamed Ali

We present the Multilingual Reasoning Gym, an extension of Reasoning Gym (Stojanovski et al., 2025), that procedurally generates verifiable reasoning problems across 14 languages. We translate templates for 94 tasks with native-speaker validation in 10 languages and targeted code or template adaptations to ensure linguistic naturalness. The Multilingual Reasoning Gym preserves the core benefits of the procedural generation approach used in the original Reasoning Gym, such as virtually unlimited problem instance generation and adjustable difficulty, and remains directly usable for Reinforcement Learning from Verifiable Rewards and evaluation settings. Problems in the Multilingual Reasoning Gym are parallel across languages, enabling crosslingually parallel data generation at massive scale due to the procedural nature of the environments. We release our implementation to support research into multilingual reasoning models.

† Hasso Plattner Institute & ELLIS Unit Potsdam
** Work done while at Apple
‡ Equal contribution

Multilingual Reasoning Gym: Multilingual Scaling of Procedural Reasoning Environments

Related readings and updates.

mAceReason-Math: A Dataset of High-Quality Multilingual Math Problems Ready For RLVR

Training Software Engineering Agents and Verifiers with SWE-Gym

Discover opportunities in Machine Learning.