A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
Are you the author? Sign in to claim
[ICLR2026] The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym
Weihua Du, Hailei Gong, Zhan Ling, Kang Liu, Lingfeng Shen, Xuesong Yao, Yufei Xu, Dingyuan Shi, Yiming Yang, Jiecao Chen
"Generalizable End-to-End Tool-Use RL with Synthetic CodeGym" (2025)
CodeGym is a synthetic environment generation framework for LLM agent reinforcement learning on multi-turn tool-use tasks. It automatically converts static code problems into interactive CodeGym environments where agents can learn to use tools to solve complex tasks in various configurations.
We are open-sourcing the following key parts of the project:
gym/README.md for details.online_server/README.md for details.A community reproduction of the synthetic dataset is available at HuggingFace.
CodeGym transforms traditional code problems into interactive environments where LLM agents can learn to:
We designed an elaborate process for CodeGym environment synthesis and verification:
Gym Synthesis:
Gym Verification:
The example/ folder contains sample CodeGym environments to help you get started:
example/example_envs contains some CodeGym environments examplesexample/training_instance.jsonl contains some instances for RL trainingexample/raw_problems.jsonl contains some raw coding problems for generation pipeline demonstrationBy training in CodeGym, LLMs show stronger generalization on out-of-distribution (OOD) tool-use and multi-turn benchmarks:
We release the pipeline for environment synthesis and verification. Please refer to gym/README.md for details.
We release a highly concurrent server for launching CodeGym environments aimed at large-scale reinforcement learning. Please refer to online_server/README.md for details.
This project and dataset are released under the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license.
If you find this work useful, please cite our paper:
@article{du2025generalizable,
title={Generalizable End-to-End Tool-Use RL with Synthetic CodeGym},
author={Du, Weihua and Gong, Hailei and Ling, Zhan and Liu, Kang and Shen, Lingfeng and Yao, Xuesong and Xu, Yufei and Shi, Dingyuan and Yang, Yiming and Chen, Jiecao},
journal={arXiv preprint arXiv:2509.17325},
year={2025}
}
Pocket Flow: Codebase to Tutorial
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
干净、强大、属于你的 AI Agent 平台 --AI agents, without the clutter.
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.