rollout

Introduction

A high-performance, multi-node reinforcement-learning framework for large language models. Written in Rust. Pluggable in Python.

rollout is a Rust-core reinforcement-learning framework for large language models. It supports PPO, GRPO, DPO/IPO/KTO, SFT, and reward-model training across training, batch inference, and online inference modes, with multi-node distribution from day one. AWS and GCP are first-class infra targets; vLLM is the default inference backend; plugins can be authored in Python or Rust.

LayerWhat it owns
AlgorithmsSFT · RM · PPO · GRPO · DPO / IPO / KTO
SubstrateCoordinator + workers + plugin host (PyO3 / sidecar RPC)
Storage / CloudEmbedded · Postgres · S3 · GCS — AWS/GCP behind a layered trait

See Architecture for the full layered breakdown.