VeRL-Omni

Getting Started

  • Installation
  • Quickstart: FlowGRPO training on Qwen-Image OCR dataset
  • Training Metrics

Advanced Features

  • Rollout Correction for Diffusion Training (Experimental)
  • Using an External HTTP Scorer Service

Algorithms

  • Flow-GRPO
  • DiffusionNFT
  • GRPO-Guard
  • Mix-GRPO
  • Performance Reference

Performance Tuning Guide

  • Diffusion FLOPs / MFU
  • Profiling FlowGRPO / diffusion training in VeRL-Omni

Hardware Support

  • Quickstart: FlowGRPO training on Qwen-Image OCR dataset with Ascend NPU

API Reference

  • Trainer Interface
  • Workers Interface
  • Rollout & Agent Loop
  • Reward Interface
  • Pipelines Interface
  • Utilities

Developer Guide

  • Editing Agent Instructions
  • How to Integrate a New Diffusion Model for FlowGRPO Training
  • How to Integrate a New Policy-Gradient Algorithm for Diffusion Model
  • How to Integrate a New Direct-Preference Algorithm for Diffusion Model
  • Common Pitfalls
VeRL-Omni
  • Search


© Copyright 2026 Bytedance Ltd. and/or its affiliates.

Built with Sphinx using a theme provided by Read the Docs.