# Quickstart: FlowGRPO training on Qwen-Image OCR dataset with Ascend NPU Last updated: 05/09/2026 Post-train a diffusion image generation model with FlowGRPO on Atlas 800T A2. ## Introduction This guide launches FlowGRPO LoRA training for `Qwen-Image` OCR generation on Ascend NPU. ## Prerequisite Prepare an Atlas 800T A2 server with 8 NPUs, and install the necessary software stack. 1. Install CANN by following the [Ascend CANN installation guide](https://www.hiascend.com/document/detail/zh/CANNCommunityEdition/900/softwareinst/instg/instg_0003.html?OS=openEuler&InstallType=local). 2. Install VeRL-Omni and its dependencies as described in the [installation guide](install.md#install). 3. Install the FlowGRPO-specific reward dependency: ```bash uv pip install Levenshtein ``` ## Launch Training Refer to [flowgrpo_quickstart](flowgrpo_quickstart.md) for details on the OCR dataset format, preprocessing commands, and general FlowGRPO task descriptions. The launch script for Ascend NPU is located at `examples/flowgrpo_trainer/qwen_image/run_qwen_image_ocr_lora_npu.sh`. Run the FlowGRPO training script for Ascend NPU from the repository root: ```bash bash examples/flowgrpo_trainer/qwen_image/run_qwen_image_ocr_lora_npu.sh ``` The script executes: ```bash python3 -m verl_omni.trainer.main_diffusion ``` Checkpoints are saved to: ```bash checkpoints/${trainer.project_name}/${trainer.experiment_name} ``` TensorBoard logs are saved to: ```bash tensorboard_log/${trainer.project_name}/${trainer.experiment_name} ``` To enable logging with Weights & Biases (WandB), modify `examples/flowgrpo_trainer/qwen_image/run_qwen_image_ocr_lora_npu.sh` and set: ```bash trainer.logger='["console", "wandb"]' ```