RSM

Reward Score Matching - Unifying Reward-based Fine-tuning for Flow and Diffusion Models