publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2026
-
Reward Score Matching: Unifying Reward-based Fine-tuning for Flow and Diffusion ModelsIn arxiv, 2026 -
PCPO: Proportionate Credit Policy Optimization for Preference Alignment of Image Generation ModelsIn The Fourteenth International Conference on Learning Representations, 2026