
#208 of 2292 in Artificial Intelligence (All Time)
Breaking $\textit{Winner-Takes-All}$: Cooperative Policy Optimization Improves Diverse LLM Reasoning
Congratulate the authors
Know the authors? Send them a congratulation.

Know the authors? Send them a congratulation.