Resource Info Paper https://arxiv.org/abs/2502.12134 Code & Data https://github.com/xuyige/SoftCoT Public ACL Date 2025.07.04
Resource Info Paper https://arxiv.org/abs/2310.12931 Code & Data https://github.com/eureka-research/Eureka Public ICLR Date 2025.06.30
Resource Info Paper https://openreview.net/pdf?id=qZMLrURRr9 Code & Data / Public ICML Date 2025.06.26
Resource Info Paper http://arxiv.org/abs/2505.13417 Code & Data https://github.com/THU-KEG/AdaptThink Public arXiv Date 2025.06.20
Cite from: DAPO: An Open-Source LLM Reinforcement Learning System at Scale