Open R1: Update #3
Over the last few weeks, we have focused our efforts on reproducing the competitive programming (code reasoning) aspects of the DeepSeek-R1 recipe. In this post, we are excited to share: The construction of CodeForces-CoTs: a dataset of nearly 100k high-quality samples distilled from R1 to produce solutions in C++ and Python. The IOI benchmark: a new benchmark of challenging problems from the 2024 International Olympiad in Informatics (IOI). OlympicCoder: two fine-tuned 7B and 32B code models that outperform closed-source frontier […]
Read more