Maintained implementation available
Pretrained models available

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

January 22, 2025
arXiv: 2501.12948

1 repo
91,959 stars
~a few days to reproduce

Abstract

Task	Dataset	Metric	Value
Reinforcement learning	MATH-500	pass@1	(value not captured)

Expect multi-day setup/compute for meaningful reproduction based on current guidance.

deepseek-ai/DeepSeek-R1

92.0k stars, 11.7k forks, updated Jun 2025, MIT license

License ✓

CI ✓

Deps –

Docker –

Selected deepseek-ai/DeepSeek-R1 as the strongest maintained implementation for new work.
The repository includes a CI workflow.
Repository activity is within the last 24 months.

1. Start with deepseek-ai/DeepSeek-R1 and validate the setup instructions in its README.
2. Reproduce the baseline result with the provided defaults before modifying hyperparameters.
3. Log exact dependency versions and runtime environment for reproducibility.
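
The logging step above could be sketched as follows. This is a minimal illustration using only the Python standard library, not part of the repository. The function names `snapshot_environment` and `write_snapshot` and the output file `repro_env.json` are hypothetical choices made for this example.

```python
import json
import platform
import sys
from importlib import metadata


def snapshot_environment() -> dict:
    """Collect interpreter, OS, and installed-package versions (hypothetical helper)."""
    packages = {}
    for dist in metadata.distributions():
        name = dist.metadata.get("Name")
        if name:  # skip distributions whose metadata lacks a name
            packages[name] = dist.version
    return {
        "python": sys.version.split()[0],
        "platform": platform.platform(),
        "machine": platform.machine(),
        # Sort case-insensitively so snapshots from two runs are easy to diff.
        "packages": dict(sorted(packages.items(), key=lambda kv: kv[0].lower())),
    }


def write_snapshot(path: str = "repro_env.json") -> dict:
    """Write the snapshot to a JSON file (the file name is an assumption) and return it."""
    snap = snapshot_environment()
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(snap, fh, indent=2)
    return snap
```

Calling `write_snapshot()` once before the baseline run and again after any dependency change leaves two JSON files that can be diffed. Because the listing reports no dependency manifest, a snapshot of this kind may be the only record of the environment that produced a given result.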

Time to first repro: a few days
Dependency manifest is missing

No additional verified repositories beyond the primary recommendation.

No direct paper-linked artifacts were found. Showing strongest curated related artifacts.

Curated Related