Results & Benchmarks
| Task | Dataset | Metric | Value |
|---|---|---|---|
| Reinforcement learning | MATH | pass@1 | 500 |
Hardware Requirements
- Expect multi-day setup/compute for meaningful reproduction based on current guidance.
Best Implementation
- Selected deepseek-ai/deepseek-r1 as the strongest maintained implementation for new work.
- Includes CI workflow signals.
- Repository activity is within the last 24 months.
Reproduction Path
- 1
Start with deepseek-ai/deepseek-r1 and validate setup instructions in README.
- 2
Reproduce the baseline result with the provided defaults before modifying hyperparameters.
- 3
Log exact dependency versions and runtime environment for reproducibility.
Time to first repro: a few daysDependency manifest is missing
Additional Implementations
No additional verified repositories beyond the primary recommendation.
Hugging Face Artifacts
No direct paper-linked artifacts were found. Showing strongest curated related artifacts.
Curated Related
- deepseek-ai/DeepSeek-R13.1M 13.2k