DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Q: What is the best open-source implementation of "DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search"?

The best maintained implementation is deepseek-ai/deepseek-prover-v1.5 with 574 stars on GitHub. Confidence: high. Reproducibility: Moderate.

Q: How reproducible is "DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search"?

Estimated time to first reproduction: a few hours. Risk flags: No CI workflows detected. Start with deepseek-ai/deepseek-prover-v1.5 and validate setup instructions in README.

Q: Are there pretrained models available for "DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search"?

Yes, 3 Hugging Face models found. The top result is deepseek-ai/DeepSeek-Prover-V1.5-RL with 1,479 downloads.

Q: What framework is used to implement "DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search"?

The primary implementation uses pytorch.

Published: Aug 1, 2024

Best maintained implementation now

Evidence: Direct

Domain fit: AI-core

Verified repos: 1

Top repo stars: 574

Core AI workload signals detected from paper context and implementation/artifact evidence.

Framework: pytorch

Time to first repro: a few hours

1 risk flag

arXiv PDF

Technical details

Canonical key: arxiv-2408.08152

Cache status: Fresh

Generated at: May 30, 2026, 4:37 PM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

LLM status: not_generated

LLM model: n/a

LLM generated: Unknown

LLM content type: n/a

HF policy: hf-relevance-v27

implementation starting point

Benchmarks: thin evidence

Time to repro: a few hours

1 risk flag

pytorch

Results & Benchmarks

Freshness tier: cold

Direct + Inferred Evidence

Some benchmark signal exists in the extracted evidence, but it is not structured strongly enough yet for a confident benchmark decision.

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search presents a reinforcement learning approach for instruction tuning.

Use This Implementation Because…

Confidence: high

deepseek-ai/deepseek-prover-v1.5 is the strongest maintained implementation based on ranking signals. License is declared (MIT). Dependency/environment manifests are present.

Open deepseek-ai/deepseek-prover-v1.5

Reproduction Risks

No CI workflows detected

Evidence disclosure

Evidence graph: 4 refs, 4 links.

Utility signals: depth 90/100, grounding 95/100, status high.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

deepseek-ai/deepseek-prover-v1.5

best maintained

Maintenance: Stale

Confidence: High

Reproducibility: Moderate

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 574
Last push: Aug 16, 2024 (654d ago)

Dependencies

Risk flags

No push in 12+ months
No CI pipeline detected
No tagged releases

augustepoiroux/LeanInteract

alternative

Maintenance: Active

Confidence: Low

Reproducibility: Strong

Community adoption signal (124 stars)

Stars: 124
Last push: May 14, 2026 (17d ago)

CIReleasesDependencies

Risk flags

No Docker setup
Low confidence match

deepseek-ai/DeepSeek-Prover-V1.5

alternative

Maintenance: Stale

Confidence: Low

Reproducibility: Moderate

Matched via arXiv identifier search · Community adoption signal (574 stars)

Stars: 574
Last push: Aug 16, 2024 (654d ago)

Dependencies

Risk flags

No push in 12+ months
No CI pipeline detected
No tagged releases

Best implementation now

deepseek-ai/deepseek-prover-v1.5

Confidence: High

Reproducibility: Moderate

deepseek-ai/DeepSeek-Prover-V1.5

Stars: 574

Forks: 241

Last push: Aug 16, 2024

License: MIT

Official implementation from Papers with Code

Repository link is mentioned in the paper metadata

Community adoption signal (574 stars)

License ✓

CI –

Deps ✓

Docker –

Selected deepseek-ai/deepseek-prover-v1.5 as the strongest maintained implementation for new work.
Includes dependency/environment manifest signals.
Repository activity is within the last 24 months.

Reproduction readiness

Setup Required

Time to first repro: hours

Last checked: May 30, 2026

Dependencies pinned, manual setup needed

· deepseek-ai/deepseek-prover-v1.5 has requirements.txt but requires manual environment setup.
· Last push was 654 days ago — expect possible dependency version conflicts.
· No Dockerfile — you will set up the environment manually.
· No CI pipeline — test coverage is unknown.

Open deepseek-ai/deepseek-prover-v1.5

Quick start

git clone https://github.com/deepseek-ai/deepseek-prover-v1.5.git
pip install -r requirements.txt

Additional implementations

No additional verified repositories beyond the primary recommendation.

Possible but unverified matches (5)

These repositories had low-confidence matching signals and are hidden by default.

augustepoiroux/LeanInteract

Confidence: Low

Stars: 124
deepseek-ai/DeepSeek-Prover-V1.5

Confidence: Low

Stars: 574
airen3339/DeepSeek-Prover-V1.5

Confidence: Low

Stars: 0
anaghasatav27/DeepSeek-Prover-V1.5

Confidence: Low

Stars: 0
joshuaongg21/lean-compilation

Confidence: Low

Stars: 0

Hugging Face artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.

Models

deepseek-ai/DeepSeek-Prover-V1.5-RL

Curated Related

Downloads: 1,479

Likes: 65
deepseek-ai/DeepSeek-Prover-V1.5-SFT

Curated Related

Downloads: 3,566

Likes: 14
deepseek-ai/DeepSeek-Prover-V1.5-Base

Curated Related

Downloads: 193

Likes: 19

Broaden model search

Reinforcement learning Instruction tuning Instruction tuning Reinforcement learning

Datasets

No trustworthy dataset matches right now.

Search datasets on Hugging Face

Spaces

No trustworthy demo spaces right now.

Search spaces on Hugging Face

Explore on Hugging Face

Search models Search datasets Search spaces

Research context

Tasks

Instruction tuning

Methods

Reinforcement learning

Domains

None detected

Evaluation & Human Feedback Data

Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.

Open in HFEPX

Explore Similar Papers

Jump to Paper2Code search queries derived from this paper's research context.

Instruction tuning Reinforcement learning

Need human evaluators for your AI research? Scale annotation with expert AI Trainers.

Post a Job Get a Quote