R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO

Q: What is the best open-source implementation of "R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO"?

The best maintained implementation is hjyao00/r1-sharevl with 36 stars on GitHub. Confidence: high. Reproducibility: Moderate.

Q: How reproducible is "R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO"?

Estimated time to first reproduction: a few hours. Risk flags: No CI workflows detected. Start with hjyao00/r1-sharevl and validate setup instructions in README.

Q: Are there pretrained models available for "R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO"?

Yes, 1 Hugging Face model found. The top result is microsoft/Phi-4-reasoning-vision-15B with 22,840 downloads.

Q: What framework is used to implement "R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO"?

The primary implementation uses pytorch.

Published: May 1, 2025

Best maintained implementation now

Evidence: Direct

Domain fit: AI-core

Verified repos: 2

Top repo stars: 36

Core AI workload signals detected from paper context and implementation/artifact evidence.

Framework: pytorch

Time to first repro: a few hours

1 risk flag

arXiv PDF

Technical details

Canonical key: arxiv-2505.16673

Cache status: Fresh

Generated at: May 1, 2026, 1:36 AM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

LLM status: not_generated

LLM model: n/a

LLM generated: Unknown

LLM content type: n/a

HF policy: hf-relevance-v27

implementation starting point

Benchmarks: thin evidence

Time to repro: a few hours

1 risk flag

pytorch

Results & Benchmarks

Freshness tier: hot

Direct + Inferred Evidence

Natural language processing

GPT-4o

AI2D

84.9

Source: paper fulltext

Benchmark evidence drill-down

1 findings

Audit each benchmark finding before selecting an implementation path. Evidence refs map to the disclosure section below.

Task	Dataset	Metric	Value	Source	Evidence refs
Natural language processing	GPT-4o	AI2D	84.9	paper-derived	No explicit refs

R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO is the primary contribution described in this paper.

Use This Implementation Because…

Confidence: high

hjyao00/r1-sharevl is the strongest maintained implementation based on ranking signals. License is declared (Apache-2.0). Dependency/environment manifests are present.

Open hjyao00/r1-sharevl

Reproduction Risks

No CI workflows detected

Evidence disclosure

Evidence graph: 4 refs, 4 links.

Utility signals: depth 90/100, grounding 95/100, status high.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

hjyao00/r1-sharevl

best maintained

Maintenance: Stale risk

Confidence: High

Reproducibility: Moderate

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 36
Last push: Sep 19, 2025 (225d ago)

DockerfileDependencies

Risk flags

No CI pipeline detected
No tagged releases

HJYao00/R1-ShareVL

alternative

Maintenance: Stale risk

Confidence: Medium

Reproducibility: Moderate

Matched via arXiv identifier search · Strong overlap with paper title keywords

Stars: 36
Last push: Sep 19, 2025 (225d ago)

DockerfileDependencies

Risk flags

No CI pipeline detected
No tagged releases

Jasper0068/arxiv-papers-daily

alternative

Maintenance: Active

Confidence: Low

Reproducibility: Strong

Matched via arXiv identifier search

Stars: 14
Last push: Apr 30, 2026 (2d ago)

CIDependencies

Risk flags

No tagged releases
No Docker setup
Low confidence match

Best implementation now

hjyao00/r1-sharevl

Confidence: High

Reproducibility: Moderate

[NeurIPS 2025] Reasoning MLLM, Share-GRPO, advantage vanishing, sparse reward

Stars: 36

Forks: 1

Last push: Sep 19, 2025

License: Apache-2.0

Official implementation from Papers with Code

Repository link is mentioned in the paper metadata

Strong overlap with paper title keywords

Community adoption signal (36 stars)

License ✓

CI –

Deps ✓

Docker ✓

Selected hjyao00/r1-sharevl as the strongest maintained implementation for new work.
Includes dependency/environment manifest signals.
Repository activity is within the last 24 months.

Reproduction readiness

Setup Required

Time to first repro: hours

Last checked: May 1, 2026

Dependencies pinned, manual setup needed

· hjyao00/r1-sharevl has pyproject.toml but requires manual environment setup.
· Last push was 225 days ago — expect possible dependency version conflicts.
· No CI pipeline — test coverage is unknown.

Open hjyao00/r1-sharevl

Quick start

git clone https://github.com/hjyao00/r1-sharevl.git
pip install -e .

Additional implementations

Official

No additional official repositories detected.

Community

HJYao00/R1-ShareVL
Confidence: Medium

[NeurIPS 2025] Reasoning MLLM, Share-GRPO, advantage vanishing, sparse reward

Stars: 36

Last push: Sep 19, 2025

License: Apache-2.0