What is the best open-source implementation of "Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start"?

The best maintained implementation is hiyouga/easyr1 with 5,018 stars on GitHub. Confidence: high. Reproducibility: Strong.

What framework is used to implement "Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start"?

The primary implementation uses pytorch.

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

Q: How reproducible is "Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start"?

Estimated time to first reproduction: a few hours. No risk flags identified. Start with hiyouga/easyr1 and validate setup instructions in README.

Published: May 1, 2025

Best maintained implementation now

Evidence: Direct

Domain fit: AI-core

Verified repos: 3

Top repo stars: 5,018

Core AI workload signals detected from paper context and implementation/artifact evidence.

Framework: pytorch

Time to first repro: a few hours

No risk flags

arXiv PDF

Technical details

Canonical key: arxiv-2505.22334

Cache status: Stale (SWR served)

Generated at: Jun 17, 2026, 9:38 AM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

LLM status: not_generated

LLM model: n/a

LLM generated: Unknown

LLM content type: n/a

HF policy: hf-relevance-v27

implementation starting point

Benchmarks: missing

Time to repro: a few hours

pytorch

Results & Benchmarks

Freshness tier: hot

Direct + Inferred Evidence

No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start presents a reinforcement learning method.

Use This Implementation Because…

Confidence: high

hiyouga/easyr1 is the strongest maintained implementation based on ranking signals. CI workflows are present. License is declared (Apache-2.0).

Open hiyouga/easyr1

Reproduction Risks

No repository-level red flags were detected, but paper-specific preprocessing and hyperparameter details may still be under-specified.

Evidence disclosure

Evidence graph: 3 refs, 3 links.

Utility signals: depth 55/100, grounding 75/100, status medium.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

hiyouga/easyr1

best maintained

Maintenance: Recently updated

Confidence: High

Reproducibility: Strong

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 5,018
Last push: Apr 6, 2026 (73d ago)

CIDockerfileReleasesDependencies

Risk flags

No obvious maintenance or reproducibility risks detected.

waltonfuture/rl-with-cold-start

historical official

Maintenance: Stale risk

Confidence: High

Reproducibility: Limited

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 49
Last push: Jun 27, 2025 (356d ago)

Risk flags

No CI pipeline detected
No tagged releases
No Docker setup

Jasper0068/arxiv-papers-daily

alternative

Maintenance: Active

Confidence: Low

Reproducibility: Strong

Matched via arXiv identifier search

Stars: 14
Last push: Jun 15, 2026 (3d ago)

CIDependencies

Risk flags

No tagged releases
No Docker setup
Low confidence match

Best implementation now

hiyouga/easyr1

Confidence: High

Reproducibility: Strong

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Stars: 5,018

Forks: 373

Last push: Apr 6, 2026

License: Apache-2.0

Official implementation from Papers with Code

Repository link is mentioned in the paper metadata

Community adoption signal (5018 stars)

License ✓

CI ✓

Deps ✓

Docker ✓

Selected hiyouga/easyr1 as the strongest maintained implementation for new work.
Includes CI workflow signals.
Includes dependency/environment manifest signals.
Repository activity is within the last 24 months.

Historical official implementation

Preserved for provenance. Not recommended as the default path for new builds.

waltonfuture/rl-with-cold-start

Stars: 49

Last push: Jun 27, 2025

Reproduction readiness

Ready to Run

Time to first repro: hours

Last checked: Jun 17, 2026

Ready to reproduce

· Clone hiyouga/easyr1 and install dependencies from pyproject.toml.
· Dockerfile available for containerized reproduction.
· CI pipeline detected — automated tests are in place.
· Last updated 73 days ago.

Open hiyouga/easyr1

Quick start

git clone https://github.com/hiyouga/easyr1.git
pip install -e .

No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.

Additional implementations

Official

No additional official repositories detected.

Community

waltonfuture/RL-with-Cold-Start
Confidence: Medium

SFT+RL boosts multimodal reasoning

Stars: 49

Last push: Jun 27, 2025

Possible but unverified matches (1)

These repositories had low-confidence matching signals and are hidden by default.

Jasper0068/arxiv-papers-daily

Confidence: Low

Stars: 14

Hugging Face artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.

Models

No trustworthy model matches right now.

Search models on Hugging Face

Datasets

WaltonFuture/Multimodal-RL-Data

Curated Related

Downloads: 164

Likes: 8

Updated: Jul 24, 2025
WaltonFuture/Multimodal-Cold-Start

Curated Related

Downloads: 58

Likes: 11

Updated: Jul 24, 2025