What is the best open-source implementation of "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts"?

The best maintained implementation is neuir/m2rag with 44 stars on GitHub. Confidence: high. Reproducibility: Moderate.

Are there pretrained models available for "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts"?

Yes, 1 Hugging Face model found. The top result is bosonai/higgs-audio-v2-generation-3B-base with 189,624 downloads.

What framework is used to implement "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts"?

The primary implementation uses pytorch.

Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts

Q: How reproducible is "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts"?

Estimated time to first reproduction: a few hours. Risk flags: No CI workflows detected. Start with neuir/m2rag and validate setup instructions in README.

Published: Feb 1, 2025

Best maintained implementation now

Evidence: Direct

Domain fit: AI-adjacent

Verified repos: 2

Top repo stars: 44

Paper appears method- or tooling-adjacent to AI workflows with partial ecosystem coverage.

Framework: pytorch

Time to first repro: a few hours

1 risk flag

arXiv PDF

Technical details

Canonical key: arxiv-2502.17297

Cache status: Stale (SWR served)

Generated at: Jun 18, 2026, 2:24 PM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

LLM status: not_generated

LLM model: n/a

LLM generated: Unknown

LLM content type: n/a

HF policy: hf-relevance-v27

implementation starting point

Benchmarks: thin evidence

Time to repro: a few hours

1 risk flag

pytorch

Results & Benchmarks

Freshness tier: cold

Direct + Inferred Evidence

Retrieval / indexing

MSCOCO

ROUGE-L

30.68

Source: paper fulltext

Retrieval / indexing

M 2 RAG

ROUGE-L

17.58

Source: paper fulltext

Benchmark evidence drill-down

2 findings

Audit each benchmark finding before selecting an implementation path. Evidence refs map to the disclosure section below.

Task	Dataset	Metric	Value	Source	Evidence refs
Retrieval / indexing	MSCOCO	ROUGE-L	30.68	paper-derived	No explicit refs
Retrieval / indexing	M 2 RAG	ROUGE-L	17.58	paper-derived	No explicit refs

Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts focuses on retrieval / indexing.

Use This Implementation Because…

Confidence: high

neuir/m2rag is the strongest maintained implementation based on ranking signals. License is declared (MIT). Dependency/environment manifests are present.

Open neuir/m2rag

Reproduction Risks

No CI workflows detected

Evidence disclosure

Evidence graph: 4 refs, 4 links.

Utility signals: depth 90/100, grounding 95/100, status high.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

neuir/m2rag

best maintained

Maintenance: Stale risk

Confidence: High

Reproducibility: Moderate

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 44
Last push: Sep 27, 2025 (266d ago)

Dependencies

Risk flags

No CI pipeline detected
No tagged releases
No Docker setup

open-dataflow/rare

alternative

Maintenance: Recently updated

Confidence: Low

Reproducibility: Moderate

Partial overlap with paper title keywords · Community adoption signal (184 stars)

Stars: 184
Last push: May 20, 2026 (31d ago)

Dependencies

Risk flags

No CI pipeline detected
No tagged releases
No Docker setup

Jasper0068/arxiv-papers-daily

alternative

Maintenance: Active

Confidence: Low

Reproducibility: Strong

Matched via arXiv identifier search

Stars: 14
Last push: Jun 15, 2026 (5d ago)

CIDependencies

Risk flags

No tagged releases
No Docker setup
Low confidence match

Best implementation now

neuir/m2rag

Confidence: High

Reproducibility: Moderate

[MM '25] This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".

Stars: 44

Forks: 4

Last push: Sep 27, 2025

License: MIT

Official implementation from Papers with Code

Repository link is mentioned in the paper metadata

Strong overlap with paper title keywords

Community adoption signal (44 stars)

License ✓

CI –

Deps ✓

Docker –

Selected neuir/m2rag as the strongest maintained implementation for new work.
Includes dependency/environment manifest signals.
Repository activity is within the last 24 months.

Reproduction readiness

Setup Required

Time to first repro: hours

Last checked: Jun 18, 2026

Dependencies pinned, manual setup needed

· neuir/m2rag has requirements.txt but requires manual environment setup.
· Last push was 266 days ago — expect possible dependency version conflicts.
· No Dockerfile — you will set up the environment manually.
· No CI pipeline — test coverage is unknown.

Open neuir/m2rag

Quick start

git clone https://github.com/neuir/m2rag.git
pip install -r requirements.txt

Additional implementations

Official

No additional official repositories detected.

Community

NEUIR/M2RAG
Confidence: Medium

[MM '25] This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".

Stars: 44

Last push: Sep 27, 2025

License: MIT