OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Q: What is the best open-source implementation of "OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation"?

The best maintained implementation is shikiw/opera with 411 stars on GitHub. Confidence: high. Reproducibility: Moderate.

Q: How reproducible is "OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation"?

Estimated time to first reproduction: a few hours. Risk flags: No CI workflows detected. Start with shikiw/opera and validate setup instructions in README.

Q: What framework is used to implement "OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation"?

The primary implementation uses jax.

Published: Nov 1, 2023

Best maintained implementation now

Evidence: Direct

Domain fit: AI-core

Verified repos: 1

Top repo stars: 411

Core AI workload signals detected from paper context and implementation/artifact evidence.

Framework: jax

Time to first repro: a few hours

1 risk flag

arXiv PDF

Technical details

Canonical key: arxiv-2311.17911

Cache status: Stale (SWR served)

Generated at: Jun 17, 2026, 7:32 AM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

LLM status: not_generated

LLM model: n/a

LLM generated: Unknown

LLM content type: n/a

HF policy: hf-relevance-v27

implementation starting point

Benchmarks: missing

Time to repro: a few hours

1 risk flag

jax

Results & Benchmarks

Freshness tier: cold

Direct + Inferred Evidence

No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.

OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation is the primary contribution described in this paper.

Use This Implementation Because…

Confidence: high

shikiw/opera is the strongest maintained implementation based on ranking signals. License is declared (MIT). Dependency/environment manifests are present.

Open shikiw/opera

Reproduction Risks

No CI workflows detected

Evidence disclosure

Evidence graph: 3 refs, 3 links.

Utility signals: depth 55/100, grounding 75/100, status medium.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

shikiw/opera

best maintained

Maintenance: Stale

Confidence: High

Reproducibility: Moderate

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 411
Last push: Aug 24, 2024 (663d ago)

Dependencies

Risk flags

No push in 12+ months
No CI pipeline detected
No tagged releases

huofushuo/SID

alternative

Maintenance: Stale

Confidence: Low

Reproducibility: Limited

Community adoption signal (136 stars)

Stars: 136
Last push: Jan 16, 2025 (518d ago)

Dependencies

Risk flags

No push in 12+ months
No CI pipeline detected
No tagged releases

XiaomingX/CVPR2024-Papers-with-Code

alternative

Maintenance: Stale risk

Confidence: Low

Reproducibility: Limited

Matched via arXiv identifier search

Stars: 9
Last push: Sep 16, 2025 (275d ago)

Risk flags

No CI pipeline detected
No tagged releases
No Docker setup

Best implementation now

shikiw/opera

Confidence: High

Reproducibility: Moderate

[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Stars: 411

Forks: 33

Last push: Aug 24, 2024

License: MIT

Official implementation from Papers with Code

Repository link is mentioned in the paper metadata

Strong overlap with paper title keywords

Community adoption signal (411 stars)

License ✓

CI –

Deps ✓

Docker –

Selected shikiw/opera as the strongest maintained implementation for new work.
Includes dependency/environment manifest signals.
Repository activity is within the last 24 months.

Reproduction readiness

Setup Required

Time to first repro: hours

Last checked: Jun 17, 2026

Dependencies pinned, manual setup needed

· shikiw/opera has environment.yml but requires manual environment setup.
· Last push was 663 days ago — expect possible dependency version conflicts.
· No Dockerfile — you will set up the environment manually.
· No CI pipeline — test coverage is unknown.

Open shikiw/opera

Quick start

git clone https://github.com/shikiw/opera.git
conda env create -f environment.yml && conda activate <env-name>

No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.

Additional implementations

No additional verified repositories beyond the primary recommendation.

Possible but unverified matches (3)

These repositories had low-confidence matching signals and are hidden by default.

huofushuo/SID

Confidence: Low

Stars: 136
XiaomingX/CVPR2024-Papers-with-Code

Confidence: Low

Stars: 9
LeMei/CausalLLMs-ReadingGroup

Confidence: Low

Stars: 11

Hugging Face artifacts

No trustworthy direct or curated related Hugging Face artifacts were found yet.

Continue with targeted Hugging Face searches derived from the paper title and method context:

Models

arxiv:2311.17911 OPERA Multi-Modal

Datasets

arxiv:2311.17911 OPERA dataset

Spaces

arxiv:2311.17911 OPERA demo

Tip: start with models, then check datasets/spaces if you need evaluation data or demos.

Direct artifact matches are currently sparse. Use targeted Hugging Face searches to quickly locate candidate models, datasets, and demos.

Search models Search datasets Search spaces

Research context

Tasks

None detected

Methods

Transformer

Domains

Natural Language Processing

Evaluation & Human Feedback Data

Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.

Open in HFEPX

Explore Similar Papers

Jump to Paper2Code search queries derived from this paper's research context.

Transformer Natural Language Processing

Need human evaluators for your AI research? Scale annotation with expert AI Trainers.

Post a Job Get a Quote