Q: What is the best open-source implementation of "Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training"?

The best maintained implementation is salesforce/lavis with 11,228 stars on GitHub. Confidence: high. Reproducibility: Strong.

Q: Are there pretrained models available for "Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training"?

No directly paper-linked models were found. 3 curated related Hugging Face models surfaced; the top related result is ostris/zimage_turbo_training_adapter with 82,180 downloads.

Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training

Q: How reproducible is "Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training"?

Estimated time to first reproduction: a few hours. Two repository risk flags apply: no push in 12+ months and no Docker setup. Start with salesforce/lavis and validate the setup instructions in its README.

Q: What framework is used to implement "Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training"?

The primary implementation uses PyTorch.

Published: Oct 1, 2022

Best maintained implementation now

Evidence: Direct

Domain fit: AI-adjacent

Verified repos: 1

Top repo stars: 11,228

Framework: PyTorch

Time to first repro: a few hours

Risk flags: 2 (see Implementation Comparison)

arXiv PDF

Technical details

Canonical key: arxiv-2210.08773
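
The canonical key embeds the arXiv identifier, so the abstract and PDF links can be derived from it. A minimal sketch, assuming the key always has the form arxiv-<id>; the helper name arxiv_urls is hypothetical and not part of any tool referenced on this page:

```python
def arxiv_urls(canonical_key: str) -> dict:
    """Derive arXiv abstract and PDF URLs from a key like 'arxiv-2210.08773'."""
    prefix = "arxiv-"  # assumed key prefix, inferred from the key shown above
    if not canonical_key.startswith(prefix):
        raise ValueError(f"unexpected canonical key: {canonical_key!r}")
    arxiv_id = canonical_key[len(prefix):]
    return {
        "abs": f"https://arxiv.org/abs/{arxiv_id}",
        "pdf": f"https://arxiv.org/pdf/{arxiv_id}",
    }


urls = arxiv_urls("arxiv-2210.08773")
print(urls["abs"])
print(urls["pdf"])
```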

Cache status: Fresh

Generated at: Jun 1, 2026, 9:12 AM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

LLM status: not_generated

LLM model: n/a

LLM generated: Unknown

LLM content type: n/a

HF policy: hf-relevance-v27

implementation starting point

Benchmarks: thin evidence

Results & Benchmarks

Direct + Inferred Evidence

Task: Question answering

Model: GPT-J

Dataset: VQAv2

Value: 28.7

Source: paper fulltext

Benchmark evidence drill-down

1 finding

Audit each benchmark finding before selecting an implementation path. Evidence refs map to the disclosure section below.

Task	Model	Dataset	Value	Source	Evidence refs
Question answering	GPT-J	VQAv2	28.7	paper-derived	No explicit refs

Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training is the primary contribution described in this paper.

Use This Implementation Because…

Confidence: high

salesforce/lavis is the strongest maintained implementation based on ranking signals. CI workflows are present. License is declared (BSD-3-Clause).

Open salesforce/lavis

Reproduction Risks

The repository card under Implementation Comparison lists two risk flags (no push in 12+ months, no Docker setup). Paper-specific preprocessing and hyperparameter details may also be under-specified.

Evidence disclosure

Evidence graph: 4 refs, 4 links.

Utility signals: depth 90/100, grounding 95/100, status high.

Implementation Comparison

Top 1 path

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

salesforce/lavis

best maintained

Maintenance: Stale

Confidence: High

Reproducibility: Strong

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 11,228
Last push: Nov 18, 2024 (560d ago)

CI · Releases · Dependencies

Risk flags

No push in 12+ months
No Docker setup

Best implementation now

salesforce/lavis

Confidence: High

Reproducibility: Strong

LAVIS - A One-stop Library for Language-Vision Intelligence

Stars: 11,228

Forks: 1,102

Last push: Nov 18, 2024

License: BSD-3-Clause

Official implementation from Papers with Code

Repository link is mentioned in the paper metadata

Community adoption signal (11,228 stars)

License ✓

CI ✓

Deps ✓

Docker –

Selected salesforce/lavis as the strongest maintained implementation for new work.
Includes CI workflow signals.
Includes dependency/environment manifest signals.
Repository activity is within the last 24 months.

Reproduction readiness

Setup Required

Time to first repro: hours

Last checked: Jun 1, 2026

Dependencies pinned, manual setup needed

· salesforce/lavis has pyproject.toml but requires manual environment setup.
· Last push was 560 days ago — expect possible dependency version conflicts.
· No Dockerfile — you will set up the environment manually.
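
The 560-day figure follows from the two dates shown on this page. A minimal sketch of that arithmetic; the 365-day cutoff for the "No push in 12+ months" flag is an assumption, since the page does not state the exact threshold it uses:

```python
from datetime import date

last_push = date(2024, 11, 18)    # "Last push" from the repository card
last_checked = date(2026, 6, 1)   # "Last checked" date on this page

days_since_push = (last_checked - last_push).days
is_stale = days_since_push > 365  # assumed cutoff for "No push in 12+ months"

print(days_since_push, is_stale)  # prints: 560 True
```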

Quick start

git clone https://github.com/salesforce/lavis.git
cd lavis
pip install -e .
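
After installing, it can help to confirm that the package is visible in the active environment before trying to load a model. The sketch below uses only the standard library for the check. The load_pnp_vqa helper is hypothetical, and the "pnp_vqa" and "base" names passed to LAVIS are assumptions not confirmed by this page, so verify them against the repository README before relying on them:

```python
import importlib.util


def is_installed(package: str) -> bool:
    """Return True if a top-level package can be located without importing it."""
    return importlib.util.find_spec(package) is not None


def load_pnp_vqa(device: str = "cpu"):
    """Hypothetical helper: load the model through LAVIS.

    ASSUMPTION: the "pnp_vqa" / "base" identifiers are not confirmed by this
    page; check the LAVIS README or model zoo for the exact names.
    """
    from lavis.models import load_model_and_preprocess

    return load_model_and_preprocess(
        name="pnp_vqa", model_type="base", is_eval=True, device=device
    )


if is_installed("lavis"):
    print("lavis is importable; load_pnp_vqa() can be tried next")
else:
    print("lavis not found; re-run the quick-start commands in this environment")
```

The model load is kept inside a function so that the install check runs without downloading any weights.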

Hugging Face artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.

Models

Model	Relation	Downloads	Likes
ostris/zimage_turbo_training_adapter	Curated Related	82,180	137
ostris/FLUX.1-schnell-training-adapter	Curated Related	1,338	92
allenai/OLMo-2-0425-1B-early-training	Curated Related	805	6

Datasets

No trustworthy dataset matches right now.

Spaces

No trustworthy demo spaces right now.

Research context

Tasks

Question answering

Methods

None detected

Domains

None detected

