What is the best open-source implementation of "VOLO: Vision Outlooker for Visual Recognition"?

The best maintained implementation is rwightman/pytorch-image-models with 36,893 stars on GitHub. Confidence: high. Reproducibility: Strong.

How reproducible is "VOLO: Vision Outlooker for Visual Recognition"?

Estimated time to first reproduction: a few hours. No risk flags identified. Start with rwightman/pytorch-image-models and validate setup instructions in README.

What framework is used to implement "VOLO: Vision Outlooker for Visual Recognition"?

The primary implementation uses pytorch.

VOLO: Vision Outlooker for Visual Recognition

Published: Jun 1, 2021

Best maintained implementation now

Evidence: Direct

Domain fit: AI-adjacent

Verified repos: 2

Top repo stars: 36,893

Paper appears method- or tooling-adjacent to AI workflows with partial ecosystem coverage.

Framework: pytorch

Time to first repro: a few hours

No risk flags

arXiv PDF

Technical details

Canonical key: arxiv-2106.13112

Cache status: Fresh

Generated at: Jun 18, 2026, 3:29 AM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

LLM status: not_generated

LLM model: n/a

LLM generated: Unknown

LLM content type: n/a

HF policy: hf-relevance-v27

implementation starting point

Benchmarks: missing

Time to repro: a few hours

pytorch

Results & Benchmarks

Freshness tier: cold

Direct + Inferred Evidence

No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.

VOLO: Vision Outlooker for Visual Recognition is the primary contribution described in this paper.

Use This Implementation Because…

Confidence: high

rwightman/pytorch-image-models is the strongest maintained implementation based on ranking signals. CI workflows are present. License is declared (Apache-2.0).

Open rwightman/pytorch-image-models

Reproduction Risks

No repository-level red flags were detected, but paper-specific preprocessing and hyperparameter details may still be under-specified.

Evidence disclosure

Evidence graph: 3 refs, 3 links.

Utility signals: depth 55/100, grounding 75/100, status medium.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

rwightman/pytorch-image-models

best maintained

Maintenance: Active

Confidence: High

Reproducibility: Strong

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 36,893
Last push: Jun 3, 2026 (15d ago)

CIReleasesDependencies

Risk flags

No Docker setup

sail-sg/volo

historical official

Maintenance: Stale

Confidence: High

Reproducibility: Limited

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 948
Last push: Sep 18, 2022 (1369d ago)

Releases

Risk flags

No push in 12+ months
No CI pipeline detected
No Docker setup

xmu-xiaoma666/External-Attention-pytorch

alternative

Maintenance: Recently updated

Confidence: Low

Reproducibility: Limited

Community adoption signal (12177 stars)

Stars: 12,177
Last push: Mar 16, 2026 (94d ago)

Risk flags

No CI pipeline detected
No tagged releases
No Docker setup

Best implementation now

rwightman/pytorch-image-models

Confidence: High

Reproducibility: Strong

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Stars: 36,893

Forks: 5,166

Last push: Jun 3, 2026

License: Apache-2.0

Official implementation from Papers with Code

Repository link is mentioned in the paper metadata

Community adoption signal (36893 stars)

License ✓

CI ✓

Deps ✓

Docker –

Selected rwightman/pytorch-image-models as the strongest maintained implementation for new work.
Includes CI workflow signals.
Includes dependency/environment manifest signals.
Repository activity is within the last 24 months.

Historical official implementation

Preserved for provenance. Not recommended as the default path for new builds.

sail-sg/volo

Stars: 948

Last push: Sep 18, 2022

Reproduction readiness

Ready to Run

Time to first repro: hours

Last checked: Jun 18, 2026

Ready to reproduce

· Clone rwightman/pytorch-image-models and install dependencies from pyproject.toml.
· CI pipeline detected — automated tests are in place.
· Last updated 15 days ago.

Open rwightman/pytorch-image-models

Quick start

git clone https://github.com/rwightman/pytorch-image-models.git
pip install -e .

No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.

Additional implementations

No additional verified repositories beyond the primary recommendation.

Possible but unverified matches (5)

These repositories had low-confidence matching signals and are hidden by default.

xmu-xiaoma666/External-Attention-pytorch

Confidence: Low

Stars: 12,177
leondgarse/keras_cv_attention_models

Confidence: Low

Stars: 627
BR-IDL/PaddleViT

Confidence: Low

Stars: 1,237
AIFengheshu/Plug-play-modules

Confidence: Low

Stars: 1,559
Jittor-Image-Models/Jittor-Image-Models

Confidence: Low

Stars: 53

Hugging Face artifacts

No trustworthy direct or curated related Hugging Face artifacts were found yet.

Continue with targeted Hugging Face searches derived from the paper title and method context:

Models

arxiv:2106.13112 VOLO Computer vision

Datasets

arxiv:2106.13112 VOLO dataset

Spaces

arxiv:2106.13112 VOLO demo

Tip: start with models, then check datasets/spaces if you need evaluation data or demos.

Direct artifact matches are currently sparse. Use targeted Hugging Face searches to quickly locate candidate models, datasets, and demos.

Search models Search datasets Search spaces

Research context

Tasks

None detected

Methods

None detected

Domains

Computer vision

Evaluation & Human Feedback Data

Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.

Open in HFEPX

Explore Similar Papers

Jump to Paper2Code search queries derived from this paper's research context.

Computer vision

Need human evaluators for your AI research? Scale annotation with expert AI Trainers.

Post a Job Get a Quote