What is the best open-source implementation of "Going deeper with Image Transformers"?

The best-maintained implementation is rwightman/pytorch-image-models, with 36,732 stars on GitHub. Confidence: high. Reproducibility: Strong.

How reproducible is "Going deeper with Image Transformers"?

Estimated time to first reproduction: a few hours. No risk flags identified. Start with rwightman/pytorch-image-models and validate the setup instructions in its README.

What framework is used to implement "Going deeper with Image Transformers"?

The primary implementation uses PyTorch.

Going deeper with Image Transformers

Hugo Touvron, Matthieu Cord, Alexandre Sablayrolles, Gabriel Synnaeve, Hervé Jégou

Published: Mar 31, 2021

Best maintained implementation now

Evidence: Direct

Domain fit: AI-core

Verified repos: 2

Top repo stars: 36,732

Core AI workload signals detected from paper context and implementation/artifact evidence.

Framework: PyTorch

Time to first repro: a few hours

No risk flags


Transformers have been recently adapted for large scale image classification, achieving high scores shaking up the long supremacy of convolutional neural networks. However the optimization of image transformers has been little studied so far. In this work, we build and optimize deeper transformer networks for image classification. In particular, we investigate the interplay of architecture and optimization of such dedicated transformers. We make two transformers architecture changes that significantly improve the accuracy of deep transformers. This leads us to produce models whose performance does not saturate early with more depth, for instance we obtain 86.5% top-1 accuracy on Imagenet when training with no external data, we thus attain the current SOTA with less FLOPs and parameters. Moreover, our best model establishes the new state of the art on Imagenet with Reassessed labels and Imagenet-V2 / match frequency, in the setting with no additional training data. We share our code and models.

Technical details

Canonical key: arxiv-2103.17239

Cache status: Fresh

Generated at: Apr 30, 2026, 9:34 PM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

LLM status: not_generated

LLM model: n/a

LLM generated: Unknown

LLM content type: n/a

HF policy: hf-relevance-v27

implementation starting point

Benchmarks: thin evidence

Time to repro: a few hours

PyTorch

Results & Benchmarks

Freshness tier: cold

Direct + Inferred Evidence

Some benchmark signal exists in the extracted evidence, but it is not structured strongly enough yet for a confident benchmark decision.


Use This Implementation Because…

Confidence: high

rwightman/pytorch-image-models is the strongest maintained implementation based on ranking signals. CI workflows are present. License is declared (Apache-2.0).


Reproduction Risks

No repository-level red flags were detected, but paper-specific preprocessing and hyperparameter details may still be under-specified.

Evidence disclosure

Evidence graph: 3 refs, 3 links.

Utility signals: depth 90/100, grounding 85/100, status high.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

rwightman/pytorch-image-models

best maintained

Maintenance: Active

Confidence: High

Reproducibility: Strong

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 36,732
Last push: Apr 29, 2026 (2d ago)

CI · Releases · Dependencies

Risk flags

No Docker setup

facebookresearch/deit

historical official

Maintenance: Archived

Confidence: High

Reproducibility: Limited

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 4,340
Last push: Mar 15, 2024 (777d ago)

Dependencies

Risk flags

Repository archived
No push in 12+ months
No CI pipeline detected

DarshanDeshpande/jax-models

alternative

Maintenance: Stale

Confidence: Low

Reproducibility: Moderate

Community adoption signal (162 stars) · Repository appears stale (>24 months since last push)

Stars: 162
Last push: Jun 25, 2022 (1406d ago)

Releases · Dependencies

Risk flags

No push in 12+ months
No CI pipeline detected
No Docker setup
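
The "d ago" counts and the "No push in 12+ months" flags in the comparison above can be checked with a few lines of date arithmetic. The sketch below is illustrative only: the reference date of May 1, 2026 is an assumption (it is the date that reproduces the listed counts, and may differ from the local "Generated at" timestamp), and treating "12+ months" as more than 365 days is a simplification, not the site's documented rule. The names LAST_PUSH, REFERENCE_DATE, days_since_push and no_push_in_12_months are hypothetical.

```python
from datetime import date

# Last-push dates as listed in the comparison above.
LAST_PUSH = {
    "rwightman/pytorch-image-models": date(2026, 4, 29),
    "facebookresearch/deit": date(2024, 3, 15),
    "DarshanDeshpande/jax-models": date(2022, 6, 25),
}

# Assumption: day counts are measured against this date.
REFERENCE_DATE = date(2026, 5, 1)


def days_since_push(last_push, today=REFERENCE_DATE):
    """Whole days elapsed between the last push and the reference date."""
    return (today - last_push).days


def no_push_in_12_months(last_push, today=REFERENCE_DATE):
    """Assumption: '12+ months' is approximated as more than 365 days."""
    return days_since_push(last_push, today) > 365


if __name__ == "__main__":
    for repo, pushed in LAST_PUSH.items():
        print(repo, days_since_push(pushed), no_push_in_12_months(pushed))
```

Under these assumptions the two older repositories are flagged and rwightman/pytorch-image-models is not, which matches the risk flags listed for each entry.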

Best implementation now

rwightman/pytorch-image-models

Confidence: High

Reproducibility: Strong

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Stars: 36,732

Forks: 5,148

Last push: Apr 29, 2026

License: Apache-2.0

Official implementation from Papers with Code

Repository link is mentioned in the paper metadata

Partial overlap with paper title keywords

Community adoption signal (36,732 stars)

License ✓

CI ✓

Deps ✓

Docker –

Selected rwightman/pytorch-image-models as the strongest maintained implementation for new work.
Includes CI workflow signals.
Includes dependency/environment manifest signals.
Repository activity is within the last 24 months.

Historical official implementation

Preserved for provenance. Not recommended as the default path for new builds.

facebookresearch/deit

Stars: 4,340

Last push: Mar 15, 2024

Archived

Reproduction readiness

Ready to Run

Time to first repro: a few hours

Last checked: Apr 30, 2026

Ready to reproduce

· Clone rwightman/pytorch-image-models and install dependencies from pyproject.toml.
· CI pipeline detected — automated tests are in place.
· Last updated 2 days ago.


Quick start

git clone https://github.com/rwightman/pytorch-image-models.git
cd pytorch-image-models
pip install -e .
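
Once the package is installed, a quick sanity check is to confirm that the paper's architecture is available before attempting a full reproduction. The sketch below assumes the library is importable as timm and that the models from this paper are registered under names starting with "cait" (this page does not state the model names, so treat the "cait*" pattern and the default "cait_xxs24_224" as assumptions and confirm them against the output of list_models). The helpers list_cait_models and build_cait are hypothetical names for illustration, and the code degrades to empty results if timm is not installed.

```python
# Assumption: timm (and its PyTorch dependency) is installed, e.g. via the
# quick start steps; if not, the helpers below return empty results.
try:
    import timm
except ImportError:
    timm = None


def list_cait_models():
    """Return registered model names matching 'cait*', or [] without timm."""
    if timm is None:
        return []
    return timm.list_models("cait*")


def build_cait(name="cait_xxs24_224"):
    """Create a randomly initialised model; returns None without timm.

    pretrained=False avoids downloading weights, so this only checks
    that the architecture can be constructed.
    """
    if timm is None:
        return None
    return timm.create_model(name, pretrained=False)


if __name__ == "__main__":
    names = list_cait_models()
    print(f"{len(names)} matching models:", names)
```

To evaluate rather than just construct a model, pass pretrained=True to create_model, which downloads weights if they are published for that name.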

Additional implementations

No additional verified repositories beyond the primary recommendation.

Possible but unverified matches (1)

These repositories had low-confidence matching signals and are hidden by default.

DarshanDeshpande/jax-models

Confidence: Low

Stars: 162

Hugging Face artifacts

No trustworthy direct or curated related Hugging Face artifacts were found yet.

Continue with targeted Hugging Face searches derived from the paper title and method context:

Models

arxiv:2103.17239 Transformer Image classification

Datasets

arxiv:2103.17239 Image classification dataset Transformer benchmark

Spaces

arxiv:2103.17239 Image classification demo Transformer gradio

Tip: start with models, then check datasets/spaces if you need evaluation data or demos.

Direct artifact matches are currently sparse. Use targeted Hugging Face searches to quickly locate candidate models, datasets, and demos.


Research context

Tasks

Image classification

Methods

Transformer

Domains

Computer vision

Evaluation & Human Feedback Data

Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.


