Skip to content
implementation starting point
Benchmarks: thin evidence
Time to repro: a few hours
1 risk flag
none

Results & Benchmarks

Freshness tier: cold
Direct + Inferred Evidence
Autonomous driving
COCO
AP
50
Source: paper fulltext
Autonomous driving
PASCAL VOC
AP
50
Source: paper fulltext

Benchmark evidence drill-down

2 findings

Audit each benchmark finding before selecting an implementation path. Evidence refs map to the disclosure section below.

Task Dataset Metric Value Source Evidence refs
Autonomous driving COCO AP 50 paper-derived No explicit refs
Autonomous driving PASCAL VOC AP 50 paper-derived No explicit refs

The ability to detect objects regardless of image distortions or weather conditions is crucial for real-world applications of deep learning like autonomous driving.

Use This Implementation Because…

Confidence: high

bethgelab/imagecorruptions is the strongest maintained implementation based on ranking signals. License is declared (Apache-2.0). Dependency/environment manifests are present.

Open bethgelab/imagecorruptions

Reproduction Risks

  • No CI workflows detected
Evidence disclosure

Evidence graph: 4 refs, 4 links.

Utility signals: depth 90/100, grounding 95/100, status high.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

Maintenance: Stale risk
Confidence: High
Reproducibility: Moderate

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars
470
Last push
May 6, 2025 (361d ago)
ReleasesDependencies

Risk flags

  • No CI pipeline detected
  • No Docker setup
bethgelab/stylize-datasets
historical official
Maintenance: Stale
Confidence: High
Reproducibility: Moderate

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars
166
Last push
Jul 5, 2021 (1762d ago)
Dependencies

Risk flags

  • No push in 12+ months
  • No CI pipeline detected
  • No tagged releases
Maintenance: Stale
Confidence: High
Reproducibility: Limited

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars
194
Last push
Jul 25, 2019 (2473d ago)
Releases

Risk flags

  • No push in 12+ months
  • No CI pipeline detected
  • No Docker setup

Best implementation now

bethgelab/imagecorruptions
Confidence: High
Reproducibility: Moderate

Python package to corrupt arbitrary images.

Stars: 470
Forks: 75
Last push: May 6, 2025
License: Apache-2.0
Official implementation from Papers with Code
Repository link is mentioned in the paper metadata
Matched via arXiv identifier search
Community adoption signal (470 stars)
License ✓
CI –
Deps ✓
Docker –
  • Selected bethgelab/imagecorruptions as the strongest maintained implementation for new work.
  • Includes dependency/environment manifest signals.
  • Repository activity is within the last 24 months.
  • Official repository is preserved separately as historical context.

Historical official implementation

Preserved for provenance. Not recommended as the default path for new builds.

bethgelab/stylize-datasets
Stars: 166
Last push: Jul 5, 2021

Reproduction readiness

Setup Required
Time to first repro: hours
Last checked: May 1, 2026

Dependencies pinned, manual setup needed

  • · bethgelab/imagecorruptions has requirements.txt but requires manual environment setup.
  • · Last push was 361 days ago — expect possible dependency version conflicts.
  • · No Dockerfile — you will set up the environment manually.
  • · No CI pipeline — test coverage is unknown.
Open bethgelab/imagecorruptions

Quick start

git clone https://github.com/bethgelab/imagecorruptions.git
pip install -r requirements.txt

Additional implementations

Official

  • Code, data and benchmark from the paper "Benchmarking Robustness in Object Detection: Autonomous Driving when Winter is Coming" (NeurIPS 2019 ML4AD)

    Stars: 194
    Forks: 24
    Last push: Jul 25, 2019
    License: MIT
  • bethgelab/mmdetection
    Confidence: High

    Fork of the MMDetection Toolbox containing the Robustness Benchmark from the paper "Benchmarking Robustness in Object Detection: Autonomous Driving when Winter is Coming" (merged)

    Stars: 6
    Forks: 5
    Last push: Aug 5, 2019
    License: Apache-2.0

Community

No additional community repositories detected yet.

These repositories had low-confidence matching signals and are hidden by default.

Hugging Face artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.

Models

Datasets

Research context

Tasks

Autonomous driving

Methods

None detected

Domains

Computer vision, Autonomous Driving

Evaluation & Human Feedback Data

Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.

Open in HFEPX

Explore Similar Papers

Jump to Paper2Code search queries derived from this paper's research context.

Need human evaluators for your AI research? Scale annotation with expert AI Trainers.