Skip to content
implementation starting point
Benchmarks: missing
Time to repro: a few days
3 risk flags
none

Results & Benchmarks

Freshness tier: cold
Direct + Inferred Evidence

No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.

Assemblage: Automatic Binary Dataset Construction for Machine Learning is the primary contribution described in this paper.

Use This Implementation Because…

Confidence: high

assemblage-dataset/assemblage is the strongest maintained implementation based on ranking signals.

Open assemblage-dataset/assemblage

Reproduction Risks

  • License metadata missing
  • No CI workflows detected
  • Dependency manifest is missing

Hardware Notes

Expect multi-day setup/compute for meaningful reproduction based on current guidance.

Evidence disclosure

Evidence graph: 4 refs, 4 links.

Utility signals: depth 65/100, grounding 85/100, status medium.

Implementation Comparison

Top 1 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

Maintenance: Recently updated
Confidence: High
Reproducibility: Limited

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars
45
Last push
May 5, 2026 (46d ago)

Risk flags

  • No CI pipeline detected
  • No tagged releases
  • No Docker setup

Best implementation now

assemblage-dataset/assemblage
Confidence: High
Reproducibility: Limited

Assemblage-Dataset/Assemblage

Stars: 45
Forks: 6
Last push: May 5, 2026
Official implementation from Papers with Code
Repository link is mentioned in the paper metadata
Partial overlap with paper title keywords
Community adoption signal (45 stars)
License –
CI –
Deps –
Docker –
  • Selected assemblage-dataset/assemblage as the strongest maintained implementation for new work.
  • Repository activity is within the last 24 months.

Reproduction readiness

Major Work
Time to first repro: days
Last checked: Jun 18, 2026

Hardware requirements

  • Expect multi-day setup/compute for meaningful reproduction based on current guidance.

No dependency manifest — manual reconstruction required

  • · assemblage-dataset/assemblage has no requirements.txt, environment.yml, pyproject.toml, or Dockerfile.
  • · You will need to reverse-engineer dependencies from import statements in the source code.
Open assemblage-dataset/assemblage

No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.

Hugging Face artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.

Models

Datasets

Spaces

No trustworthy demo spaces right now.

Search spaces on Hugging Face

Research context

Evaluation & Human Feedback Data

Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.

Open in HFEPX

Need human evaluators for your AI research? Scale annotation with expert AI Trainers.