What is the best open-source implementation of "Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control"?

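The highest-ranked implementation is aangelopoulos/ltt (74 GitHub stars, MIT license), the official repository linked from the paper metadata. Confidence: High. Reproducibility: Moderate. Note that its last push was Nov 17, 2024, so the repository is ranked best despite a stale maintenance status.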

What framework is used to implement "Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control"?

The primary implementation uses PyTorch.

Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control

How reproducible is "Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control"?

Estimated time to first reproduction: a few hours. Headline risk flag: no CI workflows detected. Start with aangelopoulos/ltt and validate the setup instructions in its README.

Published: Oct 1, 2021

Best maintained implementation now

Evidence: Direct

Domain fit: AI-adjacent

Verified repos: 1

Top repo stars: 74

Paper appears method- or tooling-adjacent to AI workflows with partial ecosystem coverage.

Framework: PyTorch

Time to first repro: a few hours

1 risk flag

Technical details

Canonical key: arxiv-2110.01052

Generated at: Jun 18, 2026, 7:49 PM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

Results & Benchmarks

Freshness tier: cold

Direct + Inferred Evidence

No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.

Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control is the primary contribution described in this paper.

Use This Implementation Because…

Confidence: high

aangelopoulos/ltt is the strongest maintained implementation based on ranking signals. License is declared (MIT). Dependency/environment manifests are present.

Reproduction Risks

No CI workflows detected

Evidence disclosure

Evidence graph: 3 refs, 3 links.

Utility signals: depth 55/100, grounding 75/100, status medium.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

aangelopoulos/ltt

best maintained

Maintenance: Stale

Confidence: High

Reproducibility: Moderate

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 74
Last push: Nov 17, 2024 (581d ago)

Dependencies

Risk flags

No push in 12+ months
No CI pipeline detected
No tagged releases

tail-unica/conformal_hopwise

alternative

Maintenance: Active

Confidence: Low

Reproducibility: Strong

Matched via arXiv identifier search · Partial overlap with paper title keywords

Stars: 1
Last push: May 29, 2026 (22d ago)

CI · Dependencies

Risk flags

No tagged releases
No Docker setup
Low confidence match

justwantrich/Text-risk-warning---Switch-to-manual-mode

alternative

Maintenance: Active

Confidence: Low

Reproducibility: Limited

Matched via arXiv identifier search

Stars: 0
Last push: Jun 10, 2026 (10d ago)

Dependencies

Risk flags

No CI pipeline detected
No tagged releases
No Docker setup

Best implementation now

aangelopoulos/ltt

Confidence: High

Reproducibility: Moderate

Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control

Stars: 74

Forks: 10

Last push: Nov 17, 2024

License: MIT

Official implementation from Papers with Code

Repository link is mentioned in the paper metadata

Strong overlap with paper title keywords

Community adoption signal (74 stars)

License ✓

CI –

Deps ✓

Docker –

Selected aangelopoulos/ltt as the strongest maintained implementation for new work.
Includes dependency/environment manifest signals.
Repository activity is within the last 24 months.

Reproduction readiness

Setup Required

Time to first repro: a few hours

Last checked: Jun 18, 2026

Dependencies pinned, manual setup needed

· aangelopoulos/ltt has environment.yml but requires manual environment setup.
· Last push was 581 days ago — expect possible dependency version conflicts.
· No Dockerfile — you will set up the environment manually.
· No CI pipeline — test coverage is unknown.

Quick start

git clone https://github.com/aangelopoulos/ltt.git
cd ltt
# <env-name> is the value of the name field in environment.yml
conda env create -f environment.yml && conda activate <env-name>
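Because the last push was 581 days ago and setup is manual, it may help to confirm that the key packages resolve inside the activated environment before running anything. The snippet below is a minimal sketch of such a check. The helper name `missing_packages` and the package list are assumptions made for illustration; the authoritative dependency list is the repository's environment.yml.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level package names that cannot be located.

    Uses importlib.util.find_spec, which looks a module up without
    importing it, so heavy packages are not loaded by this check.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Hypothetical list: the page names PyTorch as the framework; the rest
    # should be taken from environment.yml in the repository.
    required = ["torch", "numpy"]
    missing = missing_packages(required)
    print("missing:", missing if missing else "none")
```

Run it with the conda environment active; any name it reports is a candidate for a manual install or a version pin.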

No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.
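With no published numbers to compare against, one option is to sanity-check the core calibration step on synthetic losses where the correct answer is known in advance. The sketch below is a simplified illustration of the Learn then Test idea and is not code from aangelopoulos/ltt. It assumes losses bounded in [0, 1], uses a plain Hoeffding p-value, and applies a Bonferroni correction across the candidate parameters; the function names `hoeffding_pvalue` and `ltt_bonferroni` are hypothetical.

```python
import numpy as np


def hoeffding_pvalue(risk_hat, n, alpha):
    """P-value for the null 'true risk > alpha' from n losses in [0, 1].

    Hoeffding's inequality gives p = exp(-2 * n * max(alpha - risk_hat, 0)^2).
    """
    gap = max(alpha - risk_hat, 0.0)
    return float(np.exp(-2.0 * n * gap ** 2))


def ltt_bonferroni(losses, alpha, delta):
    """Return indices of candidate parameters certified to control risk.

    losses: array of shape (n_calibration, n_candidates), entries in [0, 1].
    alpha:  target risk level.
    delta:  allowed probability that any returned candidate violates alpha.
    """
    losses = np.asarray(losses, dtype=float)
    n, m = losses.shape
    risk_hat = losses.mean(axis=0)  # empirical risk per candidate
    pvals = np.array([hoeffding_pvalue(r, n, alpha) for r in risk_hat])
    # Bonferroni: reject each null at level delta / m.
    return np.flatnonzero(pvals <= delta / m)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    true_risks = np.linspace(0.0, 0.3, 7)  # synthetic, made-up risk levels
    losses = rng.binomial(1, true_risks, size=(2000, 7))
    print(ltt_bonferroni(losses, alpha=0.1, delta=0.1))
```

On synthetic data like this, any returned index should correspond to a true risk at or below alpha, except with probability at most delta. A reproduction that regularly certifies candidates whose true risk is above alpha has a bug in the testing step.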

Additional implementations

No additional verified repositories beyond the primary recommendation.

Possible but unverified matches (3)

These repositories had low-confidence matching signals and are hidden by default.

tail-unica/conformal_hopwise

Confidence: Low

Stars: 1

justwantrich/Text-risk-warning---Switch-to-manual-mode

Confidence: Low

Stars: 0

yyLabPhysAI/TS-LTT

Confidence: Low

Stars: 0

Hugging Face artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.

Models

No trustworthy model matches right now.

Datasets

EddyGiusepe/Modified_dataset_for_predictive_maintenance

Curated Related

Downloads: 102

Likes: 5

Updated: Jul 24, 2024

akash140500/Predictive_Maintenance_Dataset

Curated Related

Downloads: 71

Likes: 3

Updated: Dec 4, 2023

Spaces

AGC2024-P/predictive-world-model-2024

Curated Related

Likes: 11

KushagraisTaken/predictive-maintenance-AI4I

Curated Related

Likes: 2
