Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
- Stars
- 154
- Last push
- Apr 27, 2026 (4d ago)
Risk flags
- No Docker setup
- Dependency manifest missing
Paper appears method- or tooling-adjacent to AI workflows with partial ecosystem coverage.
Some benchmark signal exists in the extracted evidence, but it is not structured strongly enough yet for a confident benchmark decision.
Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC is the primary contribution described in this paper.
stan-dev/loo is the strongest maintained implementation based on ranking signals. CI workflows are present. License is declared (NOASSERTION).
Open stan-dev/looHardware Notes
Expect multi-day setup/compute for meaningful reproduction based on current guidance.
Evidence graph: 4 refs, 4 links.
Utility signals: depth 95/100, grounding 95/100, status high.
Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.
Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
Risk flags
Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
Risk flags
Strong overlap with paper title keywords · Community adoption signal (154 stars)
Risk flags
loo R package for approximate leave-one-out cross-validation (LOO-CV) and Pareto smoothed importance sampling (PSIS)
Preserved for provenance. Not recommended as the default path for new builds.
Hardware requirements
No dependency manifest — manual reconstruction required
No additional verified repositories beyond the primary recommendation.
These repositories had low-confidence matching signals and are hidden by default.
No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.
Broaden model search
No trustworthy dataset matches right now.
Search datasets on Hugging FaceBroaden demo search
Evaluation & Human Feedback Data
Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.
Open in HFEPXNeed human evaluators for your AI research? Scale annotation with expert AI Trainers.