implementation starting point
Benchmarks: thin evidence
Time to repro: a few hours

Results & Benchmarks

Freshness tier: cold
Direct + Inferred Evidence
Natural language processing · Bamboo-7B-PowerInfer · GSM8K: 70.54 (Source: paper fulltext)
Natural language processing · Bamboo-7B-dense · GSM8K: 70.28 (Source: paper fulltext)

Benchmark evidence drill-down

2 findings

Audit each benchmark finding before selecting an implementation path. Evidence refs map to the disclosure section below.

Task | Dataset | Metric | Value | Source | Evidence refs
Natural language processing | Bamboo-7B-PowerInfer | GSM8K | 70.54 | paper-derived | No explicit refs
Natural language processing | Bamboo-7B-dense | GSM8K | 70.28 | paper-derived | No explicit refs
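
As a quick sanity check on the two findings, the gap between the rows can be computed directly. This is a minimal sketch that uses only the two GSM8K values listed here; the dictionary name `gsm8k_scores` is illustrative and does not come from the paper or the repository.

```python
# GSM8K values copied from the benchmark findings on this page (paper-derived).
gsm8k_scores = {
    "Bamboo-7B-PowerInfer": 70.54,
    "Bamboo-7B-dense": 70.28,
}

# Absolute difference in points, rounded to suppress floating-point noise.
delta = round(
    gsm8k_scores["Bamboo-7B-PowerInfer"] - gsm8k_scores["Bamboo-7B-dense"], 2
)
print(f"PowerInfer minus dense: {delta:+.2f} points")
```

With only two paper-derived values and no explicit evidence refs, a gap of this size is probably best read as roughly equal accuracy rather than as a measured improvement.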

The primary contribution of this paper is PowerInfer, described in "PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU".

Use This Implementation Because…

Confidence: high

sjtu-ipads/powerinfer is the strongest maintained implementation based on ranking signals. CI workflows are present. License is declared (MIT).

Reproduction Risks

  • No repository-level red flags were detected, but paper-specific preprocessing and hyperparameter details may still be under-specified.

Evidence disclosure

Evidence graph: 3 refs, 3 links.

Utility signals: depth 90/100, grounding 85/100, status high.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

sjtu-ipads/powerinfer
best maintained
Maintenance: Recently updated
Confidence: High
Reproducibility: Strong

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 9,570
Last push: May 11, 2026 (40d ago)
CI · Dependencies

Risk flags

  • No tagged releases
  • No Docker setup

Tiiny-AI/PowerInfer
alternative
Maintenance: Recently updated
Confidence: Medium
Reproducibility: Strong

Matched via arXiv identifier search · Strong overlap with paper title keywords

Stars: 9,570
Last push: May 11, 2026 (40d ago)
CI · Dependencies

Risk flags

  • No tagged releases
  • No Docker setup

sanbuphy/SmartPaper
alternative
Maintenance: Stale
Confidence: Low
Reproducibility: Moderate

Matched via arXiv identifier search · Community adoption signal (57 stars)

Stars: 57
Last push: May 6, 2025 (410d ago)
Dependencies

Risk flags

  • No push in 12+ months
  • No CI pipeline detected
  • No tagged releases
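
The "Nd ago" figures and the Stale / Recently updated labels in this comparison come down to date arithmetic on each repository's last push. The sketch below reproduces that arithmetic under two stated assumptions: the reference date of June 20, 2026 is inferred from the "40d ago" and "410d ago" values rather than stated on the page, and the 365-day cutoff is a hypothetical stand-in for "12+ months". The function names are illustrative.

```python
from datetime import date

# Assumption: reference date inferred from "40d ago" / "410d ago" on this page.
REFERENCE_DATE = date(2026, 6, 20)
# Assumption: hypothetical threshold standing in for "No push in 12+ months".
STALE_AFTER_DAYS = 365

# Last-push dates as listed in the comparison.
last_push = {
    "sjtu-ipads/powerinfer": date(2026, 5, 11),
    "Tiiny-AI/PowerInfer": date(2026, 5, 11),
    "sanbuphy/SmartPaper": date(2025, 5, 6),
}


def days_since(pushed: date, today: date = REFERENCE_DATE) -> int:
    """Whole days elapsed between the last push and the reference date."""
    return (today - pushed).days


def maintenance_label(pushed: date, today: date = REFERENCE_DATE) -> str:
    """Label a repository by how long ago it was last pushed."""
    if days_since(pushed, today) >= STALE_AFTER_DAYS:
        return "Stale"
    return "Recently updated"


for repo, pushed in last_push.items():
    print(f"{repo}: {days_since(pushed)}d ago, {maintenance_label(pushed)}")
```

Re-running the check with the current date in place of `REFERENCE_DATE` gives a fresher maintenance reading than the cached labels here.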

Best implementation now

sjtu-ipads/powerinfer
Confidence: High
Reproducibility: Strong

High-speed Large Language Model Serving for Local Deployment

Stars: 9,570
Forks: 582
Last push: May 11, 2026
License: MIT
Official implementation from Papers with Code
Repository link is mentioned in the paper metadata
Strong overlap with paper title keywords
Community adoption signal (9,570 stars)
License ✓
CI ✓
Deps ✓
Docker –
  • Selected sjtu-ipads/powerinfer as the strongest maintained implementation for new work.
  • Includes CI workflow signals.
  • Includes dependency/environment manifest signals.
  • Repository activity is within the last 24 months.

Reproduction readiness

Ready to Run
Time to first repro: hours
Last checked: Jun 18, 2026

Ready to reproduce

  • Clone sjtu-ipads/powerinfer and install dependencies from requirements.txt.
  • CI pipeline detected: automated tests are in place.
  • Last updated 40 days ago.

Quick start

git clone https://github.com/sjtu-ipads/powerinfer.git
cd powerinfer
pip install -r requirements.txt

Additional implementations

Official

No additional official repositories detected.

Community

  • Tiiny-AI/PowerInfer
    Confidence: Medium

    High-speed Large Language Model Serving for Local Deployment

    Stars: 9,570
    Last push: May 11, 2026
    License: MIT

These repositories had low-confidence matching signals and are hidden by default.

Showing top 6 by score. 1 additional low-confidence match is hidden.

Hugging Face artifacts

No trustworthy direct or curated related Hugging Face artifacts were found yet.

Continue with targeted Hugging Face searches derived from the paper title and method context.

Tip: start with models, then check datasets/spaces if you need evaluation data or demos.

Research context

Tasks

Natural language processing

Methods

Transformer

Domains

Natural Language Processing

Evaluation & Human Feedback Data

Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.


Explore Similar Papers

Jump to Paper2Code search queries derived from this paper's research context.
