OpenTrain AI
Maintained implementation available · PyTorch

MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

October 1, 2024 · arXiv: 2410.07348
1 repo · 264 stars · ~a few days to reproduce

Abstract

Results & Benchmarks

Task | Dataset | Metric | Value
Accelerating Mixture-of-experts Methods Zero-computation Experts | ARC | Accuracy | 0

Hardware Requirements

  • Expect multi-day setup/compute for meaningful reproduction based on current guidance.

Best Implementation

[ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

264 13 Oct 2024 Apache-2.0
  • Selected skyworkai/moe-plus-plus as the strongest maintained implementation for new work.
  • Repository activity is within the last 24 months.

Reproduction Path

  1. Start with skyworkai/moe-plus-plus and validate the setup instructions in its README.

  2. Reproduce the baseline result with the provided defaults before modifying hyperparameters.

  3. Log exact dependency versions and runtime environment for reproducibility.
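
The logging step in the reproduction path can be sketched with the Python standard library alone. This is a minimal illustration, not something taken from the repository: the function names `capture_environment` and `save_environment` and the output file name `environment_snapshot.json` are hypothetical, and the snapshot covers only the interpreter, the OS, and installed Python packages. GPU driver and CUDA versions would need to be recorded separately.

```python
import json
import platform
import sys
from importlib import metadata


def capture_environment():
    """Collect interpreter, OS, and installed-package versions as a plain dict."""
    packages = {}
    for dist in metadata.distributions():
        name = dist.metadata.get("Name")
        if name:  # skip distributions whose metadata lacks a name
            packages[name] = dist.metadata.get("Version", "unknown")
    return {
        "python": sys.version,
        "platform": platform.platform(),
        "machine": platform.machine(),
        # Sort case-insensitively so snapshots from two runs diff cleanly.
        "packages": dict(sorted(packages.items(), key=lambda kv: kv[0].lower())),
    }


def save_environment(path="environment_snapshot.json"):
    """Write the snapshot to a JSON file (hypothetical file name) and return it."""
    snapshot = capture_environment()
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(snapshot, fh, indent=2)
    return snapshot
```

Calling `save_environment()` once before the baseline run and committing the resulting file next to the results gives later runs something concrete to compare against, which matters here because the listing reports that a dependency manifest is missing.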

  • Time to first repro: a few days
  • No CI workflows detected
  • Dependency manifest is missing

Additional Implementations

No additional verified repositories beyond the primary recommendation.

Hugging Face Artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts.