Maintained implementation available | pytorch

MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts

April 1, 2024 | arXiv: 2404.15159

2 repos | 203 stars | ~a few hours to reproduce

Abstract

Summary

MixLoRA is a parameter-efficient fine-tuning (PEFT) method that extends LoRA with a Mixture-of-Experts design for large language models, implemented within the MoE-PEFT framework. This page reports benchmark results for PEFT fine-tuning of LLaMA2-7B on the commonsense QA tasks ARC-e, ARC-c, BoolQ, and OBQA. The reproduction guidance covers implementation viability and concrete risk controls.

Key Contributions

MixLoRA is a parameter-efficient fine-tuning method that extends LoRA with a Mixture-of-Experts design for large language models, implemented within the MoE-PEFT framework.
The MixLoRA implementation is designed to be used via the MoE-PEFT library and also provides integrations with HuggingFace Transformers for inference.
The official MixLoRA repository includes scripts to reproduce the evaluation results by running MoE-PEFT v1.0.1 or later with predefined launch configurations.
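
To make the idea of combining LoRA with a Mixture-of-Experts design concrete, here is a minimal pure-Python sketch. It is not the MoE-PEFT implementation. It assumes a softmax router that selects the top-k LoRA experts per input and adds their gate-weighted low-rank updates to a shared frozen projection. The names `lora_delta` and `mixlora_forward`, the toy dimensions, and the default `top_k=2` are illustrative assumptions.

```python
import math

def matvec(matrix, vector):
    """Multiply a row-major matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def lora_delta(expert, x, scaling):
    """Scaled low-rank update B @ (A @ x); A is r x d_in, B is d_out x r."""
    hidden = matvec(expert["A"], x)
    return [scaling * v for v in matvec(expert["B"], hidden)]

def mixlora_forward(x, base_weight, router_weight, experts, top_k=2, scaling=1.0):
    """Frozen base projection plus a gate-weighted sum of the top-k LoRA experts.

    Assumption: gates are renormalised over the selected experts only.
    """
    probs = softmax(matvec(router_weight, x))
    chosen = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in chosen)
    out = matvec(base_weight, x)  # the base weight is never updated
    for i in chosen:
        gate = probs[i] / norm
        delta = lora_delta(experts[i], x, scaling)
        out = [o + gate * d for o, d in zip(out, delta)]
    return out, chosen, probs

# Toy usage: 2-dimensional input, three rank-1 experts (all values are made up).
x = [1.0, 2.0]
base = [[1.0, 0.0], [0.0, 1.0]]
router = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
experts = [{"A": [[1.0, 0.0]], "B": [[0.5], [0.0]]} for _ in range(3)]
output, selected, gate_probs = mixlora_forward(x, base, router, experts)
```

Only the `A` and `B` matrices and the router would be trained in a setup like this, which is what keeps the parameter count small relative to full fine-tuning.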

Implementation Guidance

Start with TUDB-Labs/MixLoRA: it is the only verified repository listed for this paper and shows license, CI, and dependency-manifest signals. Follow the repo setup path first, then validate benchmark reproduction before adapting the method.
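
Validating benchmark reproduction can be made mechanical by comparing each reproduced score against the reported one within a tolerance. The sketch below is one way to do that. The helper `check_reproduction` and the 1.0-point tolerance are assumptions, the reported value is the ARC(Avg) accuracy from the results table on this page, and the reproduced value is a made-up placeholder.

```python
def check_reproduction(reported, reproduced, tolerance=1.0):
    """Compare scores per benchmark and return {name: (delta, ok)}."""
    report = {}
    for name, target in reported.items():
        if name not in reproduced:
            report[name] = (None, False)  # a missing benchmark counts as a failure
            continue
        delta = round(reproduced[name] - target, 4)
        report[name] = (delta, abs(delta) <= tolerance)
    return report

# Reported value comes from the results table on this page; the reproduced
# number is a hypothetical placeholder, not a measured result.
reported = {"ARC(Avg)": 71.3}
reproduced = {"ARC(Avg)": 70.8}
result = check_reproduction(reported, reproduced)
```

A suitable tolerance depends on seed variance for the benchmark, so it is worth estimating from a few repeated runs instead of fixing it in advance.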

Reproducibility Notes

Reproduction quality may degrade if the original preprocessing pipeline and task mixing details from the paper are not matched closely.
Differences in hyperparameters such as router loss coefficient, rank, or training schedule from those used in the paper can lead to noticeably different benchmark scores.
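
One way to catch the hyperparameter drift described above is to diff a run's configuration against a reference before training starts. The sketch below assumes both are available as flat dictionaries. The key names (`router_loss_coef`, `lora_rank`, and so on) are hypothetical and are not MoE-PEFT configuration keys, and the values are placeholders, not settings taken from the paper.

```python
WATCHED_KEYS = ("router_loss_coef", "lora_rank", "learning_rate", "num_epochs")

def diff_hyperparameters(reference, actual, watched=WATCHED_KEYS):
    """Return {key: (reference_value, actual_value)} for watched keys that differ."""
    mismatches = {}
    for key in watched:
        ref_value = reference.get(key)
        run_value = actual.get(key)
        if ref_value != run_value:
            mismatches[key] = (ref_value, run_value)
    return mismatches

# Placeholder values for illustration only; substitute the settings from the
# paper or the repository's launch configuration.
reference_config = {"router_loss_coef": 0.01, "lora_rank": 16,
                    "learning_rate": 2e-4, "num_epochs": 2}
run_config = {"router_loss_coef": 0.01, "lora_rank": 8,
              "learning_rate": 2e-4, "num_epochs": 2}
drift = diff_hyperparameters(reference_config, run_config)
```

An empty result means the watched settings match. Any entry names a setting to reconcile before comparing scores.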

Results & Benchmarks

Method	Dataset	Metric	Value
LoRA	ARC	Router Loss Coef.	77.7
LoRA	ARC(Avg)	Accuracy	71.3

Best Implementation

TUDB-Labs/MixLoRA

State-of-the-art Parameter-Efficient MoE Fine-tuning Method

203 19 Aug 2024 Apache-2.0

License ✓

CI ✓

Deps ✓

Docker –

Selected TUDB-Labs/MixLoRA as the strongest maintained implementation for new work.
Includes CI workflow signals.
Includes dependency/environment manifest signals.
Repository activity is within the last 24 months.

Reproduction Path

1. Start with TUDB-Labs/MixLoRA and validate the setup instructions in the README.
2. Reproduce the baseline result with the provided defaults before modifying hyperparameters.
3. Log exact dependency versions and the runtime environment for reproducibility.
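
For step 3, a small snapshot of the interpreter, operating system, and installed package versions can be written alongside each run. The sketch below uses only the Python standard library. The function name `snapshot_environment` and the package list are assumptions, so list whatever the run actually imports.

```python
import json
import platform
import sys
from importlib import metadata

def snapshot_environment(packages):
    """Collect interpreter, OS, and installed package versions for a run log."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None  # not installed in this environment
    return {
        "python": sys.version.split()[0],
        "platform": platform.platform(),
        "packages": versions,
    }

if __name__ == "__main__":
    # Example package names only; adjust to the dependencies of the run.
    print(json.dumps(snapshot_environment(["torch", "transformers"]), indent=2))
```

Saving this JSON next to the evaluation output makes it possible to tell later whether a score difference coincides with a dependency change.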

Time to first repro: a few hours

No repository-level red flags were detected, but paper-specific preprocessing and hyperparameter details may still be under-specified.

Additional Implementations

No additional verified repositories beyond the primary recommendation.

Hugging Face Artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts.

Curated Related

Kevin3777/Enhancing_Intent_Understanding
11 1

Research Context