OpenTrain AI
Maintained implementation availablepytorchPretrained Models Available

DAPE: Data-Adaptive Positional Encoding for Length Extrapolation

May 1, 2024arXiv: 2405.14722
2 repos41 stars~a few hours to reproduce
arXiv PDF

Abstract

Results & Benchmarks

TaskDatasetMetricValue
Data-adaptive Positional Encoding Length ExtrapolationArxivPerplexity0.0216

Best Implementation

The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"

41 0 Oct 2024 Apache-2.0
License
CI
Deps
Docker
  • Selected chuanyang-zheng/cape as the strongest maintained implementation for new work.
  • Repository activity is within the last 24 months.
  • Official repository is preserved separately as historical context.

Reproduction Path

  1. 1

    Start with chuanyang-zheng/cape and validate setup instructions in README.

  2. 2

    Reproduce the baseline result with the provided defaults before modifying hyperparameters.

  3. 3

    Log exact dependency versions and runtime environment for reproducibility.

Time to first repro: a few hoursNo CI workflows detectedDependency manifest is missing

Additional Implementations

No additional verified repositories beyond the primary recommendation.

Hugging Face Artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts.