No verified implementation yet

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Jeonghye Kim, Xufang Luo, Minbeom Kim, Sangmook Lee, Dohyung Kim +3 more

March 25, 2026arXiv: 2603.24472

1 repo~a few hours to reproduce

Abstract

Self-distillation has emerged as an effective post-training paradigm for LLMs, often improving performance while shortening reasoning traces. However, in mathematical reasoning, we find that it can reduce response length while degrading performance. We trace this degradation to the suppression of epistemic verbalization - the model's expression of uncertainty during reasoning. Through controlled experiments varying c...

Best Implementation

Maintained implementation evidence is not confirmed for this paper yet.

Use the Implementation Status and Reproduction Path sections below for the current action plan.

Reproduction Path

Follow this baseline workflow to decide if this paper is worth immediate prototyping.

1
This is primarily a method paper. Reproduce it within a maintained framework baseline instead of chasing paper-specific repos.
2
Start with framework-native implementations (e.g. PyTorch optimizer module, Optax, or Transformers training loops).
3
Replicate the paper ablation settings first, then compare against modern baselines.

Time to first repro: a few hoursNo maintained paper-verified implementation is currently available

Additional Implementations

Official

No additional official repositories detected.

Community

beanie00/self-distillation-analysisConfidence: low
Codebase for the work “Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?”
Stars: 46Forks: 6Last push: Mar 2026License: Apache-2.0

Hugging Face Artifacts

No trustworthy direct or curated related Hugging Face artifacts were found yet.

Continue with targeted Hugging Face searches:

models

arxiv:2603.24472 Self-Distillation LLMs

datasets

arxiv:2603.24472 Self-Distillation dataset Reasoning / puzzle solving dataset

spaces

arxiv:2603.24472 Self-Distillation demo Reasoning / puzzle solving demo

Research Context

Tasks

Reasoning / puzzle solving