Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
- Stars
- 35
- Last push
- Dec 17, 2024 (550d ago)
Risk flags
- No push in 12+ months
- No CI pipeline detected
- No tagged releases
Paper appears method- or tooling-adjacent to AI workflows with partial ecosystem coverage.
No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.
Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities is the primary contribution described in this paper.
alexandervnikitin/kernel-language-entropy is the strongest maintained implementation based on ranking signals. License is declared (BSD-3-Clause-Clear). Dependency/environment manifests are present.
Open alexandervnikitin/kernel-language-entropyEvidence graph: 3 refs, 3 links.
Utility signals: depth 55/100, grounding 75/100, status medium.
Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.
Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
Risk flags
Community adoption signal (477 stars)
Risk flags
Matched via arXiv identifier search · Strong overlap with paper title keywords
Risk flags
Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)
Dependencies pinned, manual setup needed
Quick start
git clone https://github.com/alexandervnikitin/kernel-language-entropy.git
conda env create -f environment.yml && conda activate <env-name> No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.
No additional official repositories detected.
Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)
These repositories had low-confidence matching signals and are hidden by default.
No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.
No trustworthy model matches right now.
Search models on Hugging FaceBroaden dataset search
Evaluation & Human Feedback Data
Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.
Open in HFEPXNeed human evaluators for your AI research? Scale annotation with expert AI Trainers.