HFEPX Archive Slice
HFEPX Daily Papers for 2026-05-20
Daily archive slice for 2026-05-20 from the HFEPX corpus. Updated from the current HFEPX corpus (2026-06-08); covers 1 paper from 2026-05-20.
Use this archive page for time-slice monitoring: what changed in evaluation methods, metrics, and protocol quality during this period. Quality band: Developing.
High-Signal Coverage
100.0%
1 of 1 papers is not flagged as low-signal.
Benchmark Anchors
0.0%
Papers with benchmark/dataset mentions in extraction output.
Metric Anchors
0.0%
Papers with reported metric mentions in extraction output.
Primary action: Use this slice as early signal only; benchmark/metric anchoring is limited for rigorous period-over-period claims.
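The three coverage figures above are simple proportions over the papers in the slice. The following is a minimal sketch of that arithmetic, not the HFEPX pipeline itself: the `PaperRecord` fields and the `coverage` helper are hypothetical names introduced here for illustration, and the single record mirrors what this page reports (one paper, not low-signal flagged, with no benchmark or metric mentions extracted).

```python
from dataclasses import dataclass, field


@dataclass
class PaperRecord:
    # Hypothetical record shape; field names are assumptions, not the HFEPX schema.
    title: str
    low_signal: bool = False
    benchmarks: list[str] = field(default_factory=list)
    metrics: list[str] = field(default_factory=list)


def coverage(papers, predicate):
    """Return the percentage of papers satisfying predicate (0.0 for an empty slice)."""
    if not papers:
        return 0.0
    return 100.0 * sum(1 for p in papers if predicate(p)) / len(papers)


# The slice on this page: one paper with no benchmark or metric mentions extracted.
slice_papers = [PaperRecord(title="Fine-grained Claim-level RAG Benchmark for Law")]

high_signal = coverage(slice_papers, lambda p: not p.low_signal)
benchmark_anchors = coverage(slice_papers, lambda p: bool(p.benchmarks))
metric_anchors = coverage(slice_papers, lambda p: bool(p.metrics))

print(f"High-signal: {high_signal:.1f}%")
print(f"Benchmark anchors: {benchmark_anchors:.1f}%")
print(f"Metric anchors: {metric_anchors:.1f}%")
```

With a single paper in the slice, every figure can only be 0.0% or 100.0%, which is one reason the slice is best treated as early signal.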
Ranked by protocol completeness and evidence density for faster period-over-period review.
May 20, 2026 · Citations: 0 · Score: 2.5
Eval: Not reported · Metrics: Not reported
Quickly compare method ingredients across this archive slice.
| Paper | Eval Modes | Benchmarks | Metrics | Quality Controls |
|---|---|---|---|---|
| Fine-grained Claim-level RAG Benchmark for Law (May 20, 2026) | Not reported | Not reported | Not reported | Not reported |
Gap: Human feedback
Human feedback is present in 0 of 1 papers.
Gap: Quality controls
Quality controls are present in 0 of 1 papers.
Gap: Benchmarks
Benchmarks are present in 0 of 1 papers.
Gap: Metrics
Metrics are present in 0 of 1 papers.
Strong: Known rater population
Known rater population is present in 1 of 1 papers.
Gap: Known annotation unit
Known annotation unit is present in 0 of 1 papers.
Souvick Das, Sallam Abualhaija, Domenico Bianculli · May 20, 2026 · Citations: 0
Nonetheless, prior work shows that RAG systems, whether general-purpose or legal-specific, still hallucinate at varying rates, making fine-grained evaluation essential.