Skip to content
← Back to explorer

Metric Hub

Agreement Metric Papers

Updated from current HFEPX corpus (Feb 26, 2026). 24 papers are grouped in this metric page. Common evaluation modes: Automatic Metrics, Human Eval. Frequently cited benchmark: contentbench. Common metric signal: agreement. Newest paper in this set is from Feb 24, 2026.

Papers: 24 Last published: Feb 24, 2026 Global RSS

Research Utility Snapshot

Human Feedback Mix

  • Pairwise Preference (3)
  • Expert Verification (2)
  • Rubric Rating (2)
  • Critique Edit (1)

Agentic Evaluation

  • Long Horizon (1)

Top Papers Reporting This Metric

Other Metric Hubs