
Metric Hub

Accuracy Metric Papers

Updated from the current HFEPX corpus (Feb 26, 2026). This metric page groups 218 papers. Common evaluation modes: Automatic Metrics, Simulation Env. Most frequently cited benchmark: retrieval. Common metric signal: accuracy. The newest paper in this set is from Feb 25, 2026.

Papers: 218 · Last published: Feb 25, 2026

Research Utility Snapshot

Human Feedback Mix

  • Expert Verification (8)
  • Pairwise Preference (8)
  • Critique Edit (3)
  • Rubric Rating (3)

Agentic Evaluation

  • Long Horizon (16)
  • Multi Agent (7)
  • Web Browsing (3)
  • Tool Use (1)

Top Papers Reporting This Metric
