Researcher Tools
Human Feedback and Eval Paper Explorer
A focused feed for RLHF, preference data, rater protocols, agent evaluation, and LLM-as-judge research. Every paper includes structured metadata for quick triage.
Filter by tag
No papers matched this filter set. Try removing tags or using a broader query.
Protocol Hubs
Expert Verification Papers (32)
CS.CL + Expert Verification Papers (24)
Pairwise Preference Papers (89)
CS.CL + Pairwise Preference Papers (74)
Coding Papers (69)
CS.CL Human Feedback And Eval Papers (1,020)
CS.AI + Expert Verification Papers (20)
CS.AI Human Feedback And Eval Papers (794)
Expert Verification Or Pairwise Preference Papers (118)
Pairwise Preference Papers (Last 120 Days) (59)
Pairwise Preference Papers (Last 90 Days) (58)
Pairwise Preference Papers (Last 60 Days) (57)
Long Horizon Papers (101)
CS.AI + Pairwise Preference Papers (52)
Expert Verification Or Rubric Rating Papers (50)
CS.CL + Coding Papers (51)
Benchmark Hubs
Metric Hubs
- Accuracy & Pass Rate Metric Papers (88)
- Accuracy Metric Papers (82)
- Accuracy & Pass Rate Metric Papers In CS.CL (63)
- Accuracy & Pass Rate Metric Papers + Automatic Metrics (74)
- Accuracy In CS.CL Papers (58)
- Accuracy & Pass Rate Metric Papers In CS.AI (58)
- Accuracy + Automatic Metrics Metric Papers (70)
- Accuracy + Automatic Metrics Metric Papers (Last 120 Days) (53)
- Accuracy + Automatic Metrics Metric Papers (Last 90 Days) (51)
- Accuracy + Automatic Metrics Metric Papers (Last 30 Days) (47)
Need human evaluators for your AI research? Scale annotation with expert AI Trainers.