OpenTrain Research Tools
Human Feedback and Eval Paper Explorer
A focused feed for RLHF, preference data, rater protocols, agent evaluation, and LLM-as-judge research. Every paper includes structured metadata for quick triage.
Filter by tag
No papers matched this filter set. Try removing tags or using a broader query.
Protocol Hubs
Expert Verification Papers (23)
CS.CL + Pairwise Preference Papers (56)
Pairwise Preference Papers (64)
CS.AI + Pairwise Preference Papers (39)
General + Pairwise Preference Papers (38)
CS.CL + Expert Verification Papers (18)
Automatic Metrics + Pairwise Preference Papers (51)
Expert Verification Or Rubric Rating Papers (36)
CS.CL + Medicine Papers (52)
Automatic Metrics + Expert Verification Papers (19)
Human Eval Papers (36)
CS.CL + Math Papers (71)
CS.CL + Human Eval Papers (33)
Long Horizon Papers (74)
Critique Edit Or Expert Verification Papers (41)
Automatic Metrics + General + Pairwise Preference Papers (29)
Benchmark Hubs
- Retrieval Benchmark Papers (101)
- Retrieval Benchmark Papers (Last 365 Days) (97)
- Retrieval Or MATH Benchmark Papers (131)
- Retrieval Or GSM8K Benchmark Papers (113)
- Retrieval Or MMLU Benchmark Papers (114)
- Retrieval Or DROP Benchmark Papers (114)
- Retrieval Or MATH Or GSM8K Benchmark Papers (140)
- Retrieval Or MATH Or MMLU Benchmark Papers (142)
- Retrieval Or MMLU Or GSM8K Benchmark Papers (124)
- Retrieval Or MATH Or DROP Benchmark Papers (144)