Skip to content
context only
Benchmarks: thin evidence
Time to repro: a few days
2 risk flags

Results & Benchmarks

Freshness tier: cold
Direct + Inferred Evidence

Some benchmark signal exists in the extracted evidence, but it is not structured strongly enough yet for a confident benchmark decision.

End-to-end Neural Coreference Resolution is the primary contribution described in this paper.

Implementation Evidence Summary

Confidence: low

Recommendation evidence is currently too limited for a maintained-repo choice. Use Implementation Status and Reproduction Path for a practical baseline plan.

Reproduction Risks

  • Estimate is based on paper-only reproduction flow

Hardware Notes

Expect multi-day setup/compute for meaningful reproduction based on current guidance.

Evidence disclosure

Evidence graph: 3 refs, 2 links.

Utility signals: depth 95/100, grounding 78/100, status high.

Implementation Status

No verified maintained repo

There is no verified maintained implementation yet. Use this baseline plan to decide whether to prototype now or defer.

  • No direct maintained implementation was found. Use the paper PDF and citation graph to design a baseline reproduction.
  • Track assumptions and missing details in an experiment log before coding.

Reproduction readiness

No Repo
Time to first repro: days
Last checked: Jun 20, 2026

Hardware requirements

  • Expect multi-day setup/compute for meaningful reproduction based on current guidance.

No verified implementation available

  • · No maintained repository has been identified for this paper. Check adjacent implementations or HF artifacts below.

Hugging Face artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.

Datasets

Spaces

Research context

Evaluation & Human Feedback Data

Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.

Open in HFEPX

Need human evaluators for your AI research? Scale annotation with expert AI Trainers.